➤回放视频在这里☟



1.InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks
论文地址:https://arxiv.org/abs/2312.14238
代码地址:
https://github.com/OpenGVLab/InternVL
2.MVBench:A Comprehensive Multi-modal Video Understanding Benchmark
论文地址:https://arxiv.org/abs/2311.17005
代码地址:
https://github.com/OpenGVLab/Ask-Anything/tree/main/video_chat2
3.EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World
论文地址:https://arxiv.org/abs/2403.16182
代码地址:
https://github.com/OpenGVLab/EgoExoLearn
4.Scale Decoupled Distillation
论文地址:https://arxiv.org/abs/2403.13512
代码地址:
https://github.com/shicaiwei123/SDD-CVPR2024
5.Point Transformer V3: Simpler, Faster, Stronger
论文地址:https://arxiv.org/abs/2312.10035
代码地址:https://github.com/Pointcept/Pointcept
6.Language-driven All-in-one Adverse Weather Removal
论文地址:https://arxiv.org/abs/2312.01381
代码地址:https://github.com/noxsine/LDR
7.LDP: Language-driven Dual-Pixel Image Defocus Deblurring Network
论文地址:https://arxiv.org/abs/2307.09815
代码地址:
https://github.com/noxsine/LDP
极市平台专注分享计算机视觉前沿资讯和技术干货,特邀请行业内专业牛人嘉宾为大家分享视觉领域内的干货及经验,目前已成功举办130期线上分享。近期在线分享可点击以下合集查看:
# CV技术社群邀请函 #
备注:姓名-学校/公司-研究方向-城市(如:小极-北大-目标检测-深圳)
即可申请加入极市目标检测/图像分割/工业检测/人脸/医学影像/3D/SLAM/自动驾驶/超分辨率/姿态估计/ReID/GAN/图像增强/OCR/视频理解等技术交流群
每月大咖直播分享、真实项目需求对接、求职内推、算法竞赛、干货资讯汇总、与 10000+来自港科大、北大、清华、中科院、CMU、腾讯、百度等名校名企视觉开发者互动交流~

