A Survey of Video Understanding Resources

中国拖拉机手, 2025-08-19 1:32

CVPR 2025

CVPR 2025 Accepted Papers

https://github.com/BradyFU/Video-MME?tab=readme-ov-file

1. Awesome-LLMs-for-Video-Understanding

https://github.com/yunlong10/Awesome-LLMs-for-Video-Understanding

https://arxiv.org/pdf/2312.17432v4 (revised version, 2024-07)

From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding

https://arxiv.org/pdf/2409.18938

https://github.com/Vincent-ZHQ/LV-LLMs

Papers with Code: Video Understanding

https://paperswithcode.com/task/video-understanding/latest

Awesome-Multimodal-Large-Language-Models

https://github.com/yfzhang114/Awesome-Multimodal-Large-Language-Models?tab=readme-ov-file

MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs

https://arxiv.org/pdf/2411.15296

A 10,000-word summary of the latest progress in multimodal LLM evaluation - article by yearn - Zhihu

https://zhuanlan.zhihu.com/p/16815782175

Another Awesome-Multimodal-Large-Language-Models list

https://github.com/BradyFU/Awesome-Multimodal-Large-Language-Models?tab=readme-ov-file

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models

What are good research directions in multimodal learning? - answer by 梦想成真 - Zhihu

https://www.zhihu.com/question/332876504/answer/130142183129
