Video-CCAM
A lightweight and flexible video multimodal large language model developed by the Tencent QQ Multimedia Research Team.
Common Product | Video | Video Understanding | Multimodal Models
Video-CCAM is a series of flexible video multimodal large language models (Video-MLLMs) developed by the Tencent QQ Multimedia Research Team. It aims to improve video-language understanding and is suited to analyzing both short and long videos. To do so, it applies Causal Cross-Attention Masks (CCAMs). Video-CCAM performs strongly on several benchmarks, notably MVBench, VideoVista, and MLVU. The source code has been rewritten to simplify deployment.
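To make the idea of a causal cross-attention mask more concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the Video-CCAM implementation: the function name `causal_cross_attention_mask`, its parameters, and the rule that aligns queries to frames are all hypothetical. The sketch assumes a set of temporally ordered queries attending to per-frame visual tokens, where each query may see its aligned frame and all earlier frames, but no later ones.

```python
def causal_cross_attention_mask(num_queries, num_frames, tokens_per_frame):
    """Build a boolean mask where mask[q][k] is True if query q may attend to token k.

    Assumptions (illustrative only, not taken from the Video-CCAM code):
    - visual tokens are laid out frame by frame, tokens_per_frame tokens each;
    - queries are ordered in time and spread evenly across the frames;
    - a query may attend to its aligned frame and every earlier frame.
    """
    num_tokens = num_frames * tokens_per_frame
    mask = []
    for q in range(num_queries):
        # Index of the latest frame visible to query q (the last query sees all frames).
        last_frame = ((q + 1) * num_frames - 1) // num_queries
        # Token k belongs to frame k // tokens_per_frame.
        row = [(k // tokens_per_frame) <= last_frame for k in range(num_tokens)]
        mask.append(row)
    return mask


if __name__ == "__main__":
    # 4 queries over 2 frames with 2 tokens per frame.
    for row in causal_cross_attention_mask(4, 2, 2):
        print("".join("1" if allowed else "0" for allowed in row))
```

In an attention layer, positions marked False would typically have their scores set to negative infinity before the softmax, so that a query receives no information from frames later than the one it is aligned with.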
Video-CCAM Visit Over Time
Monthly Visits: 494,758,773
Bounce Rate: 37.69%
Pages per Visit: 5.7
Visit Duration: 00:06:29