Video-CCAM

A lightweight and flexible video multilingual model developed by the Tencent QQ Multimedia Research Team.

CommonProductVideoVideo UnderstandingMultilingual Models
Video-CCAM is a series of flexible video multilingual models (Video-MLLM) developed by the Tencent QQ Multimedia Research Team, aimed at enhancing video-language understanding, particularly suitable for both short and long video analysis. It achieves this through Causal Cross-Attention Masks. Video-CCAM has shown outstanding performance across multiple benchmark tests, especially in MVBench, VideoVista, and MLVU. The source code has been rewritten to streamline the deployment process.
Visit

Video-CCAM Visit Over Time

Monthly Visits

499904316

Bounce Rate

37.31%

Page per Visit

5.8

Visit Duration

00:06:52

Video-CCAM Visit Trend

Video-CCAM Visit Geography

Video-CCAM Traffic Sources

Video-CCAM Alternatives