AIbase
Product LibraryTool Navigation

VideoGLaMM

Public

[CVPR 2025 ?]A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

Creat2024-10-31T20:00:44
Update2025-03-24T17:59:00
https://mbzuai-oryx.github.io/VideoGLaMM/
52
Stars
0
Stars Increase