SlowFast-LLaVA

A training-free large language model for video understanding and reasoning.

Common Product, Productivity, Video Question Answering, Multimodal Learning
SlowFast-LLaVA is a training-free multimodal large language model for video understanding and reasoning. Across a range of video question-answering tasks and benchmarks, it performs comparably to or better than state-of-the-art video large language models, without fine-tuning on any data.

SlowFast-LLaVA Visit Over Time

Monthly Visits: 503,747,431
Bounce Rate: 37.31%
Pages per Visit: 5.7
Visit Duration: 00:06:44
