VQAScore
VQAScore, a novel evaluation metric and benchmark for text-to-vision generation, is introduced. VQAScore, based on the CLIP-FlanT5 model, achieves state-of-the-art performance in evaluating text-to-image/video/3D generation. It serves as a powerful alternative to CLIPScore. GenAI-Bench, a benchmark dataset, provides real-world testing texts with rich semantic combinations, allowing for a comprehensive assessment of generative model performance.
CommonProductImageText generationVision generation
VQAScore Visit Over Time
Monthly Visits
4079
Bounce Rate
52.73%
Page per Visit
1.2
Visit Duration
00:00:07