VQAScore

VQAScore is an evaluation metric for text-to-vision generation, introduced together with the GenAI-Bench benchmark. Built on the CLIP-FlanT5 model, it achieves state-of-the-art performance in evaluating text-to-image, text-to-video, and text-to-3D generation, and serves as a strong alternative to CLIPScore. GenAI-Bench is a benchmark dataset of real-world test prompts with rich semantic combinations, allowing a comprehensive assessment of generative model performance.

Common Product, Image, Text generation, Vision generation

VQAScore Visit Over Time

Monthly Visits: 2146
Bounce Rate: 61.50%
Pages per Visit: 1.0
Visit Duration: 00:00:00
