Griffon
High-resolution multi-modal perception LVLM
Common Product, Image, Multi-modal, High-resolution
Griffon is the first high-resolution (over 1K) LVLM with localization capabilities, able to describe any region of interest in an image. Its latest version supports visual-language grounding, and it accepts an image or text descriptions as input. Griffon excels at referring expression comprehension (REC), object detection, object counting, visual/phrase localization, and referring expression generation (REG). Pricing: Free trial.
Griffon Visits Over Time
Monthly Visits
488643166
Bounce Rate
37.28%
Pages per Visit
5.7
Visit Duration
00:06:37