Griffon is the first high-resolution (over 1K) LVLM with localization capabilities, able to describe everything in the region of your interest. In its latest version, Griffon supports visual language grounding. You can input an image or some descriptions. Griffon excels in REC, object detection, object counting, visual/phrase localization, and REG. Pricing: Free trial.