Ferret-UI-Llama8b

A multimodal large language model based on Llama-3-8B, focused on UI tasks.

CommonProductProgrammingMultimodalLarge Language Model
Ferret-UI is the first multimodal large language model (MLLM) centered on user interfaces, specifically designed for gesture expression, localization, and reasoning tasks. Built on Gemma-2B and Llama-3-8B, it is capable of performing complex user interface tasks. This version aligns with Apple's research paper and serves as a powerful tool for image-to-text tasks, excelling in dialogue and text generation.
Visit

Ferret-UI-Llama8b Visit Over Time

Monthly Visits

19075321

Bounce Rate

45.07%

Page per Visit

5.5

Visit Duration

00:05:32

Ferret-UI-Llama8b Visit Trend

Ferret-UI-Llama8b Visit Geography

Ferret-UI-Llama8b Traffic Sources

Ferret-UI-Llama8b Alternatives