Aya Vision 8B

An 800-million parameter multilingual vision-language model supporting OCR, image captioning, visual reasoning, and more.

CommonProductImageMultilingualVision-Language Model
CohereForAI's Aya Vision 8B is an 800-million parameter multilingual vision-language model optimized for various visual language tasks, supporting OCR, image captioning, visual reasoning, summarization, and question answering. Based on the C4AI Command R7B language model and incorporating the SigLIP2 visual encoder, it supports 23 languages and features a 16K context length. Key advantages include multilingual support, powerful visual understanding capabilities, and broad applicability. Released with open-source weights, it aims to advance the global research community. Users must adhere to C4AI's acceptable use policy under the CC-BY-NC license.
Visit

Aya Vision 8B Visit Over Time

Monthly Visits

26103677

Bounce Rate

43.69%

Page per Visit

5.5

Visit Duration

00:04:43

Aya Vision 8B Visit Trend

Aya Vision 8B Visit Geography

Aya Vision 8B Traffic Sources

Aya Vision 8B Alternatives