Pixtral-12B-2409

A multimodal model with 12 billion parameters, integrating a visual encoder for image and text processing.

CommonProductProductivityMultimodalImage Processing
Pixtral-12B-2409 is a multimodal model developed by the Mistral AI team, featuring a 12 billion parameter multimodal decoder and a 400 million parameter visual encoder. The model excels in multimodal tasks, supports images of varying sizes, and maintains cutting-edge performance on text benchmarks. It is suitable for advanced applications requiring the processing of image and text data, such as image description generation and visual question answering.
Visit

Pixtral-12B-2409 Visit Over Time

Monthly Visits

20899836

Bounce Rate

46.04%

Page per Visit

5.2

Visit Duration

00:04:57

Pixtral-12B-2409 Visit Trend

Pixtral-12B-2409 Visit Geography

Pixtral-12B-2409 Traffic Sources

Pixtral-12B-2409 Alternatives