Pixtral-12B-2409
A multimodal model with 12 billion parameters, integrating a visual encoder for image and text processing.
CommonProductProductivityMultimodalImage Processing
Pixtral-12B-2409 is a multimodal model developed by the Mistral AI team, featuring a 12 billion parameter multimodal decoder and a 400 million parameter visual encoder. The model excels in multimodal tasks, supports images of varying sizes, and maintains cutting-edge performance on text benchmarks. It is suitable for advanced applications requiring the processing of image and text data, such as image description generation and visual question answering.
Pixtral-12B-2409 Visit Over Time
Monthly Visits
19075321
Bounce Rate
45.07%
Page per Visit
5.5
Visit Duration
00:05:32