Recently, French AI startup Mistral AI announced its new generation of language models, Ministral 3B and Ministral 8B.

These new models are part of the "Ministraux" series, designed specifically for edge devices and on-device computing scenarios, and support a context length of up to 128,000 tokens. In other words, they combine strong processing capability with suitability for situations where data privacy and local processing are particularly important.


Mistral states that the Ministraux series models are well-suited for a range of applications, such as local translation, offline smart assistants, data analysis, and autonomous robotics. To further enhance efficiency, the Ministraux models can also be combined with larger language models (like Mistral Large) as effective intermediaries in multi-step workflows.

In terms of performance, Mistral's benchmark tests show that Ministral 3B and 8B outperform comparable models, such as Google's Gemma 2 2B and Meta's Llama 3.1 8B, in multiple categories. Notably, despite having fewer parameters, Ministral 3B beats its predecessor, Mistral 7B, in certain tests.

Ministral 8B, for its part, leads across the reported tests, particularly in knowledge, common-sense reasoning, function calling, and multilingual capability.

Regarding pricing, the new models from Mistral AI are already available via API. Ministral 8B costs $0.10 per million tokens, while Ministral 3B costs $0.04. Additionally, Mistral offers the model weights for Ministral 8B Instruct for research purposes. The new models will also soon be available through cloud partners such as Google Vertex AI and AWS.
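To put those per-million-token rates in perspective, here is a minimal cost-estimation sketch. The model identifiers (`ministral-8b-latest`, `ministral-3b-latest`) are assumptions for illustration, not confirmed API names; only the two dollar figures come from the announcement above.

```python
# Hypothetical sketch: estimate API spend at the announced rates.
# Model IDs below are illustrative assumptions, not confirmed API names.
PRICE_PER_MILLION_USD = {
    "ministral-8b-latest": 0.10,  # $0.10 per 1M tokens (from the announcement)
    "ministral-3b-latest": 0.04,  # $0.04 per 1M tokens (from the announcement)
}

def estimate_cost(model: str, tokens: int) -> float:
    """Return the estimated USD cost for processing `tokens` tokens."""
    return PRICE_PER_MILLION_USD[model] * tokens / 1_000_000

# Example: a workload of 50 million tokens per month on the 8B model
print(estimate_cost("ministral-8b-latest", 50_000_000))  # → 5.0
```

At these rates, even a fairly heavy 50M-token monthly workload on the 8B model would cost about $5, which is the kind of margin that makes the "small model as intermediary" workflow mentioned earlier economical.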

Key Points:

- 🚀 Mistral AI introduces Ministral 3B and 8B, supporting up to 128,000 tokens of context length.

- 💡 These models are suitable for local translation, offline assistants, data analysis, and autonomous robotics.

- 💰 Pricing: Ministral 8B costs $0.10 per million tokens; Ministral 3B costs $0.04.