French AI startup Mistral AI has introduced two new lightweight models, Ministral 3B and Ministral 8B (together branded "les Ministraux"), designed specifically for edge devices, with 3 billion and 8 billion parameters respectively. Both perform strongly on instruction-following benchmarks: Ministral 3B surpasses Llama 3 8B and Mistral 7B, while Ministral 8B outperforms those models in every area except coding.
Test results indicate that Ministral 3B and Ministral 8B are competitive with open-source models such as Gemma 2 and Llama 3.1. Both models support context lengths of up to 128k tokens and set new benchmarks in knowledge, common sense, reasoning, function calling, and efficiency among models under 10B parameters. Ministral 8B additionally uses a sliding-window attention mechanism for faster, more memory-efficient inference. Both can be fine-tuned for a range of use cases, such as orchestrating complex AI agent workflows or building specialized task assistants.
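To illustrate the idea behind sliding-window attention (this is a minimal sketch of the general technique, not Mistral's actual implementation): each token attends only to the previous `window` tokens rather than the full context, so attention memory scales with the window size instead of the sequence length.

```python
# Sketch: build a causal sliding-window attention mask.
# True at (i, j) means query position i may attend to key position j.
def sliding_window_mask(seq_len: int, window: int) -> list[list[bool]]:
    return [
        [(j <= i) and (i - j < window) for j in range(seq_len)]
        for i in range(seq_len)
    ]

mask = sliding_window_mask(seq_len=6, window=3)
# With window=3, token 5 attends only to positions 3, 4, and 5,
# not to the whole 6-token prefix.
```

Stacking several such layers still lets information propagate beyond the window, which is how a limited window can serve a long context.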
Mistral benchmarked the les Ministraux models across knowledge and common sense, code, math, and multilingual tasks. Among base (pre-trained) models, Ministral 3B outperformed Gemma 2 2B and Llama 3.2 3B, while Ministral 8B beat Llama 3.1 8B and Mistral 7B in every area except coding. Among instruction-tuned models, Ministral 3B achieved the best results across the benchmarks, while Ministral 8B trailed Gemma 2 9B only on WildBench.
The les Ministraux models give users a computationally efficient, low-latency option, meeting growing demand for local-first inference in critical applications. Target scenarios include on-device translation, offline smart assistants, and autonomous robots. Ministral 8B is priced at $0.10 per million tokens, and Ministral 3B at $0.04 per million tokens.
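Using the quoted prices, estimating API cost for a workload is simple arithmetic (the model names and function below are illustrative, not Mistral's API):

```python
# Price per million tokens, from the figures quoted above.
PRICE_PER_MILLION_USD = {
    "ministral-8b": 0.10,
    "ministral-3b": 0.04,
}

def cost_usd(model: str, tokens: int) -> float:
    """Estimated cost in USD for processing `tokens` tokens."""
    return tokens / 1_000_000 * PRICE_PER_MILLION_USD[model]

# A 5-million-token workload costs $0.50 on Ministral 8B
# and $0.20 on Ministral 3B.
print(cost_usd("ministral-8b", 5_000_000))
print(cost_usd("ministral-3b", 5_000_000))
```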
It's worth noting that Mistral previously earned goodwill in the AI community by open-sourcing several models via magnet links, but the company has drawn criticism this year for becoming less open. Microsoft is reported to be taking a stake in and investing in Mistral, with Mistral's models to be hosted on Azure AI. Reddit users have noticed that Mistral removed its open-source commitment from its official website, and some of its models are now paid offerings, including the newly released Ministral 3B and Ministral 8B.
Details: https://mistral.ai/news/ministraux/