Recently, Answer.AI and LightOn jointly released the open-source language model ModernBERT, which is a significant upgrade over Google's BERT. According to the developers, ModernBERT has made remarkable improvements in processing speed, efficiency, and quality. The model can operate four times faster than its predecessor while using less memory.

The design of ModernBERT allows it to handle texts of up to 8192 tokens, a 16-fold increase over the typical 512-token limit of existing encoder models. Additionally, ModernBERT is the first encoder model trained extensively on programming code, scoring over 80 on the StackOverflow Q&A dataset and setting a new record for encoder models on that benchmark.


On the General Language Understanding Evaluation (GLUE) benchmark, ModernBERT-Large strikes a strong balance between speed and accuracy, processing tokens in about 20 milliseconds each while scoring 90. The development team likens ModernBERT to a finely tuned Honda Civic, emphasizing its reliability and efficiency in everyday applications.

Compared to existing large language models like GPT-4, ModernBERT significantly reduces the cost of large-scale text processing. Each GPT-4 query costs several cents, whereas ModernBERT can run locally, making it both faster and cheaper. For example, the FineWeb Edu project spent $60,000 filtering 15 billion tokens with a BERT model; doing the same with Google's Gemini Flash decoder would have cost over $1 million.
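A quick back-of-the-envelope calculation with the figures above shows the per-token gap between the two approaches:

```python
# Figures quoted above for the FineWeb Edu filtering job.
tokens_processed = 15_000_000_000   # 15 billion tokens

bert_total_cost = 60_000            # USD, BERT-based filtering
gemini_total_cost = 1_000_000       # USD, quoted Gemini Flash estimate (lower bound)

# Normalize both to cost per million tokens.
bert_per_million = bert_total_cost / tokens_processed * 1_000_000
gemini_per_million = gemini_total_cost / tokens_processed * 1_000_000

print(f"BERT filtering:   ${bert_per_million:.2f} per million tokens")    # → $4.00
print(f"Gemini Flash:    >${gemini_per_million:.2f} per million tokens")  # → >$66.67
```

At these rates the encoder approach is roughly 16x cheaper per token, before even counting the option of running ModernBERT on hardware you already own.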

The development team states that ModernBERT is well-suited for various practical applications, including retrieval-augmented generation (RAG) systems, code search, and content moderation. Unlike GPT-4, which requires specialized hardware, ModernBERT can run efficiently on standard consumer-grade gaming GPUs.
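As a rough illustration of how an encoder model slots into a RAG retriever, one common pattern is mean-pooled embeddings compared by cosine similarity. This is a sketch, not the team's own recipe; it assumes the `transformers` library (ModernBERT support landed in recent releases) and the `answerdotai/ModernBERT-base` model id from the Hugging Face release:

```python
import torch
from transformers import AutoTokenizer, AutoModel

model_id = "answerdotai/ModernBERT-base"  # model id from the Hugging Face release
tok = AutoTokenizer.from_pretrained(model_id)
enc = AutoModel.from_pretrained(model_id)

def embed(texts):
    """Mean-pool the last hidden states into one unit vector per text."""
    batch = tok(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        hidden = enc(**batch).last_hidden_state        # (batch, seq, dim)
    mask = batch["attention_mask"].unsqueeze(-1)       # zero out padding positions
    pooled = (hidden * mask).sum(1) / mask.sum(1)
    return torch.nn.functional.normalize(pooled, dim=1)

docs = ["ModernBERT handles sequences of up to 8192 tokens.",
        "The Honda Civic is a compact car."]
query_vec = embed(["How long a context can ModernBERT encode?"])
doc_vecs = embed(docs)
scores = query_vec @ doc_vecs.T                        # cosine similarities
print(docs[scores.argmax()])
```

In a real RAG system the document vectors would be precomputed and stored in a vector index; the per-query work is just one encoder forward pass plus a similarity search.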

Currently, ModernBERT comes in two versions: a base model with 139 million parameters and a large version with 395 million parameters. Both are now available on Hugging Face, and users can directly replace their existing BERT models with them. The development team plans to release a larger version next year but has no plans for multimodal capabilities. To encourage new applications, they have also launched a competition that will reward the creators of the top five demos with $100 and a six-month Hugging Face Pro subscription.
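Because both checkpoints live on Hugging Face, swapping an existing BERT pipeline for ModernBERT is, in principle, a one-line change of the model id. A minimal sketch, assuming the `transformers` library (a recent version with ModernBERT support) and the `answerdotai/ModernBERT-base` checkpoint from the release:

```python
from transformers import pipeline

# Before: a classic BERT fill-mask pipeline.
# fill = pipeline("fill-mask", model="bert-base-uncased")

# After: identical code, pointing at the ModernBERT checkpoint
# (model id from the Hugging Face release).
fill = pipeline("fill-mask", model="answerdotai/ModernBERT-base")

preds = fill("The capital of France is [MASK].")
for p in preds:
    print(p["token_str"], round(p["score"], 3))
```

Fine-tuned classifiers and embedding models built on top of BERT can be migrated the same way, by retraining the task head against the new backbone.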

Since Google launched BERT in 2018, it has been one of the most popular language models, with over 68 million downloads per month on Hugging Face.

Project link: https://huggingface.co/blog/modernbert

Key Points:

🌟 ModernBERT is four times faster than BERT and can handle texts of up to 8192 tokens.

💰 Compared to GPT-4, ModernBERT significantly reduces costs for large-scale text processing and operates more efficiently.

📊 The model excels at handling programming code, scoring over 80 on the StackOverflow Q&A dataset, setting a new record.