ByteDance's Doubao large model team recently announced the open-sourcing of Multi-SWE-bench, the industry's first multilingual code repair benchmark dataset, aimed at evaluating and improving large models' "automatic bug fixing" capabilities.

With the rapid development of large model technology, code generation tasks have become a key area for testing model intelligence. While code repair benchmarks like SWE-bench can measure a model's programming intelligence, they have significant limitations. They focus solely on Python, failing to assess cross-lingual generalization capabilities. Furthermore, their limited task difficulty restricts the evaluation of large models in complex development scenarios, hindering further advancements in code intelligence.

Figure: Code Ability Scores for Different Models

Multi-SWE-bench addresses these limitations. Building upon SWE-bench, it significantly expands coverage to include seven mainstream programming languages: Java, TypeScript, C, C++, Go, Rust, and JavaScript. It comprises 1632 repair tasks sourced from real-world open-source repositories. These tasks have undergone rigorous screening and manual verification to ensure reliability. Multi-SWE-bench also introduces a difficulty level system (easy, medium, hard), enabling a more comprehensive evaluation of model performance across different skill levels.
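
To make the task format concrete, here is a minimal sketch (not the official loader) that reads a hypothetical local JSONL export of the benchmark and tallies instances by language and difficulty. The file name and field names ("language", "difficulty") are illustrative assumptions, not the dataset's confirmed schema.

```python
# Minimal sketch: tally Multi-SWE-bench-style instances by language and difficulty.
# The JSONL file name and field names below are assumptions for illustration only.
import json
from collections import Counter


def load_instances(path: str) -> list[dict]:
    """Load one benchmark instance per JSONL line."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]


def summarize(instances: list[dict]) -> Counter:
    """Count instances per (language, difficulty) pair."""
    return Counter((inst["language"], inst["difficulty"]) for inst in instances)


if __name__ == "__main__":
    tasks = load_instances("multi_swe_bench.jsonl")  # hypothetical local export
    for (lang, level), n in sorted(summarize(tasks).items()):
        print(f"{lang:12s} {level:6s} {n:4d}")
```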

Experiments using this dataset show that current large language models perform reasonably well on Python repair tasks, but their average repair rate for other languages is less than 10%, highlighting the challenge of multilingual code repair for large models.

Some mainstream models perform well in Python but fall significantly short in other languages. Furthermore, models' repair rates decrease as task difficulty increases.

To support the application of reinforcement learning in automatic programming, the team also open-sourced Multi-SWE-RL. It provides 4,723 instances with reproducible Docker environments that support one-click startup and automatic evaluation, creating a standardized data foundation for RL training. Additionally, the team has launched an open-source community initiative, inviting developers and researchers to participate in dataset expansion, new method evaluation, and other efforts to jointly advance the RL for Code ecosystem.
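
To illustrate how such containerized instances can back an RL training loop, below is a minimal sketch of a reward step, assuming each instance ships a prebuilt Docker image and a shell test command. The image name, in-container repo path (/workspace), and test command are illustrative assumptions, not the released environment's actual interface.

```python
# Minimal sketch of a Multi-SWE-RL-style reward step: apply a model-generated
# patch inside the instance's container, run its tests, and map the outcome to
# a binary reward. Image names, paths, and test commands are assumptions.
import subprocess
import tempfile
from pathlib import Path


def evaluate_patch(image: str, test_cmd: str, patch: str, timeout: int = 1800) -> float:
    """Return 1.0 if the patched repository passes its tests, else 0.0."""
    with tempfile.TemporaryDirectory() as tmp:
        patch_file = Path(tmp) / "model.patch"
        patch_file.write_text(patch, encoding="utf-8")
        cmd = [
            "docker", "run", "--rm",
            "-v", f"{patch_file}:/tmp/model.patch:ro",
            image,
            "bash", "-lc",
            f"cd /workspace && git apply /tmp/model.patch && {test_cmd}",
        ]
        result = subprocess.run(cmd, capture_output=True, timeout=timeout)
        return 1.0 if result.returncode == 0 else 0.0


# Example call (hypothetical image and test command):
# reward = evaluate_patch("multi-swe-rl/rust-instance-0421", "cargo test", patch_text)
```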

The ByteDance Doubao large model team hopes that Multi-SWE-bench will propel automatic programming technology to new heights. They plan to continue expanding its coverage to help large models make greater strides in the field of "automated software engineering."