Bug fixing is a perennial headache in software development. ByteDance's Doubao large model team has now released Multi-SWE-bench, the first multilingual software engineering (SWE) benchmark dataset, built to evaluate and improve large models' ability to automatically fix code errors.
Compared with previous monolingual datasets, Multi-SWE-bench significantly expands the scope of evaluation. In addition to Python, it covers seven other mainstream programming languages: Java, Go, Rust, C, C++, TypeScript, and JavaScript, making it a genuinely "full-stack" engineering benchmark from which developers working in any of these languages can benefit.
The dataset's construction process is also noteworthy. Multi-SWE-bench contains 1,632 real-world bug-fixing instances, all sourced from GitHub issue reports. To ensure quality, each instance was screened through standardized testing and review by professional developers, so that every sample comes with a clear problem description, a valid fix patch, and a reproducible test environment.
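To make that structure concrete, the sketch below shows what a single instance of this kind could look like in code. The class name and field names (repo, issue_description, fix_patch, test_command) are illustrative assumptions based on the description above, not the dataset's published schema.

```python
# Illustrative sketch only: the fields below are assumptions about what a
# Multi-SWE-bench-style instance could contain, inferred from the article's
# description (issue report, fix patch, reproducible test setup). They are
# not the dataset's actual published schema.
from dataclasses import dataclass


@dataclass
class SweBenchInstance:
    repo: str               # GitHub repository the issue was filed against
    language: str            # one of the eight covered languages, e.g. "Go"
    issue_description: str   # the original issue report text
    fix_patch: str           # the reviewed patch (unified diff) that resolves the issue
    test_command: str        # command run inside the reproducible environment to verify the fix


# Hypothetical placeholder record; real instances are drawn from actual GitHub issues.
example = SweBenchInstance(
    repo="example-org/example-repo",
    language="Go",
    issue_description="Panic when parsing an empty config file",
    fix_patch="--- a/config.go\n+++ b/config.go\n...",
    test_command="go test ./...",
)
```

In an evaluation setting, a model would be given the issue description, asked to produce its own patch, and judged by whether the instance's tests pass after the patch is applied in the reproducible environment.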
The Doubao team hopes the new dataset will enable systematic evaluation of large models across mainstream programming languages and real-world code environments, pushing their automatic programming capabilities in a more practical, engineering-oriented direction. Such progress would not only save developers time but also improve the efficiency and quality of software development.
In practice, bug fixing is not just a technical problem; it is also a major factor in project schedules and team morale. The launch of Multi-SWE-bench may therefore prove a crucial step toward automated software engineering.
This new dataset from ByteDance marks a significant step forward in automated code repair and promises to make life easier for developers worldwide.