MAG-SQL: Improving Text-to-SQL Conversion Accuracy to 61% Using Multi-Agent Generative Methods

AIbase基地

Published in AI News · 4 min read · Aug 20, 2024

In Natural Language Processing (NLP), Text-to-SQL technology is evolving rapidly. It lets ordinary users query databases in natural language without having to learn SQL, a specialized query language. As database schemas grow more complex, however, accurately converting natural language into SQL statements remains a significant challenge.

Researchers from South China University of Technology and Tsinghua University recently proposed a new solution, MAG-SQL (a multi-agent generative approach), which aims to make Text-to-SQL more effective. The method has multiple intelligent agents work together to improve the accuracy of SQL generation.

MAG-SQL has four core components: the "Soft Schema Linker," the "Targets-Conditions Decomposer," the "Sub-SQL Generator," and the "Sub-SQL Refiner." First, the Soft Schema Linker selects the database columns most relevant to the question, which cuts down on irrelevant information and improves the accuracy of the generated SQL. Next, the Targets-Conditions Decomposer breaks a complex question into smaller sub-questions that are easier to handle.

The Sub-SQL Generator then writes a sub-SQL query for each sub-question, building on the result of the previous step so that the SQL is refined step by step. Finally, the Sub-SQL Refiner corrects errors in the generated SQL, further improving overall accuracy. This multi-step approach is what allows MAG-SQL to perform well on complex databases.
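The four stages described above can be sketched as a small pipeline. This is only an illustrative sketch, not the authors' implementation: every function name here (`link_schema`, `decompose`, `run_pipeline`, `fake_generate`, `fake_refine`) is hypothetical, the schema linker and decomposer are toy string heuristics standing in for LLM agents, and the cumulative sub-question scheme is an assumption about how the decomposition might work.

```python
from typing import Callable, Dict, List

Schema = Dict[str, List[str]]                 # table name -> column names
Generate = Callable[[str, Schema, str], str]  # (sub-question, linked schema, previous SQL) -> SQL
Refine = Callable[[str, str], str]            # (sub-question, SQL) -> corrected SQL


def link_schema(question: str, schema: Schema) -> Schema:
    """Toy schema linker: keep only columns whose name occurs as a word in the question."""
    words = set(question.lower().replace("?", "").split())
    linked: Schema = {}
    for table, columns in schema.items():
        hits = [c for c in columns if c.lower() in words]
        if hits:
            linked[table] = hits
    return linked


def decompose(question: str) -> List[str]:
    """Toy decomposer: split on ' and ', then build cumulative sub-questions."""
    parts = [p.strip() for p in question.rstrip("?").split(" and ")]
    return [" and ".join(parts[: i + 1]) for i in range(len(parts))]


def run_pipeline(question: str, schema: Schema, generate: Generate,
                 refine: Refine, max_rounds: int = 2) -> str:
    """Link the schema, then generate and correct one sub-SQL per sub-question."""
    linked = link_schema(question, schema)
    sql = ""
    for sub_question in decompose(question):
        sql = generate(sub_question, linked, sql)
        for _ in range(max_rounds):
            fixed = refine(sub_question, sql)
            if fixed == sql:  # nothing left to correct
                break
            sql = fixed
    return sql


# Hypothetical stand-ins for the LLM-backed generator and corrector.
def fake_generate(sub_question: str, linked: Schema, previous_sql: str) -> str:
    if not previous_sql:
        return "SELECT name FROM student WHERE age > 20;"
    return previous_sql + " AND age < 30;"


def fake_refine(sub_question: str, sql: str) -> str:
    return sql.replace(";", "")  # pretend the only error is a stray semicolon


schema = {"student": ["name", "age"], "course": ["title"]}
question = "What is the name of students with age over 20 and age under 30?"
final_sql = run_pipeline(question, schema, fake_generate, fake_refine)
print(final_sql)
```

In a real system the two stubs would be replaced by calls to a language model; the loop structure is the part meant to illustrate step-by-step generation with a correction pass after each step.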

In the reported tests, MAG-SQL shows strong results on the BIRD dataset. With GPT-4, the system achieved an execution accuracy of 61.08%, well above the 46.35% of GPT-4 used on its own. Even with GPT-3.5, MAG-SQL reached 57.62%, surpassing the earlier MAC-SQL method. MAG-SQL also performed well on Spider, another complex dataset, which suggests that the approach generalizes.
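For readers unfamiliar with the metric, execution accuracy is the share of predicted queries whose execution result matches that of the reference query. The sketch below shows one simple way to compute it with Python's built-in `sqlite3` module. It assumes results are compared as unordered sets of rows and that a query that fails to execute counts as wrong; the helper names and the toy `student` table are hypothetical, and this is not the official BIRD evaluation script.

```python
import sqlite3
from typing import List, Tuple


def execution_match(conn: sqlite3.Connection, predicted_sql: str, gold_sql: str) -> bool:
    """True if both queries return the same set of rows (order ignored)."""
    try:
        predicted_rows = conn.execute(predicted_sql).fetchall()
    except sqlite3.Error:
        return False  # a query that does not run counts as a miss
    gold_rows = conn.execute(gold_sql).fetchall()
    return set(predicted_rows) == set(gold_rows)


def execution_accuracy(conn: sqlite3.Connection, pairs: List[Tuple[str, str]]) -> float:
    """Fraction of (predicted, gold) pairs whose execution results match."""
    if not pairs:
        return 0.0
    hits = sum(execution_match(conn, predicted, gold) for predicted, gold in pairs)
    return hits / len(pairs)


# Toy in-memory database used only for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE student (name TEXT, age INTEGER)")
conn.executemany("INSERT INTO student VALUES (?, ?)", [("Ann", 21), ("Bob", 19)])

pairs = [
    # Different SQL text, same result: counts as correct.
    ("SELECT name FROM student WHERE age > 20", "SELECT name FROM student WHERE age >= 21"),
    # Runs, but returns extra rows: counts as wrong.
    ("SELECT name FROM student", "SELECT name FROM student WHERE age > 20"),
    # Misspelled column, fails to execute: counts as wrong.
    ("SELECT nme FROM student", "SELECT name FROM student"),
]
score = execution_accuracy(conn, pairs)
print(f"execution accuracy: {score:.2%}")
```

Because the metric compares results and not SQL text, two differently written queries can both be counted as correct, as the first pair illustrates.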

MAG-SQL not only improves Text-to-SQL accuracy but also offers a new way to approach complex queries. Through iterative refinement, the multi-agent framework makes large language models more capable in practical applications, especially on complex databases and difficult queries.

Paper link: https://arxiv.org/pdf/2408.07930

Key points:

📊 Accuracy Improvement: MAG-SQL achieved an execution accuracy of 61.08% on the BIRD dataset, well above the 46.35% of GPT-4 used on its own.

🔍 Multi-Agent Collaboration: The method has multiple agents work together, making SQL generation more efficient and accurate.

💡 Broad Application Potential: MAG-SQL also performed well on the Spider dataset, indicating that it applies beyond a single benchmark.

This article is from AIbase Daily
