Following the release of GPT-4, multimodal large language models (MLLMs) became a hot research topic. The team led by Ma Yi proposed the EMT framework, the first systematic evaluation of catastrophic forgetting in MLLMs after fine-tuning. Experiments showed that while fine-tuning an MLLM improved performance on the fine-tuning dataset, it also degraded performance on other datasets: the fine-tuned models generated hallucinated text related to the fine-tuning dataset while ignoring the question actually being asked. The work provides a framework and benchmarks for follow-up research, and indicates that further optimization of model design and training techniques is needed to balance the trade-offs between different capabilities.
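To make the evaluation idea concrete, below is a minimal sketch of an EMT-style measurement loop: the MLLM is treated as an image classifier and prompted to name the class of each image from a held-out benchmark, before and after fine-tuning. The `query_mllm` function, the exact prompt wording, and the substring-based grading are all assumptions for illustration; the actual framework's prompts and answer-grading procedure differ in detail.

```python
# Hypothetical stand-in for an MLLM call: the model receives an image plus a
# text prompt and returns free-form text. Replace with a real multimodal
# model API to run an actual evaluation.
def query_mllm(image, prompt: str) -> str:
    raise NotImplementedError("plug in a real multimodal model here")

# Example label set (CIFAR-10); EMT-style evaluation would repeat this over
# several standard vision benchmarks.
CLASS_NAMES = ["airplane", "automobile", "bird", "cat", "deer",
               "dog", "frog", "horse", "ship", "truck"]

def emt_style_accuracy(dataset, class_names) -> float:
    """Treat the MLLM as a classifier over `dataset`, an iterable of
    (image, integer_label) pairs, and return its accuracy."""
    prompt = (
        "What is the object in the image? "
        f"Answer with one option from: {', '.join(class_names)}."
    )
    correct = 0
    total = 0
    for image, label in dataset:
        answer = query_mllm(image, prompt).lower()
        mentioned = [c for c in class_names if c in answer]
        # Count as correct only if the sole class mentioned is the true one;
        # answers naming several classes are treated as hallucinated.
        # (A simplification: the paper grades outputs with another LLM.)
        if mentioned == [class_names[label]]:
            correct += 1
        total += 1
    return correct / total
```

Comparing `emt_style_accuracy` on the same benchmarks before and after fine-tuning quantifies the forgetting described above: accuracy rises on the fine-tuning dataset while dropping elsewhere.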