What happens when an AI model is retrained on images it generated itself? Researchers from Stanford University and the University of California, Berkeley recently ran an experiment to find out, and the results were surprising.

The researchers found that when AI image generation models were retrained using images they themselves generated, these models produced highly distorted images. Worse still, this distortion was not limited to the text prompts used for retraining; once a model was "contaminated," it was difficult to fully recover, even if only real images were used for subsequent retraining.

The experiment was built around the open-source Stable Diffusion (SD) model. The researchers first selected 70,000 high-quality face images from the FFHQ face dataset and automatically categorized them. They then used Stable Diffusion to generate 900 images consistent with specific demographic characteristics.
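As a rough illustration of this generation step, here is a minimal sketch using the Hugging Face `diffusers` library. The checkpoint name, prompt, and sampling settings are assumptions for illustration; the paper does not prescribe this exact code.

```python
import torch
from diffusers import StableDiffusionPipeline

# Load an off-the-shelf Stable Diffusion checkpoint (the model ID here is an
# assumption; the authors' exact checkpoint and settings may differ).
pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

# Hypothetical prompt describing a specific demographic group, mirroring the
# "images consistent with specific demographic characteristics" step.
prompt = "a photograph of a middle-aged woman, plain background"
images = pipe(prompt, num_images_per_prompt=4).images  # list of PIL images

for i, img in enumerate(images):
    img.save(f"generated_{i}.png")
```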

Next, the researchers used these generated images for iterative retraining of the model. They found that regardless of the proportion of self-generated images in the retraining dataset, the model would eventually collapse, with a sharp decline in the quality of the generated images. Even when the retraining dataset contained only 3% self-generated images, the phenomenon of model collapse persisted.
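The retraining loop can be summarized with the sketch below. `finetune` and `generate` are hypothetical callables standing in for Stable Diffusion fine-tuning and sampling, which are not shown; only the data-mixing logic is concrete.

```python
import random

def iterative_retrain(real_images, finetune, generate,
                      iterations=5, synthetic_fraction=0.03):
    """Sketch of the self-consuming retraining loop described in the paper.

    `finetune` and `generate` are hypothetical placeholders for the actual
    Stable Diffusion fine-tuning and sampling steps.
    """
    dataset = list(real_images)
    model = None
    for _ in range(iterations):
        # Retrain the model on the current mix of real and self-generated data.
        model = finetune(dataset)
        # Sample fresh synthetic images from the newly retrained model.
        synthetic = generate(model, len(dataset))
        # Replace a fraction of the real data with self-generated images;
        # even a 3% fraction was enough to trigger collapse in the experiments.
        k = int(synthetic_fraction * len(dataset))
        dataset = (random.sample(list(real_images), len(dataset) - k)
                   + random.sample(synthetic, k))
    return model
```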


The experimental results showed that the baseline version of the Stable Diffusion model generated images that were consistent with text prompts and had high visual quality. However, after iterative retraining, the generated images began to exhibit semantic inconsistencies and visual distortions. The researchers also found that model collapse not only affected image quality but also resulted in a lack of diversity in the generated images.

To verify this, the researchers conducted control experiments, attempting to mitigate the impact of model collapse by adjusting the color histogram of the generated images and removing low-quality images. However, the results indicated that these measures were not effective in preventing model collapse.
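As a concrete example of one such mitigation, a color-histogram adjustment could look like the following sketch using scikit-image; the exact correction and quality-filtering procedures used by the authors are not reproduced here, and the file paths are placeholders.

```python
import numpy as np
from skimage import io
from skimage.exposure import match_histograms

def color_correct(generated_path, reference_path, out_path):
    """Match a generated image's color histogram to a real reference image.

    This mirrors the kind of color-histogram adjustment the researchers tried
    as a mitigation; the choice of reference image is an assumption.
    """
    generated = io.imread(generated_path)
    reference = io.imread(reference_path)
    # Map the generated image's per-channel histogram onto the reference's.
    matched = match_histograms(generated, reference, channel_axis=-1)
    io.imsave(out_path, matched.astype(np.uint8))
```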

The researchers also explored whether a model could recover after being "contaminated." They found that, although image quality improved in some cases after several further rounds of retraining on real images, signs of model collapse still persisted. This suggests that once a model is "contaminated," the effects can be long-lasting, and possibly irreversible.

This study highlights an important issue: today's popular diffusion-based text-to-image systems are highly sensitive to data "contamination." Such contamination can happen unintentionally, for example when training images are scraped indiscriminately from the web, or deliberately, as a targeted attack in which "contaminated" data is planted on websites.

Facing these challenges, the researchers proposed some possible defenses, such as using detectors of image authenticity to exclude AI-generated images from training data, or adding watermarks to generated images so they can be identified later. Neither method is perfect on its own, but combined they could significantly reduce the risk of data "contamination."
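One way to picture such a defense is as a filtering step applied before training; `is_ai_generated` below is a hypothetical detector (a real system might also check for embedded watermarks), not an implementation from the paper.

```python
def filter_training_pool(images, is_ai_generated, threshold=0.5):
    """Drop images flagged as likely AI-generated before they enter training.

    `is_ai_generated` is a hypothetical detector returning a score in [0, 1];
    a watermark check could play the same gatekeeping role.
    """
    return [img for img in images if is_ai_generated(img) < threshold]
```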

This study reminds us that the development of AI technology is not without risks. We need to handle AI-generated content more cautiously to ensure it does not have a long-term negative impact on our models and datasets. Future research needs to further explore how to make AI systems more resilient to this type of data "contamination" or develop technologies that can accelerate the "healing" of models.

Paper link: https://arxiv.org/pdf/2311.12202