Claiming $5 Million! YouTube Creator Sues OpenAI, Accusing of Unauthorized Use of Video Transcription Content

AIbase基地

Published inAI News · 5 min read · Aug 6, 2024

389

Recently, a YouTube creator from Massachusetts, David Millette, has filed a class-action lawsuit against OpenAI, alleging that the company used transcriptions of millions of YouTube videos to train their generative AI models without permission. According to the complaint filed by Millette's attorneys in the U.S. District Court for the Northern District of California, OpenAI is accused of secretly transcribing his videos and those of other creators to train models for ChatGPT and other generative AI products.

youtube

The complaint states that OpenAI has profited from the creators' work by collecting this data, which violates copyright laws and YouTube's terms of service, which prohibit the use of videos for applications independent of its service. Millette's attorneys write in the complaint that OpenAI's AI products have become more valuable due to the use of training data that was not consented to, credited, or compensated.

The law firm representing Millette seeks a jury trial and demands over $5 million in damages on behalf of all potentially affected YouTube users and creators.

It is well known that generative AI models do not possess true intelligence. They learn the likelihood and patterns of data occurrences by processing large samples of data such as movies, recordings, and papers. Many models' training data is sourced from public websites and datasets online. Although companies claim their data scraping complies with the principle of "fair use," many copyright holders disagree and have resorted to litigation to halt this practice.

Video transcriptions have become an important training data source, especially as other data sources have dried up. According to Originality.AI, over 35% of the top websites worldwide have now blocked OpenAI's web crawlers. Additionally, research from MIT's Data Source Initiative shows that about 25% of high-quality data sources have been restricted, making training data for AI models more scarce.

It is worth noting that OpenAI's Whisper model is specifically designed to transcribe video audio to collect more training data. According to The New York Times, after transcribing over a million hours of YouTube videos, OpenAI used these transcriptions to train their GPT-4 model, sparking internal discussions that this might violate YouTube's rules.

Key Points:

🔍 YouTuber David Millette has filed a class-action lawsuit against OpenAI, accusing it of using video transcriptions for AI training without permission.

💰 Millette seeks over $5 million in damages, representing all affected YouTube creators.

🚫 The data sources for generative AI models face increasingly stringent restrictions, with many top websites having blocked OpenAI's crawlers.

Generative AI OpenAI ChatGPT YouTube

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Sudden Shakeup in the Short Drama Industry! Hongguo Merges Real Actors and AI Rankings: AI-Generated Realistic Dramas Reach Higher Popularity Than Live-Action Dramas for the First Time

The AI short drama 'Bodhi Descends: Real-AI Version' tops the trending chart on Hongguo Short Drama platform, marking a breakthrough for AI-generated content in popularity.....

Apr 10, 2026

220

Lenovo's New Fiscal Year Will Launch AI Desktop: From Tianxi Ecosystem to Full-Scenario Intelligence

Lenovo unveils 'AI Host' at its 2026/2027 Kickoff, aiming to accelerate AI adoption in enterprise and personal settings by integrating AI natively into hardware for scalable deployment.....

Apr 10, 2026

190

AI Content Creation Has Surpassed Humans, Creative Crisis Is Becoming More Severe

AI-generated content now surpasses human-created content online, with a sharp rise since ChatGPT's 2022 launch. It's projected to dominate by 2025, raising concerns about efficiency versus creativity.....

Apr 10, 2026

140

Ant Group Wins Championship at Top Computer Vision Conference, Achieving Practical-Level Advancement in AIGC Detection

Ant Group won the championship in the "Robustness Sample Testing in Complex Real-World Scenarios" and "Facial Enhancement Anomaly Detection" tracks at the CVPR 2026 NTIRE Challenge. This achievement helps enhance risk identification capabilities in scenarios such as payment, content review, and financial authentication. In response to the increasing challenges of deepfakes and misuse of AIGC, as well as the insufficiency of detection models in real-world scenarios and multi-modal large model iterations, this breakthrough provides important technical support.

Apr 10, 2026

230

Meta AI App Rises to Fifth on App Store, Muse Spark Drives Download Surge

Meta's new AI model Muse Spark, developed by a team led by former Scale AI head Alexandr Wang, quickly boosted Meta AI's U.S. App Store ranking from 57th to 5th, with significant first-day download growth, reflecting strong market interest in AI technology.....

Apr 10, 2026

270

AI Daily: MiniMax Launches Music 2.6; Coze 2.5 Major Upgrade; AI Personality Test Product SBTI Goes Viral Online

Welcome to the [AI Daily] column! This is your guide to exploring the world of artificial intelligence every day. Every day, we present you with the latest content in the AI field, focusing on developers, helping you understand technical trends and innovative AI product applications. Click to learn more about new AI products: https://app.aibase.com/zh1. AI personality test product SBTI goes viral online: The AI personality test product SBTI, which uses absurd tags and AI synthesis technology, quickly goes viral online with its absurd abstract tags and deconstructive expressions.

Apr 10, 2026

290

The Explosion of the Domestic Agent Ecosystem! Xiaomi MiMo-V2 Joins the Top Framework Hermes and Opens a 14-Day Free Trial

Xiaomi's self-developed large model MiMo-V2 series has officially joined the globally top-tier open-source Agent framework Hermes Agent, achieving a strong combination. Developers can directly call Xiaomi's flagship model through Nous Portal after updating the framework. At the same time, Xiaomi has launched a two-week free trial "family pack" activity to reward developers.

Apr 10, 2026

420

Physical AI Devices Will See a Surge: Expected Shipments Reach 145 Million Units by 2035

Counterpoint Research predicts that the total shipments of global physical AI devices will reach 145 million units from 2025 to 2035, with drones, robots, and autonomous vehicles accounting for 59 million, 48 million, and 38 million units respectively. The report indicates that the humanoid robot market is growing the fastest, with the cumulative installation volume expected to exceed 100,000 units by 2028.

Apr 10, 2026

160

AI Music Enters the Cover Era! MiniMax Launches Music 2.6: Introducing New Cover Function and Agent Skills

MiniMax launches the new generation AI music generation model Music 2.6, achieving a comprehensive upgrade from the underlying engine to the creation tools. Core optimizations include significantly reducing generation latency, improving the coherence of music structure, enhancing audio quality and listening experience, and adding new creative functions such as 'music continuation'. This update aims to provide creators with a more accurate and smooth music generation experience, expanding the boundaries of AI music interaction.

Apr 10, 2026

340

Core Talent Loss in ByteDance's Seed Team: 70 People Left in One Year, Tencent and Alibaba as Main Destinations

In the past year, nearly 70 technical talents have left ByteDance's AI core department, the Seed Team, moving to leading tech companies and AI startups, reflecting the intensifying competition for talent in large model development in China. The team was established in 2023, focusing on cutting-edge research in LLMs, speech, vision, and world models. Its Doubao large model has already supported over 50 application scenarios including Doubao and Koala.

Apr 10, 2026

360

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Services​

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

Claiming $5 Million! YouTube Creator Sues OpenAI, Accusing of Unauthorized Use of Video Transcription Content

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Sudden Shakeup in the Short Drama Industry! Hongguo Merges Real Actors and AI Rankings: AI-Generated Realistic Dramas Reach Higher Popularity Than Live-Action Dramas for the First Time

Lenovo's New Fiscal Year Will Launch AI Desktop: From Tianxi Ecosystem to Full-Scenario Intelligence

AI Content Creation Has Surpassed Humans, Creative Crisis Is Becoming More Severe

Ant Group Wins Championship at Top Computer Vision Conference, Achieving Practical-Level Advancement in AIGC Detection

Meta AI App Rises to Fifth on App Store, Muse Spark Drives Download Surge

AI Daily: MiniMax Launches Music 2.6; Coze 2.5 Major Upgrade; AI Personality Test Product SBTI Goes Viral Online

The Explosion of the Domestic Agent Ecosystem! Xiaomi MiMo-V2 Joins the Top Framework Hermes and Opens a 14-Day Free Trial

Physical AI Devices Will See a Surge: Expected Shipments Reach 145 Million Units by 2035

AI Music Enters the Cover Era! MiniMax Launches Music 2.6: Introducing New Cover Function and Agent Skills

Core Talent Loss in ByteDance's Seed Team: 70 People Left in One Year, Tencent and Alibaba as Main Destinations

AI News Recommendations

Sudden Shakeup in the Short Drama Industry! Hongguo Merges Real Actors and AI Rankings: AI-Generated Realistic Dramas Reach Higher Popularity Than Live-Action Dramas for the First Time

Lenovo's New Fiscal Year Will Launch AI Desktop: From Tianxi Ecosystem to Full-Scenario Intelligence

AI Content Creation Has Surpassed Humans, Creative Crisis Is Becoming More Severe

Ant Group Wins Championship at Top Computer Vision Conference, Achieving Practical-Level Advancement in AIGC Detection

Meta AI App Rises to Fifth on App Store, Muse Spark Drives Download Surge

AI Daily: MiniMax Launches Music 2.6; Coze 2.5 Major Upgrade; AI Personality Test Product SBTI Goes Viral Online

The Explosion of the Domestic Agent Ecosystem! Xiaomi MiMo-V2 Joins the Top Framework Hermes and Opens a 14-Day Free Trial

Physical AI Devices Will See a Surge: Expected Shipments Reach 145 Million Units by 2035

AI Music Enters the Cover Era! MiniMax Launches Music 2.6: Introducing New Cover Function and Agent Skills

Core Talent Loss in ByteDance's Seed Team: 70 People Left in One Year, Tencent and Alibaba as Main Destinations

GEO Services