OpenAI Faces Copyright Risks for Using Game Content to Train Sora

AIbase基地

Published inAI News · 5 min read · Dec 12, 2024

296

OpenAI recently launched its video generation model Sora, but the training data for this model may contain a lot of copyrighted game content, raising concerns about legal issues. Sora can generate videos of up to 20 seconds in length based on user text prompts or images, supporting various aspect ratios and resolutions.

At the time of its release, OpenAI mentioned that Sora's training data included Minecraft videos, sparking curiosity about other potential game content that may have been used.

In practical tests, Sora was able to generate videos resembling several well-known games, including a clone that looked like Super Mario Bros, first-person shooters inspired by Call of Duty and Counter-Strike, as well as segments similar to the 90s Ninja Turtles arcade fighting games. Additionally, Sora seems to have mastered the style of Twitch streaming, generating characters similar to popular streamers Auronplay and Pokimane.

However, OpenAI has not detailed the sources of data used by Sora. While OpenAI stated that it used "publicly available" data and obtained licensed data from stock media libraries like Shutterstock, this does not eliminate legal risks. Intellectual property lawyer Joshua Weigensberg pointed out that if Sora's training data indeed includes real gameplay videos, it likely involves the reproduction of copyrighted materials.

AI generation models like Sora are based on probabilistic learning, identifying patterns through large datasets. However, this can also lead to outputs that closely resemble the training data, causing dissatisfaction among creators, and more individuals are starting to seek legal remedies.

The handling of game content is particularly complex because video playback involves not only the copyrights of game developers but also the unique videos created by players. If courts determine that copyright infringement occurred during the training of AI models, developers could face greater legal risks.

Although AI companies may win some legal disputes, this does not mean that users of these models can be completely exempt from liability. Generated content may touch on multiple legal areas, including copyright, trademark rights, and portrait rights. Therefore, developers must exercise extra caution when training AI models.

Key Points:
🎮 OpenAI's newly launched video generation model Sora may have been trained on data containing game content, facing legal risks.
🧑‍⚖️ Intellectual property experts state that copyright issues related to game content are complex and involve multiple rights holders.
⚖️ The legal responsibilities for AI-generated content may affect not only developers but also ordinary users.

Sora OpenAI Minecraft Video Generation Model

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Yinghe Yimei Collaborates with Beijing Tiantan Hospital to Launch the World's First Comprehensive Cranial CT Auxiliary Report Large Model

Beijing Tiantan Hospital and Yinghe Yimai jointly released 'Dr. Xiaojun 2.0', the world's first full-disease-coverage CT-assisted report generation model for cranial scans. Leveraging Tiantan's massive cranial CT data and Yinghe's foundation model with AI Agent technology, it automates the entire process from image analysis to diagnostic reporting, significantly enhancing neuroimaging diagnostic standards.....

Apr 24, 2026

180

The World's First Large Model for Full-Disease Coverage in Cranial CT Auxiliary Report Generation is Launched!

Yinghe Yimei and Beijing Tiantan Hospital jointly launched the world's first large model for full-disease coverage in cranial CT auxiliary report generation, "Xiao Jun Doctor 2.0", on April 24 in Beijing. This AI product aims to improve the efficiency and accuracy of medical imaging reports through advanced technology, attracting widespread attention from medical professionals and tech enthusiasts.

Apr 24, 2026

150

AI Daily: DeepSeek-V4 Preview Version Officially Released; Tesla In-Car Voice Accesses Doubao; Meituan Secretly Trials a Trillion-Level AI Large Model

Welcome to the [AI Daily] section! Here is your guide to exploring the world of artificial intelligence every day. Every day, we present you with the latest content in the AI field, focusing on developers, helping you understand technology trends and innovative AI product applications. Click to learn more about new AI products: https://app.aibase.com/zh1. DeepSeek-V4 Preview Version Officially Released: 1M Long Context Enters an Era of Universal Accessibility. DeepSeek-V4 Preview Version Officially Released, with 1M Long Context Capabilities, and

Apr 24, 2026

130

Hong Kong Stock Market's Large Model Stocks Plunge! Zhipu and Minimax Suffer Heavy Losses After Deepseek V4 Release

In the Hong Kong stock market, shares of Zhipu Technology and Minimax fell significantly after the release of Deepseek V4, a highly anticipated deep learning model with technical upgrades and enhanced features. This unexpected downturn in these major AI concept stocks sparked widespread investor discussion.....

Apr 24, 2026

160

Cambricon Announces Full Series Model Day0 Compatibility and Open-Sourcing of Optimized Code for DeepSeek-V4

Cambricon announced the completion of Day0 compatibility for the entire series of DeepSeek-V4 models. Based on the vLLM inference framework, it covers the 285B parameter Flash version and the 1.6T parameter Pro version. By optimizing sparse attention and compressed structures with its self-developed Torch-MLU-Ops operator library, the model can run stably on Cambricon hardware on the day of release. The related code has been open-sourced to GitHub.

Apr 24, 2026

210

Cambrian Successfully Compatible with DeepSeek-V4 Promotes Efficient AI Model Operation

Cambricon announced successful Day 0 adaptation of DeepSeek-V4, an open-source AI model by DeepSeek, achieving stable operation on launch day. Using its self-developed fusion operator library Torch-MLU-Ops, it accelerated modules like Compressor and mHC, significantly boosting inference efficiency. The vLLM inference framework was also adopted for a more efficient AI experience.....

Apr 24, 2026

200

Meituan Secretly Launches a Trillion-Parameter AI Large Model! Currently Only Open to Invited Users

Meituan has recently launched a trillion-parameter AI large model test. The model is trained entirely on domestic computing power clusters, marking a significant breakthrough in the application of domestic technology. It is currently only available to invited users and has not been widely released, demonstrating Meituan's leading position in the AI field.

Apr 24, 2026

250

Kunlun Tech Launches 4+3 Strategy: From Technical Foundation to Business Cycle

According to Kunlun Tech's 2025 annual report, the company's revenue reached 8.198 billion yuan, an increase of 44.78% year-on-year, with overseas revenue reaching 7.723 billion yuan, up 49.91%. The company introduced the "4+3 Strategy", clearly defining the development direction of AI-driven content production, covering both technological and business layout.

Apr 24, 2026

150

The Shadow of OpenAI and Anthropic: Why Cursor's $5 Billion Funding Round Was Rejected by Big Tech Investors

AI programming company Cursor faced obstacles in its quest for several billion dollars in funding, with its $5 billion valuation deterring several later-stage investment firms. Previously, SpaceX had shown interest in acquiring it for $6 billion, but top funds including Iconiq have clearly rejected the offer. The main reason for the cold funding climate is that global capital has already completed its initial setup in the AI sector.

Apr 24, 2026

200

Soul Open-Source Real-Time Digital Human Generation Model SoulXFlashTalk Achieves Sub-Second Latency

Soul AI Lab open-sourced SoulXFlashTalk, the first 1.4 billion-parameter real-time digital human generation model, featuring sub-second latency and 32 FPS, offering a complete real-time interaction solution. The open-source release includes project page, technical report, source code, and model weights, lowering industry R&D barriers.....

Apr 24, 2026

210

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

OpenAI Faces Copyright Risks for Using Game Content to Train Sora

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Yinghe Yimei Collaborates with Beijing Tiantan Hospital to Launch the World's First Comprehensive Cranial CT Auxiliary Report Large Model

The World's First Large Model for Full-Disease Coverage in Cranial CT Auxiliary Report Generation is Launched!

AI Daily: DeepSeek-V4 Preview Version Officially Released; Tesla In-Car Voice Accesses Doubao; Meituan Secretly Trials a Trillion-Level AI Large Model

Hong Kong Stock Market's Large Model Stocks Plunge! Zhipu and Minimax Suffer Heavy Losses After Deepseek V4 Release

Cambricon Announces Full Series Model Day0 Compatibility and Open-Sourcing of Optimized Code for DeepSeek-V4

Cambrian Successfully Compatible with DeepSeek-V4 Promotes Efficient AI Model Operation

Meituan Secretly Launches a Trillion-Parameter AI Large Model! Currently Only Open to Invited Users

Kunlun Tech Launches 4+3 Strategy: From Technical Foundation to Business Cycle

The Shadow of OpenAI and Anthropic: Why Cursor's $5 Billion Funding Round Was Rejected by Big Tech Investors

Soul Open-Source Real-Time Digital Human Generation Model SoulXFlashTalk Achieves Sub-Second Latency

AI News Recommendations

Yinghe Yimei Collaborates with Beijing Tiantan Hospital to Launch the World's First Comprehensive Cranial CT Auxiliary Report Large Model

The World's First Large Model for Full-Disease Coverage in Cranial CT Auxiliary Report Generation is Launched!

AI Daily: DeepSeek-V4 Preview Version Officially Released; Tesla In-Car Voice Accesses Doubao; Meituan Secretly Trials a Trillion-Level AI Large Model

Hong Kong Stock Market's Large Model Stocks Plunge! Zhipu and Minimax Suffer Heavy Losses After Deepseek V4 Release

Cambricon Announces Full Series Model Day0 Compatibility and Open-Sourcing of Optimized Code for DeepSeek-V4

Cambrian Successfully Compatible with DeepSeek-V4 Promotes Efficient AI Model Operation

Meituan Secretly Launches a Trillion-Parameter AI Large Model! Currently Only Open to Invited Users

Kunlun Tech Launches 4+3 Strategy: From Technical Foundation to Business Cycle

The Shadow of OpenAI and Anthropic: Why Cursor's $5 Billion Funding Round Was Rejected by Big Tech Investors

Soul Open-Source Real-Time Digital Human Generation Model SoulXFlashTalk Achieves Sub-Second Latency