Insiders have revealed that Reddit has reached an agreement with Google, valued at approximately $60 million annually. The Springer Publishing Group has partnered with OpenAI, becoming the first publisher to deeply integrate the news industry with artificial intelligence technology. The collaboration between OpenAI and Axel Springer indicates that large-scale model training may require paid access to data. Publishing companies, with their extensive digital image and text resources, could become significant datasets for large-scale model training. CITIC Publishing is attempting to cooperate with authors and large model companies for language training, while Zhangyue Technology has engaged in deep cooperation with ByteDance in areas such as copyright and content production.
The Copyright Issues of AI Large Model Training Data Become Prominent, and the Value of Quality Training Databases May Be Re-evaluated

财联社
This article is from AIbase Daily
Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.