According to insiders, Reddit has reached an agreement with Google valued at approximately $60 million annually. Axel Springer has partnered with OpenAI, becoming the first publisher to deeply integrate its news business with artificial intelligence technology. The collaboration between OpenAI and Axel Springer indicates that large-scale model training may require paid access to data. Publishing companies, with their extensive digital image and text resources, could become significant sources of data for large-scale model training. CITIC Publishing is attempting to cooperate with authors and large-model companies on language-model training, while Zhangyue Technology has engaged in deep cooperation with ByteDance in areas such as copyright and content production.
The Copyright Issues of AI Large Model Training Data Become Prominent, and the Value of Quality Training Databases May Be Re-evaluated
Cailian Press (财联社)
AI Large Models, Training Data, Copyright Issues, Quality Databases, Artificial Intelligence, Intellectual Property, Caijing, Datasets
© Copyright AIbase Base 2024. Source: https://www.aibase.com/news/5551