Reddit Demands Tech Giants to Pay for Data Use as Stalemate with Companies like Microsoft Continues

AIbase基地

Published inAI News · 4 min read · Aug 1, 2024

134

Recently, during an interview, Steve Huffman, CEO of Reddit, revealed that the company is seeking data usage agreements with major tech firms, requiring those who wish to continue scraping Reddit data to pay for it. This move stems from agreements already in place with Google and OpenAI, with Huffman hoping other companies will follow suit.

Huffman specifically called out Microsoft, Anthropic, and Perplexity for refusing to negotiate over data usage, stating that "blocking these companies is quite troublesome." He noted that without relevant agreements, Reddit has no control or insight into how its data is used, forcing the company to block those unwilling to accept the terms.

Reddit, Official Logo Screenshot

In response to this situation, Reddit has tightened restrictions on web crawlers in recent months. Early July saw the company update its robots.txt file to block crawlers from accessing data without a signed agreement. Subsequently, users noticed that Reddit content only appeared in Google search results with an agreement, while it vanished from other search engines like Bing.

Huffman criticized Microsoft for using Reddit data to train AI without authorization and selling the content through the Bing API to other search engines. He cited the Microsoft AI CEO's statement that public data on the internet is "freeware." Huffman believes this viewpoint reflects the stance of some tech companies towards internet content.

Regarding the disappearance of Reddit content from Bing, Microsoft Search Director Jordi Ribas explained that this was due to Reddit blocking Bing from crawling its site. A Microsoft spokesperson emphasized that the company respects the directives of website providers regarding content usage.

Huffman pointed out that the value exchange model of traditional search engines has evolved. With the convergence of search, summarization, and AI training, the simple model of exchanging content for traffic has become more complex. He stated that Reddit, along with traditional media publishers, is exploring a paid model for providing information to generative AI.

In response, Anthropic stated that it has blacklisted Reddit for crawlers, respecting its robots.txt settings. Microsoft declined to comment on the matter, and Perplexity did not respond to requests for comment.

This controversy highlights the complexities of content value and usage rights in the digital age, also suggesting potential new collaboration models between tech companies and content providers.

Microsoft Launches MAI-Transcribe-1, the World's Most Accurate Speech-to-Text Model

Microsoft has launched a new speech-to-text model called MAI-Transcribe-1, which achieved an average word error rate of only 3.9% across 25 languages, becoming the most accurate transcription model in the world. The model performed excellently in the FLEURS benchmark test, especially showing outstanding results in 11 core languages such as English. This is the third product in Microsoft's MAI series, following the release of a speech synthesis and image generation model.

Google Vids Integrates Veo3.1 Model, Supporting Text Prompts to Direct AI Virtual Characters Interaction

Google upgrades its enterprise video application Vids, integrating the Veo3.1 model to enable dynamic interaction with AI virtual characters. Users can control the character and scene interaction through text commands while maintaining character consistency. The update enhances multimodal integration, improving video creation efficiency.

Google Releases Gemma4 Open Source Model: Adopting the Apache License to Fully Unleash Developer Productivity

Google has released the new open source AI model Gemma4, which adopts the Apache 2.0 license, replacing previous restrictive agreements. This allows developers to freely use, modify, and distribute the model, facilitating commercial applications. The model achieves dual upgrades in technical architecture, improving performance and ecosystem compatibility.

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Services​

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

Reddit Demands Tech Giants to Pay for Data Use as Stalemate with Companies like Microsoft Continues

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Behind the Children's Safety Alliance: Hidden Secrets: OpenAI's Secret Funding Sparks Doubt

Powered by the Apache 2.0 License! Google Gemma 4 is Now Open Source: 31B Parameters Performance Approaches Leading Large Models

Google Releases New Pricing Strategy for Gemini API, Billing for Inference Services on Demand

Microsoft Launches AI Self-Development Campaign: Aiming to Unveil the Strongest In-House Model by 2027

OpenAI Acquires Tech Comedy Show TBPN to Guide AI Public Dialogue

Microsoft Launches MAI-Transcribe-1, the World's Most Accurate Speech-to-Text Model

Google Plans to Build a 933 MW Natural Gas Power Plant to Support the Operation of Its Large AI Data Centers

Google Vids Integrates Veo3.1 Model, Supporting Text Prompts to Direct AI Virtual Characters Interaction

Microsoft Accelerates Development of In-House AI Models, Aiming to Lead the Industry in Text, Image, and Audio Processing

Google Releases Gemma4 Open Source Model: Adopting the Apache License to Fully Unleash Developer Productivity

AI News Recommendations

Behind the Children's Safety Alliance: Hidden Secrets: OpenAI's Secret Funding Sparks Doubt

Powered by the Apache 2.0 License! Google Gemma 4 is Now Open Source: 31B Parameters Performance Approaches Leading Large Models

Google Releases New Pricing Strategy for Gemini API, Billing for Inference Services on Demand

Microsoft Launches AI Self-Development Campaign: Aiming to Unveil the Strongest In-House Model by 2027

OpenAI Acquires Tech Comedy Show TBPN to Guide AI Public Dialogue

Microsoft Launches MAI-Transcribe-1, the World's Most Accurate Speech-to-Text Model

Google Plans to Build a 933 MW Natural Gas Power Plant to Support the Operation of Its Large AI Data Centers

Google Vids Integrates Veo3.1 Model, Supporting Text Prompts to Direct AI Virtual Characters Interaction

Microsoft Accelerates Development of In-House AI Models, Aiming to Lead the Industry in Text, Image, and Audio Processing

Google Releases Gemma4 Open Source Model: Adopting the Apache License to Fully Unleash Developer Productivity

GEO Services