Applebot Faces Backlash: Major Media Outlets Set Up Blocks, AI Content Scraping Sparks New Controversy

AIbase基地

Published in AI News · 5 min read · Aug 30, 2024

According to a recent report by Wired, Apple's web crawler, Applebot, has met collective resistance from several mainstream media outlets, sparking widespread industry discussion about AI content scraping.

Since its first appearance in November 2014 and its official debut in May 2015, Applebot has quietly crawled the web to improve the search capabilities of Siri and Spotlight. However, recent investigations show that several well-known media outlets and platforms, including Facebook, Instagram, The New York Times, and the Financial Times, have chosen to block the crawler, denying it access to their website content.

This blocking is done primarily through the robots.txt file, which tells crawlers which parts of a site they may access. Research data shows that approximately 6% to 7% of websites have blocked Applebot-Extended, while another study indicates that as many as 25% of tested websites block it. The phenomenon is not limited to Apple: OpenAI's and Google's crawlers have met similar treatment, with 53% and 43% of news websites, respectively, choosing to block them.
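As a rough illustration of how such a block works, the sketch below parses a made-up robots.txt with Python's standard urllib.robotparser module and reports which crawler tokens are denied. The sample file, the example.com URL, and the helper name blocked_agents are hypothetical and not taken from any publisher. GPTBot and Google-Extended are assumed here to be the tokens behind "OpenAI and Google's scraping robots"; the article itself does not name them.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for an imaginary news site (not a real publisher's file).
SAMPLE_ROBOTS_TXT = """\
User-agent: Applebot-Extended
Disallow: /

User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def blocked_agents(robots_txt: str, agents: list[str], url: str) -> dict[str, bool]:
    """Return {agent: True if the agent is disallowed from fetching url}."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return {agent: not parser.can_fetch(agent, url) for agent in agents}


if __name__ == "__main__":
    agents = ["Applebot", "Applebot-Extended", "GPTBot", "Google-Extended"]
    report = blocked_agents(
        SAMPLE_ROBOTS_TXT, agents, "https://example.com/news/story.html"
    )
    for agent, is_blocked in report.items():
        print(f"{agent}: {'blocked' if is_blocked else 'allowed'}")
```

In this sample, only the tokens the file names explicitly are denied; any other crawler falls through to the wildcard rule and is allowed. Note that robots.txt is advisory: it works only if the crawler chooses to honor it.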

Although Applebot's blocking rate is relatively low, experts believe this is not because the media is particularly fond of it, but because Applebot is less publicly visible than other crawlers and has therefore drawn less attention. This explanation hints at how complicated the AI content scraping landscape has become.

Behind this "social cold war" lies the media industry's ambivalent attitude toward AI technology. On one hand, AI has brought revolutionary changes to content distribution and user experience; on the other, unauthorized content scraping raises issues such as copyright protection and data privacy.

For Apple, the plight of Applebot is undoubtedly a warning. Finding a balance between technological innovation and content rights has become a difficult issue for tech giants. At the same time, this also serves as a wake-up call for the entire industry, reminding us to re-examine the content ecosystem in the AI era.

As AI technology continues to advance, similar controversies may intensify. How to formulate reasonable content scraping rules, how to protect the rights of creators, and how to balance openness with protection are challenges that the entire internet industry needs to face together.

In this contest between AI and traditional media, there are no absolute winners. In the future, we may need to establish a more transparent and fair content ecosystem that protects original work while leaving room for technological innovation. Only then can AI technology and the content industry both benefit, and the industry as a whole develop in a healthy way.

This article is from AIbase Daily
