2024-11-29 10:24:45.AIbase.13.6k
Bluesky User Data Scraped: One Million Public Posts Used for AI Training
Recently, the social media platform Bluesky faced a significant data scraping incident. A machine learning librarian, Daniel van Strien, scraped over one million public user posts from Bluesky's API and uploaded this data to the AI company Hugging Face. The dataset includes users' Decentralized Identifiers (DIDs) and a range of features to search for specific user content.