Cybersecurity researchers recently discovered two malicious machine learning models quietly uploaded to Hugging Face, the popular machine learning platform. The models used a novel technique, "corrupted" pickle files, to evade security detection, raising concerns about the safety of model distribution.
Karlo Zanki, a researcher at ReversingLabs, noted that the pickle files extracted from these PyTorch-format archives contain malicious Python code at the very start of the stream. The payload is essentially a reverse shell that connects to a hard-coded IP address, giving attackers remote control over the compromised machine. Researchers have dubbed this pickle-based attack method nullifAI, because it aims to nullify existing security measures.
Specifically, the two malicious models found on Hugging Face are glockr1/ballr7 and who-r-u0000/0000000000000000000000000000000000000. They appear to be proofs of concept rather than components of an active supply chain attack. The pickle format is very common in machine learning model distribution, but it carries inherent security risks: the format allows arbitrary code execution during loading and deserialization, as the sketch below illustrates.
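To see why, note that any Python object can define a `__reduce__` method returning a callable and its arguments, and `pickle.loads` will invoke that callable during deserialization. Here is a minimal sketch of the mechanism, with a harmless `print` standing in for the reverse shell observed in the real samples:

```python
import pickle

class Payload:
    # __reduce__ tells pickle how to rebuild this object; the returned
    # (callable, args) pair is simply called at load time.
    def __reduce__(self):
        # A real attack would return something like (os.system, ("<shell cmd>",)).
        return (print, ("arbitrary code ran during unpickling",))

blob = pickle.dumps(Payload())
pickle.loads(blob)  # merely loading the bytes executes the call
```

Merely opening such a file with `pickle.load` (or anything built on top of it) runs the attacker's code, which is why pickle should never be used with untrusted input.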
Researchers found that both models used pickle files in PyTorch format compressed with 7z rather than the default ZIP format, a choice that allowed them to slip past malware detection by Hugging Face's Picklescan tool (see the format check sketched below). Zanki further noted that although deserialization of these pickle files fails partway through, because the inserted payload breaks the stream, enough of the stream is processed first that the malicious code still executes.
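One way to picture the evasion (a hypothetical illustration, not Picklescan's actual logic): a scanner that only recognizes the ZIP container produced by `torch.save` never unpacks, and therefore never inspects, a pickle wrapped in a 7z archive. The two containers are trivially distinguishable by their magic bytes:

```python
# Hypothetical format sniffer (not part of Picklescan): the container
# format alone can decide whether a scanner ever sees the inner pickle.
ZIP_MAGIC = b"PK\x03\x04"                # default torch.save container
SEVENZIP_MAGIC = b"7z\xbc\xaf\x27\x1c"   # container used by the malicious models

def container_format(path: str) -> str:
    with open(path, "rb") as f:
        head = f.read(6)
    if head.startswith(ZIP_MAGIC):
        return "zip"      # a ZIP-only scanner unpacks and inspects this
    if head.startswith(SEVENZIP_MAGIC):
        return "7z"       # ...but may skip this container entirely
    return "unknown"
```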
Complicating matters, the malicious code sits at the very beginning of the pickle stream, so Hugging Face's security scanning tools failed to flag the models as risky. The incident has sparked widespread concern about the security of machine learning models. In response, the Picklescan tool has been fixed and updated to prevent similar incidents from recurring.
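The sequential nature of the pickle format explains why a broken file is still dangerous: opcodes execute one by one as they are read, so a payload placed early in the stream fires before the loader ever reaches the corrupted portion. A minimal sketch, again with `print` in place of real malware:

```python
import pickle

class Beacon:
    def __reduce__(self):
        return (print, ("payload executed before the load failed",))

# Protocol 0 keeps the byte stream simple: the payload opcodes come first,
# then we replace the terminating STOP opcode with junk to "corrupt" the file.
blob = pickle.dumps(Beacon(), protocol=0)
broken = blob[:-1] + b"\x00corrupted"

try:
    pickle.loads(broken)
except Exception as exc:
    print("deserialization error:", exc)
# The payload message prints first, then the error: the code ran even
# though loading the file ultimately failed.
```

This is the asymmetry the attackers exploited: a scanner that gives up on a malformed file concludes nothing, while a loader that gives up on the same file has already executed the payload.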
The episode is a reminder to the tech community that cybersecurity must not be overlooked: with AI and machine learning advancing rapidly, protecting users and platforms is more important than ever.
Key Points:
🛡️ Malicious models used "corrupted" pickle files to evade security detection.
🔍 Researchers found these models contained reverse shells that connect to hard-coded IP addresses.
🔧 Hugging Face has updated its security scanning tools to fix related vulnerabilities.