Research Finds AI Agents Vulnerable to Attacks, User Data at Risk

AIbase基地

Published inAI News · 5 min read · Feb 18, 2025

140

Recently, a research team from Columbia University and the University of Maryland released a new study revealing serious security vulnerabilities in AI agents with internet access.

The study shows that attackers can easily manipulate these AI systems to leak users' private information, download malicious files, and even send phishing emails to the users' contacts. These attacks do not require any specialized AI or programming knowledge, which is shocking.

Artificial Intelligence Robot

Image Source Note: Image generated by AI, licensed by Midjourney

The research team tested several well-known AI agents, including Anthropic's computer assistant, the MultiOn Web Agent, and the ChemCrow research assistant. They found that these systems have relatively weak security defenses. The researchers detailed how attackers could guide the AI agents from trusted websites to malicious sites in a four-stage process, ultimately leading to the leakage of users' sensitive data.

The researchers developed a comprehensive framework to categorize different types of attacks, analyzing factors such as the attacker (external attacker or malicious user), the target (data theft or agent manipulation), the method of access (operating environment, storage, or tools), and the strategies used (such as jailbreak prompts). In a specific test, the researchers created a fake website promoting an "AI-Enhanced German Refrigerator" named “Himmelblau KÖNIGSKÜHL Diplomat DK-75.” When the AI agents visited the site, they encountered hidden jailbreak prompts, resulting in the agents indiscriminately leaking confidential information, including credit card numbers, and downloading files from suspicious sources in ten attempts.

Additionally, the research uncovered serious vulnerabilities in email integration. When users log into email services, attackers can manipulate AI agents to send seemingly trustworthy phishing emails to contacts. In such cases, even experienced users find it difficult to discern the authenticity of these scam messages.

Despite the exposure of these security risks in AI systems, many companies are still accelerating their commercialization efforts. ChemCrow is already available on Hugging Face, the Claude computer assistant exists in Python script form, and MultiOn offers a developer API. Meanwhile, OpenAI has launched ChatGPT Operator, and Google is developing Project Mariner. The research team calls for strengthened security measures, including the implementation of strict access controls, URL verification, and user confirmation for downloads, to ensure user data security.

Key Points:
💻 Research indicates that AI agents can be easily manipulated, leading to data leakage and malicious downloads.
📧 Attackers can use AI agents to send phishing emails, increasing the risk of scams.
🔒 Experts urge for enhanced security in AI systems and recommend implementing various protective measures.

OpenAI Predicts $125 Billion Revenue by 2029, 3 Billion Monthly Active Users by 2030

OpenAI recently released a prediction forecasting $125 billion in total revenue by 2029. AI agent and channel revenue will be key drivers. AI agent revenue is projected to reach nearly $29 billion, representing almost a quarter of total revenue, while channel revenue is expected to reach $25 billion. Image note: Image generated by AI, image licensing service Midjourney. Following the success of ChatGPT, OpenAI's...

Claude-3 surpasses human average IQ, Anthropic leads AI intelligence into a new era

Anthropic's Claude-3 model has achieved a breakthrough in IQ testing, surpassing the human average of 100 for the first time. This marks a milestone in AI development. According to AIbase, Claude-3 outperformed its predecessor in the Norwegian Mensa IQ test, signifying a remarkable leap in AI cognitive abilities. Community analysis suggests this achievement reflects not only Anthropic's technological prowess but also sparks widespread discussion about the future of AI. Related data and predictions are...

JEDEC Releases HBM4 Standard, Powering the Next Era of AI and High-Performance Computing

The JEDEC Solid State Technology Association has announced the highly anticipated release of the High Bandwidth Memory (HBM) standard – HBM4. Evolving from the HBM3 standard, HBM4 aims to further accelerate data processing while maintaining higher bandwidth, energy efficiency, and greater capacity per chip or stack, to meet the demands of efficient processing of large datasets and complex computations. The HBM4 standard introduces several key technological advancements, suitable for applications in generative AI, high-performance computing, high-end graphics cards, and servers. Firstly, HBM4 significantly increases bandwidth...

Anthropic Releases Best Practices Guide for Claude Code, Seamlessly Integrating AI into Developer Workflows

Anthropic recently released a comprehensive best practices guide for Claude Code, providing developers with a low-level, command-line interface (CLI)-centric tool to seamlessly integrate the Claude large language model into their daily programming tasks. Based on Anthropic's internal best practices, this guide emphasizes flexible, secure, and efficient coding patterns, offering valuable guidance for engineers looking to incorporate AI into their existing development environments.

Unveiling Claude's Values: 700,000 Conversations Reveal its Ethical Framework

Anthropic, an AI company, recently published a significant study analyzing the values expressed by its AI assistant, Claude, in real-world conversations. By deeply analyzing 700,000 anonymized conversations, the research team revealed 3,307 unique values demonstrated by Claude across various contexts, offering new insights into AI alignment and safety. This research aimed to assess whether Claude's behavior aligns with its design goals. The research team developed a novel evaluation method...

AI Delves into Code: MCPSCAN.AI Revolutionizes Minecraft Server Plugin Security

Minecraft's appeal largely stems from its high degree of customizability, with plugins being central to this. However, the community-driven plugin ecosystem, while offering endless possibilities, also harbors security risks. Malicious code, backdoors, or plugins with critical vulnerabilities can ruin a server administrator's hard work. Now, a new tool called MCPSCAN.AI leverages artificial intelligence to address the security challenges within the vast Minecraft plugin landscape.