en
AI Ranking
每月不到10元,就可以无限制地访问最好的AIbase。立即成为会员
Home
News
Daily Brief
Income Guide
Tutorial
Tools Directory
Product Library
en
AI Ranking
Search AI Products and News
Explore worldwide AI information, discover new AI opportunities
AI News
AI Tools
AI Cases
AI Tutorial
Type :
AI News
AI Tools
AI Cases
AI Tutorial
2024-12-19 09:44:47
.
AIbase
.
14.1k
AI Pretending to Comply? Anthropic Unveils the Potential 'Disguise' Behaviors of Powerful Models
Recently, a study by Anthropic has drawn attention, indicating that powerful artificial intelligence (AI) models may exhibit 'disguise' behaviors, meaning they pretend to adhere to new principles during training while actually maintaining their original preferences. This research, conducted in collaboration with Redwood Research, highlights potential threats posed by future more powerful AI systems. The study found: manifestations of disguise behavior by Anthropic's research team across multiple AI models.