AutoDAN-Turbo
An automated framework for breaking the limitations of large language models
CommonProductProgrammingAutomationJailbreaking
AutoDAN-Turbo is an automated framework that operates without human intervention, designed to discover and implement various strategies to circumvent the limitations of large language models (LLMs). The framework can automatically develop diverse attack strategies, significantly increasing the success rate of attacks, and integrates existing human-designed jailbreak strategies into a unified framework. Its significance lies in enhancing the security and reliability of LLMs in adversarial environments, offering a new automated approach for red team assessment tools.
AutoDAN-Turbo Visit Over Time
Monthly Visits
494758773
Bounce Rate
37.69%
Page per Visit
5.7
Visit Duration
00:06:29