Granite Guardian 3.2 5B is an AI risk detection model developed by IBM to detect security risks in prompts and responses. Its risk categories are based on the IBM AI Risk Atlas and include harm, social bias, jailbreaking, and violence. The model is intended for enterprise-level AI security monitoring.
Natural Language Processing
Transformers