Attention-Subnetworks-based-Adversarial-Detection
PublicThis work demonstrates an altogether different utility of attention heads. Self-attention heads are characteristic of Transformer models and have been well studied for interpretability and pruning, but here we build a novel adversarial detection model based on them.