Hugging Face: Red-Teaming Resistance Leaderboard benchmarks frontier LLM robustness (GPT-4, Claude-2, Vicuna-13B) | SignalBreak | SignalBreak