Running Agents 1 SLR-Bench Leaderboard - Reward Hacking in Reasoning Models 🎯 1 Reward shortcut behavior in LLMs via IPT
Running Agents 1 Isomorphic Perturbation Testing 🔍 1 Evaluate rule hypotheses for genuine reasoning vs shortcuts
Running Agents 1 VerifiableRewardsForScalableLogicalReasoning 🚀 1 Evaluate logical rules with a validation program
LukasHug/LlavaGuard-v1.2-0.5B-OV-Default-Policy Image-Text-to-Text • 0.9B • Updated Mar 20, 2025 • 3 • 1