pinned Running RL API Testing Environment π RL env training agents to find OWASP API vulnerabilities
Running on Zero Agents SHADE Monitor β Before vs After π‘ Qwen 1.5B baseline vs GRPO-trained LoRA monitor.