SafeWork-R1: Coevolving Safety and Intelligence under the AI-45^{circ} Law Paper • 2507.18576 • Published Jul 24, 2025 • 10
Toward Efficient Agents: Memory, Tool learning, and Planning Paper • 2601.14192 • Published 22 days ago • 54
ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback Paper • 2601.10156 • Published 28 days ago • 26
ProGuard: Towards Proactive Multimodal Safeguard Paper • 2512.23573 • Published Dec 29, 2025 • 6
Collaborative Shadows: Distributed Backdoor Attacks in LLM-Based Multi-Agent Systems Paper • 2510.11246 • Published Oct 13, 2025 • 2
SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models Paper • 2402.05044 • Published Feb 7, 2024 • 2
From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities Paper • 2401.15071 • Published Jan 26, 2024 • 37