osunlp/bioscan-traits
Viewer
• Updated
• 80.8k • 31 • 1
Natural language processing, language models, language agents
When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents
When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents