Wenbo Chen's picture

2

Wenbo Chen

wenbochen111

·

https://wenbo11.github.io/

AI & ML interests

LLM

Recent Activity

authored a paper about 11 hours ago

ClawsBench: Evaluating Capability and Safety of LLM Productivity Agents in Simulated Workspaces

upvoted a paper about 14 hours ago

ClawsBench: Evaluating Capability and Safety of LLM Productivity Agents in Simulated Workspaces

authored a paper about 2 months ago

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

View all activity

Organizations

None yet

authored a paper about 11 hours ago

ClawsBench: Evaluating Capability and Safety of LLM Productivity Agents in Simulated Workspaces

Paper • 2604.05172 • Published 4 days ago • 18

upvoted a paper about 14 hours ago

ClawsBench: Evaluating Capability and Safety of LLM Productivity Agents in Simulated Workspaces

Paper • 2604.05172 • Published 4 days ago • 18

authored a paper about 2 months ago

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

Paper • 2602.12670 • Published Feb 13 • 58

upvoted a paper about 2 months ago

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

Paper • 2602.12670 • Published Feb 13 • 58