Yaorui SHI

yrshi

·

syr-cn

AI & ML interests

None yet

Recent Activity

upvoted a paper about 24 hours ago

EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

upvoted a paper 1 day ago

Long-Horizon-Terminal-Bench: Testing the Limits of Agents on Long-Horizon Terminal Tasks with Dense Reward-Based Grading

upvoted a paper 2 days ago

AgenticSTS: A Bounded-Memory Testbed for Long-Horizon LLM Agents

View all activity

Organizations

yrshi 's datasets

None public yet