Yaorui SHI
yrshi
AI & ML interests
None yet
Recent Activity
upvoted a paper about 9 hours ago
Auto-Rubric as Reward: From Implicit Preferences to Explicit Multimodal Generative Criteria upvoted a paper about 12 hours ago
Rubric-based On-policy Distillation upvoted a paper 4 days ago
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning