RuiyangSi's picture

8 1

RuiyangSi

RuiyangSi

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

upvoted a paper about 1 month ago

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks

upvoted a paper about 1 month ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

View all activity

Organizations

None yet

upvoted a paper 3 days ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published 6 days ago • 149

upvoted 2 papers about 1 month ago

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks

Paper • 2604.08865 • Published Apr 10 • 29

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 95

upvoted a paper 4 months ago

Shaping capabilities with token-level data filtering

Paper • 2601.21571 • Published Jan 29 • 29

upvoted 2 papers 6 months ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 242

upvoted a paper 7 months ago

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Paper • 2510.08540 • Published Oct 9, 2025 • 110

upvoted a collection about 1 year ago

Open-Sora

A Series of Open-Sora Models • 11 items • Updated Feb 21, 2025 • 10