This collection includes the models used in the paper "Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recip
liyaxuan
lllyx
AI & ML interests
None yet
Recent Activity
upvoted a paper about 17 hours ago
Near-Future Policy Optimization
updated a model 8 days ago
lllyx/Qwen3-1.7B-SFT
updated a collection 8 days ago
Rethinking OPD
Organizations
None yet