arxiv:2602.06540
cwt
yiye2023
AI & ML interests
None yet
Recent Activity
liked
a model 17 days ago
openbmb/MiniCPM-SALA upvoted a paper 17 days ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation