Key Sargent
KeySargent4
AI & ML interests
None yet
Recent Activity
upvoted a paper 19 days ago
Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes upvoted a paper 19 days ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward ExtrapolationOrganizations
None yet