arxiv:2601.16725
Xiangyu
xixy
AI & ML interests
None yet
Recent Activity
new activity 6 days ago
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled: Claude distillation
authored a paper about 2 months ago
Enhancing Efficiency and Exploration in Reinforcement Learning for LLMs
Organizations
None yet