6 25 9

Bingxiang He

hbx

https://hbx-hbx.github.io/

AI & ML interests

NLP

Recent Activity

commentedon a paper about 11 hours ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

authored a paper 6 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

commentedon a paper 8 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

View all activity

Organizations

commented a paper about 11 hours ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 9 days ago • 85 •

authored a paper 6 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 9 days ago • 85

commented a paper 8 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 9 days ago • 85 •

upvoted a paper 8 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 9 days ago • 85

submitted a paper to Daily Papers 8 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 9 days ago • 85

commented a paper about 1 month ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 59 •

upvoted a paper about 1 month ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 59

submitted a paper to Daily Papers about 1 month ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 59

liked a model 2 months ago

openbmb/MiniCPM-SALA

Text Generation • 9B • Updated 20 days ago • 1.96k • 673

upvoted a paper 2 months ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 59

liked 2 models 3 months ago

openbmb/MiniCPM-o-4_5

Any-to-Any • 9B • Updated Mar 7 • 32.5k • 1.34k

openbmb/AgentCPM-Explore

Text Generation • 4B • Updated Jan 18 • 191 • 412

updated 2 models 4 months ago

hbx/JustRL-Nemotron-1.5B

Text Generation • 2B • Updated Dec 29, 2025 • 819 • 3

hbx/JustRL-DeepSeek-1.5B

Text Generation • 2B • Updated Dec 29, 2025 • 2.16k • 10

upvoted a collection 4 months ago

JustRL

Collection

2 items • Updated Nov 1, 2025 • 5

New activity in hbx/JustRL-Nemotron-1.5B 4 months ago

Add Hugging Face paper link badge to model card

#1 opened 4 months ago by

nielsr

New activity in hbx/JustRL-DeepSeek-1.5B 4 months ago

Improve model card: Update title, add paper link, correct license and citation

#1 opened 4 months ago by

nielsr

commented a paper 4 months ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published Dec 18, 2025 • 27 •

upvoted a paper 4 months ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published Dec 18, 2025 • 27

submitted a paper to Daily Papers 4 months ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published Dec 18, 2025 • 27

Bingxiang He

AI & ML interests

Recent Activity

Organizations

hbx's activity

Add Hugging Face paper link badge to model card

Improve model card: Update title, add paper link, correct license and citation