6 15 186

Jian Hu

chuyi777

https://hujian.website

hijkzzz

AI & ML interests

Reinforcement Learning

Recent Activity

updated a model 2 months ago

OpenRLHF/Llama-3-8b-rm-700k

upvoted a paper 3 months ago

PhyCritic: Multimodal Critic Models for Physical AI

updated a dataset 3 months ago

OpenRLHF/aime-2024

View all activity

Organizations

updated a model 2 months ago

OpenRLHF/Llama-3-8b-rm-700k

Text Ranking • 8B • Updated Mar 16 • 1.07k • 3

upvoted a paper 3 months ago

PhyCritic: Multimodal Critic Models for Physical AI

Paper • 2602.11124 • Published Feb 11 • 55

updated 2 datasets 3 months ago

OpenRLHF/aime-2024

Viewer • Updated Feb 6 • 30 • 662

OpenRLHF/dapo-math-17k

Viewer • Updated Feb 6 • 17.4k • 200

authored a paper 4 months ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 111

published 2 datasets 4 months ago

OpenRLHF/aime-2024

Viewer • Updated Feb 6 • 30 • 662

OpenRLHF/dapo-math-17k

Viewer • Updated Feb 6 • 17.4k • 200

upvoted a paper 4 months ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 111

upvoted a paper 7 months ago

DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

Paper • 2510.15110 • Published Oct 16, 2025 • 18

upvoted a paper 8 months ago

BroRL: Scaling Reinforcement Learning via Broadened Exploration

Paper • 2510.01180 • Published Oct 1, 2025 • 20

liked 2 models 9 months ago

moonshotai/Kimi-K2-Instruct-0905

Text Generation • 1T • Updated Jan 30 • 1.83M • • 711

nvidia/NVIDIA-Nemotron-Nano-12B-v2

Text Generation • Updated Nov 25, 2025 • 14.8k • • 164

updated a dataset 9 months ago

OpenRLHF/gem_guess_game

Viewer • Updated Aug 30, 2025 • 2.05k • 12 • 1

published a dataset 9 months ago

OpenRLHF/gem_guess_game

Viewer • Updated Aug 30, 2025 • 2.05k • 12 • 1

New activity in nvidia/NVIDIA-Nemotron-Nano-9B-v2 9 months ago

some problem when I asked the model: 你是谁？

🤯 2

#8 opened 9 months ago by

wenzel94

upvoted a paper 9 months ago

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

Paper • 2508.08221 • Published Aug 11, 2025 • 50

liked a model 9 months ago

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26, 2025 • 7.71M • • 4.62k

liked a model 10 months ago

mistralai/Devstral-Small-2505

24B • Updated Aug 18, 2025 • 18.3k • 869

liked 2 datasets 10 months ago

MegaScience/MegaScience

Viewer • Updated Jul 24, 2025 • 1.25M • 15.1k • 132

newfacade/LeetCodeDataset

Viewer • Updated May 29, 2025 • 2.87k • 2.04k • 65

Jian Hu

AI & ML interests

Recent Activity

Organizations

chuyi777's activity

some problem when I asked the model: 你是谁？