CL Yu

clyu

AI & ML interests

None yet

Recent Activity

liked a dataset 6 days ago

nvidia/Nemotron-Pretraining-Specialized-v1.1

Viewer • Updated 14 days ago • 19.8M • 2.8k • 29

liked a dataset 7 days ago

stepfun-ai/Step-3.5-Flash-SFT

Viewer • Updated 11 days ago • 1.62M • 40.5k • 281

liked a model about 2 months ago

Qwen/Qwen3-Coder-Next

Text Generation • 80B • Updated Feb 3 • 1.25M • 1.18k

submitted a paper to Daily Papers about 2 months ago

Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training

Paper • 2602.05933 • Published Feb 5 • 5

upvoted an article about 2 months ago

We Got Claude to Build CUDA Kernels and teach open models!

Article • Jan 28 • 150

upvoted a paper about 2 months ago

MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods

Paper • 2601.21821 • Published Jan 29 • 61

liked a dataset 2 months ago

MiniMaxAI/OctoCodingBench

Viewer • Updated Jan 13 • 72 • 411 • 263

updated a model 3 months ago

clyu/clip0.28_clipl0.2_vanilla_bsz512_mb128

Updated Dec 17, 2025

published a model 3 months ago

clyu/clip0.28_clipl0.2_vanilla_bsz512_mb128

Updated Dec 17, 2025

updated a model 3 months ago

clyu/cliph4_clipl0.5_cumloss_bsz512_mb128

Updated Dec 17, 2025

published a model 3 months ago

clyu/cliph4_clipl0.5_cumloss_bsz512_mb128

Updated Dec 17, 2025

liked a model 4 months ago

Salesforce/xRouter

Text Generation • 8B • Updated Nov 4, 2025 • 23 • 14

updated a model 4 months ago

clyu/qwen3_14b_rstar_sft_step802

15B • Updated Nov 17, 2025 • 3

published a model 4 months ago

clyu/qwen3_14b_rstar_sft_step802

15B • Updated Nov 17, 2025 • 3

liked 2 datasets 5 months ago

microsoft/rStar-Coder

Viewer • Updated Jul 20, 2025 • 1.86M • 5.97k • 234

zhenghaoxu/R2E-Gym-Lite-with-Difficulty

Viewer • Updated Sep 19, 2025 • 6.24k • 71 • 4

upvoted 3 papers 5 months ago

LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts

Paper • 2510.19363 • Published Oct 22, 2025 • 63

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 85

AlphaQuanter: An End-to-End Tool-Orchestrated Agentic Reinforcement Learning Framework for Stock Trading

Paper • 2510.14264 • Published Oct 16, 2025 • 10

commented a paper 5 months ago

AlphaQuanter: An End-to-End Tool-Orchestrated Agentic Reinforcement Learning Framework for Stock Trading

Paper • 2510.14264 • Published Oct 16, 2025 • 10