Derek Li

movefast

AI & ML interests

RL, Planning

Recent Activity

updated a dataset 13 days ago

movefast/s1k-processed-clean

published a dataset 13 days ago

movefast/s1k-processed-clean

updated a dataset 13 days ago

movefast/s1k-processed

Organizations

None yet

updated a dataset 13 days ago

movefast/s1k-processed-clean

Viewer • Updated 13 days ago • 998 • 139

published a dataset 13 days ago

movefast/s1k-processed-clean

Viewer • Updated 13 days ago • 998 • 139

updated a dataset 13 days ago

movefast/s1k-processed

Viewer • Updated 13 days ago • 998 • 7

published a dataset 13 days ago

movefast/s1k-processed

Viewer • Updated 13 days ago • 998 • 7

upvoted an article 2 months ago

Article

A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons

Feb 4, 2025

updated a dataset 2 months ago

movefast/Bespoke-Stratos-17k-iter-2

Viewer • Updated Nov 26, 2025 • 16.7k • 11

published a dataset 3 months ago

movefast/Bespoke-Stratos-17k-iter-2

Viewer • Updated Nov 26, 2025 • 16.7k • 11

updated a model 3 months ago

movefast/iter_1

Text Generation • 8B • Updated Nov 23, 2025 • 1

published a model 3 months ago

movefast/iter_1

Text Generation • 8B • Updated Nov 23, 2025 • 1

updated a model 4 months ago

movefast/Qwen2.5-1.5B-Open-R1-GRPO

2B • Updated Oct 7, 2025

published a model 4 months ago

movefast/Qwen2.5-1.5B-Open-R1-GRPO

2B • Updated Oct 7, 2025

updated a dataset 6 months ago

movefast/orz_omni_math_8k_w_task_type

Viewer • Updated Aug 24, 2025 • 8.07k • 5

published a dataset 6 months ago

movefast/orz_omni_math_8k_w_task_type

Viewer • Updated Aug 24, 2025 • 8.07k • 5

upvoted a paper 6 months ago

Omni-Thinker: Scaling Cross-Domain Generalization in LLMs via Multi-Task RL with Hybrid Rewards

Paper • 2507.14783 • Published Jul 20, 2025 • 4

updated a model 7 months ago

movefast/qwen3_8b_orm_step_20

8B • Updated Jul 18, 2025

published a model 7 months ago

movefast/qwen3_8b_orm_step_20

8B • Updated Jul 18, 2025

updated a model 7 months ago

movefast/qwen3_8b_orm_step_35

8B • Updated Jul 18, 2025

published a model 7 months ago

movefast/qwen3_8b_orm_step_35

8B • Updated Jul 18, 2025

updated a dataset 7 months ago

movefast/orz_omni_math_8k

Viewer • Updated Jul 9, 2025 • 8.07k • 5

published a dataset 7 months ago

movefast/orz_omni_math_8k

Viewer • Updated Jul 9, 2025 • 8.07k • 5
