3 113 50

Maozhou Ge

Gmc2

GHGmc2

AI & ML interests

None yet

Recent Activity

liked a Space about 12 hours ago

google/model-explorer

liked a model 3 days ago

ggml-org/gemma-4-E2B-it-GGUF

liked a model 3 days ago

google/gemma-4-E2B-it

View all activity

Organizations

None yet

liked a Space about 12 hours ago

Model Explorer

👓

Explore and visualize machine learning models

liked 2 models 3 days ago

ggml-org/gemma-4-E2B-it-GGUF

5B • Updated 7 days ago • 44.7k • 50

google/gemma-4-E2B-it

Any-to-Any • 5B • Updated 7 days ago • 515k • 374

upvoted an article 23 days ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

about 1 month ago

•

121

upvoted a paper 25 days ago

The Art of Scaling Reinforcement Learning Compute for LLMs

Paper • 2510.13786 • Published Oct 15, 2025 • 33

upvoted a collection about 1 month ago

Deepseek Papers

Collection

Deepseek papers collection • 31 items • Updated 4 days ago • 338

liked 2 models about 2 months ago

zai-org/GLM-5

Text Generation • 754B • Updated 5 days ago • 379k • • 1.97k

Qwen/Qwen3.5-397B-A17B

Image-Text-to-Text • 403B • Updated 26 days ago • 908k • • 1.43k

liked a Space about 2 months ago

Sparsity Viz

📉

Explore MoE model sparsity across many LLMs

upvoted an article about 2 months ago

Article

Visualize and understand GPU memory in PyTorch

Dec 24, 2024

•

269

liked a Space 3 months ago

Megatron Memory Estimator

👁

Estimate GPU memory usage for Megatron models

upvoted an article 3 months ago

Article

Introduction to ggml

Aug 13, 2024

•

278

upvoted a paper 3 months ago

Hyper-Connections

Paper • 2409.19606 • Published Sep 29, 2024 • 26

upvoted a paper 4 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 106

liked a model 4 months ago

deepseek-ai/DeepSeek-V3.2

Text Generation • 685B • Updated Dec 1, 2025 • 1.2M • 1.38k

upvoted a collection 5 months ago

LLaDA 2.0

Collection

7 items • Updated 16 days ago • 41

upvoted an article 5 months ago

Article

Finetune Stable Diffusion Models with DDPO via TRL

Sep 29, 2023

•

liked a model 5 months ago

moonshotai/Kimi-K2-Thinking

Text Generation • 1.1T • Updated Jan 30 • 58.9k • • 1.69k

liked a Space 5 months ago

The Smol Training Playbook

📚

3.1k

The secrets to building world-class LLMs

upvoted a collection 5 months ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 43 items • Updated Mar 2 • 710

Maozhou Ge

AI & ML interests

Recent Activity

Organizations

Gmc2's activity

Model Explorer

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Sparsity Viz

Visualize and understand GPU memory in PyTorch

Megatron Memory Estimator

Introduction to ggml

Finetune Stable Diffusion Models with DDPO via TRL

The Smol Training Playbook