Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Yikun Jiang
code1phoenix
Follow
AI & ML interests
None yet
Recent Activity
published
a model
9 days ago
code1phoenix/zamba2-2.7b-grpo-v2-length-gsm8k
published
a model
9 days ago
code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
updated
a model
12 days ago
code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
View all activity
Organizations
None yet
code1phoenix
's models
15
Sort: Recently updated
code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
Updated
12 days ago
•
10
code1phoenix/zamba2-2.7b-grpo-v2-length-gsm8k
Updated
13 days ago
•
9
code1phoenix/pixelcopter
Reinforcement Learning
•
Updated
16 days ago
code1phoenix/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
16 days ago
code1phoenix/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
16 days ago
•
38
code1phoenix/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
16 days ago
•
25
code1phoenix/ppo-pyramid
Reinforcement Learning
•
Updated
16 days ago
•
20
code1phoenix/ppo-SnowballTarget
Reinforcement Learning
•
Updated
17 days ago
•
273
code1phoenix/cartpole-1
Reinforcement Learning
•
Updated
17 days ago
code1phoenix/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
17 days ago
•
41
code1phoenix/Taxi-v3
Reinforcement Learning
•
Updated
17 days ago
code1phoenix/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
17 days ago
code1phoenix/ppo-Huggy
Reinforcement Learning
•
Updated
18 days ago
•
59
code1phoenix/ppo-LunarLander-v3
Reinforcement Learning
•
Updated
18 days ago
•
19
code1phoenix/NanoChat
Updated
Dec 29, 2025