Hugging Face
Yikun Jiang
code1phoenix
AI & ML interests
None yet
Recent Activity
published
a model
7 days ago
code1phoenix/zamba2-2.7b-grpo-v2-length-gsm8k
published
a model
7 days ago
code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
updated
a model
10 days ago
code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
Organizations
None yet
code1phoenix's activity
published
2 models
7 days ago
code1phoenix/zamba2-2.7b-grpo-v2-length-gsm8k
Updated
10 days ago
•
9
code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
Updated
10 days ago
•
10
updated
2 models
10 days ago
code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
Updated
10 days ago
•
10
code1phoenix/zamba2-2.7b-grpo-v2-length-gsm8k
Updated
10 days ago
•
9
updated
a model
14 days ago
code1phoenix/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
14 days ago
published
a model
14 days ago
code1phoenix/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
14 days ago
updated
2 models
14 days ago
code1phoenix/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
14 days ago
•
38
code1phoenix/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
14 days ago
•
25
published
a model
14 days ago
code1phoenix/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
14 days ago
•
25
updated
a model
14 days ago
code1phoenix/ppo-pyramid
Reinforcement Learning
•
Updated
14 days ago
•
20
published
a model
14 days ago
code1phoenix/ppo-pyramid
Reinforcement Learning
•
Updated
14 days ago
•
20
updated
a model
14 days ago
code1phoenix/ppo-SnowballTarget
Reinforcement Learning
•
Updated
14 days ago
•
273
published
a model
14 days ago
code1phoenix/ppo-SnowballTarget
Reinforcement Learning
•
Updated
14 days ago
•
273
updated
a model
14 days ago
code1phoenix/pixelcopter
Reinforcement Learning
•
Updated
14 days ago
published
a model
14 days ago
code1phoenix/pixelcopter
Reinforcement Learning
•
Updated
14 days ago
updated
a model
14 days ago
code1phoenix/cartpole-1
Reinforcement Learning
•
Updated
14 days ago
updated
a model
15 days ago
code1phoenix/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
15 days ago
•
41
published
2 models
15 days ago
code1phoenix/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
15 days ago
•
41
code1phoenix/cartpole-1
Reinforcement Learning
•
Updated
14 days ago
updated
a model
15 days ago
code1phoenix/Taxi-v3
Reinforcement Learning
•
Updated
15 days ago