Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Ach0
/
GCPO-R1-1.5B
like
0
Text Generation
Safetensors
English
qwen2
GRPO
DAPO
GCPO
RL
RLVR
conversational
arxiv:
2510.07790
License:
mit
Model card
Files
Files and versions
xet
Community
main
GCPO-R1-1.5B
Commit History
Update README.md
4c6b181
verified
Ach0
commited on
Oct 11, 2025
Create README.md
63912d6
verified
Ach0
commited on
Oct 11, 2025
Upload folder using huggingface_hub
2d79974
verified
Ach0
commited on
Oct 11, 2025
initial commit
82e79ba
verified
Ach0
commited on
Oct 11, 2025