AI & ML interests
None yet
Organizations
models 33
jyc0325/Qwen2.5-1.5B-Open-R1-Code-GRPO
Text Generation
• 2B • Updated
jyc0325/Qwen2.5-1.5B-Instruct-gccpSFT-GRPO
Text Generation
• 2B • Updated
jyc0325/Qwen2.5-1.5B-Instruct-gccpSFT
Text Generation
• 2B • Updated
• 3
jyc0325/Qwen2.5-7B-Instruct-SFT
Text Generation
• 8B • Updated
• 2
jyc0325/Qwen2.5-1.5B-Open-R1-Code-GRPOv2
Text Generation
• 2B • Updated
jyc0325/Qwen2.5-1.5B-SFT-ORPO
Text Generation
• 2B • Updated
jyc0325/Qwen2.5-1.5B-DPO-SFT-code
Text Generation
• 2B • Updated
jyc0325/Qwen2.5-1.5B-SFT-v1
Text Generation
• 2B • Updated
jyc0325/Qwen2.5-1.5B-ORPO-code-hard
Text Generation
• 2B • Updated
• 1
jyc0325/Qwen2.5-1.5B-DPO-code-hard
Text Generation
• 2B • Updated
datasets 10
Viewer
• Updated
• 35.7k • 83
Viewer
• Updated
• 35.7k • 65
jyc0325/vcpp-pref-hard-pairs
Viewer
• Updated
• 26.9k • 6
jyc0325/vcpp-pref-code-only
Viewer
• Updated
• 32.9k • 6
jyc0325/vezora-pref-code-only
Viewer
• Updated
• 52.9k • 6
jyc0325/vezora-pref-clean
Viewer
• Updated
• 54k • 6
jyc0325/verifiable-coding-problems-python-pref
Viewer
• Updated
• 32.9k • 7
jyc0325/Code-Preference-Pairs
Viewer
• Updated
• 54k • 6
jyc0325/Mixture-of-Thoughts-code-8k
Viewer
• Updated
• 25.2k • 5
jyc0325/python_decontaminated_OpenR1-Math-220k
Preview
• Updated
• 6