arxiv:2506.13502
Zhikun Xu
JerrrrryKun
AI & ML interests: None yet
Organizations: None yet
models: 25
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-1epoch500steps • Text Generation • 8B
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-300steps • Text Generation • 8B • 2
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-200steps • Text Generation • 8B
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-100steps • Text Generation • 8B
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-1-1430steps • Text Generation • 8B
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-ispass-400steps • Text Generation • 8B • 1
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-200steps • Text Generation • 8B
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-200steps • Text Generation • 8B • 1
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-100steps • Text Generation • 8B
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-100steps • Text Generation • 8B
datasets: 0 (none public yet)