Zhikun Xu's picture

2 2 2

Zhikun Xu

JerrrrryKun

·

https://jerrrrykun.github.io/

AI & ML interests

None yet

Organizations

None yet

Papers 3

arxiv:2506.13502

arxiv:2502.10454

arxiv:2410.16235

models 25

JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-1epoch500steps

Text Generation • 8B • Updated Nov 25, 2025

JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-300steps

Text Generation • 8B • Updated Nov 24, 2025 • 2

JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-200steps

Text Generation • 8B • Updated Nov 24, 2025

JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-100steps

Text Generation • 8B • Updated Nov 24, 2025

JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-1-1430steps

Text Generation • 8B • Updated Nov 24, 2025

JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-ispass-400steps

Text Generation • 8B • Updated Oct 7, 2025 • 1

JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-200steps

Text Generation • 8B • Updated Oct 6, 2025

JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-200steps

Text Generation • 8B • Updated Oct 6, 2025 • 1

JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-100steps

Text Generation • 8B • Updated Oct 6, 2025

JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-100steps

Text Generation • 8B • Updated Oct 6, 2025

datasets 0

None public yet