JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-1epoch500steps Text Generation • 8B • Updated 17 days ago • 18
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-300steps Text Generation • 8B • Updated 18 days ago • 15
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-200steps Text Generation • 8B • Updated 18 days ago • 28
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-100steps Text Generation • 8B • Updated 18 days ago • 18
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-1-1430steps Text Generation • 8B • Updated 18 days ago • 38
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-ispass-400steps Text Generation • 8B • Updated Oct 7 • 8
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-200steps Text Generation • 8B • Updated Oct 6 • 6
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-200steps Text Generation • 8B • Updated Oct 6 • 5
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-100steps Text Generation • 8B • Updated Oct 6 • 7
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-100steps Text Generation • 8B • Updated Oct 6 • 6
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-NoPerturbationData-NoPL-400steps Text Generation • 8B • Updated Oct 6 • 6
JerrrrryKun/OLMo-2-1124-7B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-100steps Text Generation • 7B • Updated Oct 6 • 7
JerrrrryKun/OLMo-2-1124-7B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-100steps Text Generation • 7B • Updated Oct 6 • 5
JerrrrryKun/SmolLM3-3B-LLM4Math-V2data-Sequential-perturbationsignalonly-300steps Text Generation • 3B • Updated Oct 5 • 6
JerrrrryKun/DeepMath-1.5B-LLM4Math-V2data-Sequential-vanillaRL-400steps Text Generation • 2B • Updated Oct 5 • 5
JerrrrryKun/DeepMath-1.5B-LLM4Math-V2data-Sequential-perturbationsignalonly-400steps Text Generation • 2B • Updated Oct 5 • 6
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-400steps Text Generation • 8B • Updated Oct 4 • 5
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-400steps Text Generation • 8B • Updated Oct 4 • 7
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-NoPerturbationData-NoPL-100steps Text Generation • 8B • Updated Oct 2 • 6
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-NoPerturbationData-100steps Text Generation • 8B • Updated Oct 2 • 6
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-300steps Text Generation • 8B • Updated Oct 2 • 5
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-NoPreconditionLabel-300steps Text Generation • 8B • Updated Oct 1 • 6
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-400steps Text Generation • 8B • Updated Sep 29 • 4
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-LLM4Math-V2data-250steps Text Generation • 8B • Updated Sep 27 • 4