·
AI & ML interests
None yet
Organizations
None yet
mihirpd/3b_reverse_only_no_dynamic_global_step_200
mihirpd/3b_shortcut_unremoved_correct_global_step_200
mihirpd/3b_num_without_shortcut_global_step_234
mihirpd/3b_shortcut_unremoved_global_step_200
mihirpd/Qwen14B-vague-jazz-2019-global_step_205
mihirpd/Qwen7B-vague-jazz-2019-global_step_205
Updated
mihirpd/Qwen30B-Instruct-wandering-blaze-2012-global_step_100
mihirpd/quiet-grass-102-checkpoint-2000
3B • Updated mihirpd/qwen2.5-32b-eerie-horseman-1951-global_step_200
33B • Updated mihirpd/qwen2.5-7b-instruct-rare-music-1897
8B • Updated • 1
mihirpd/qwen2.5-3b-rare-glade-1739-global_step_185
3B • Updated mihirpd/qwen2.5-3b-rural-paper-1690
3B • Updated mihirpd/qwen2.5-3b-rare-glade-1739
3B • Updated mihirpd/qwen2.5_32b_verl_sft
33B • Updated • 1
mihirpd/qwen2.5_14b_verl_sft_new
15B • Updated mihirpd/qwen2.5-72b-instruct-eternal-bird-1034
73B • Updated mihirpd/qwen2.5_14b_verl_sft
Text Generation
• 15B • Updated • 3
mihirpd/qwen-32b-instruct-electric-sunset-897
33B • Updated mihirpd/qwen-32b-instruct-jumping-sea-898
33B • Updated mihirpd/qwen-32b-instruct-symbolic-translated-numeric-reverse-temp
33B • Updated mihirpd/qwen2.5_7b_verl_sft
Text Generation
• 8B • Updated • 1
mihirpd/qwen2.5-72b_valiant-yogurt-331_global-step-150
Updated
mihirpd/OpenR1-Qwen-7B-SFT
15B • Updated • 1
mihirpd/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated • 1
mihirpd/alignprop-trl-aesthetics
Text-to-Image
• Updated • 8
• 1