sagnikM/grpo_sgd_qwen3-8b_3k_seqlen_momentum_0p9_1e-2 • Text Generation • 8B • Updated 13 days ago • 185
sagnikM/grpo_sgd_llama3p1_8b_3k-seqlen_momentum_0p9_1e-3 • Text Generation • 8B • Updated 15 days ago • 178
sagnikM/grpo_sgd_qwen3_1p7b_3k-seqlen_momentum_0p9_1e-2 • Text Generation • 2B • Updated 15 days ago • 140
sagnikM/grpo_sgd_qwen3_1p7b_3k-seqlen_momentum_0p9_1e-1 • Text Generation • 2B • Updated 15 days ago • 110