chriswhpang/Llama-3.2-1B-Instruct-OpenThought-SFT-GRPO-16bit Text Generation • 1B • Updated Feb 19, 2025 • 4 •
chriswhpang/Llama-3.2-1B-Instruct-OpenThought-SFT-VLLM Text Generation • 1B • Updated Feb 17, 2025 • 3 •