NamrataThakur/Small_Language_Model_GQA_48M_Pretrained • Text Generation • Updated about 20 hours ago • 378 • 1
NamrataThakur/Small_Language_Model_MHA_53M_Pretrained • Text Generation • Updated about 20 hours ago • 415 • 1
MihaiPopa-1/Stentor-30M-Instruct-heretic-safety-defiltered • Text Generation • 30.4M • Updated 20 days ago • 24 • 2
NamrataThakur/Small_Language_Model_MOE_127M_Pretrained • Text Generation • Updated about 20 hours ago • 268 • 1