Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 24 items • Updated 2 days ago • 90
huihui-ai/Huihui-GLM-4.7-Flash-abliterated-mlx-4bit Text Generation • 30B • Updated 3 days ago • 543 • 6