SonicMoE: Accelerating MoE with IO and Tile-aware Optimizations Paper • 2512.14080 • Published 11 days ago • 5
Article The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator 10 days ago • 32
Deepseek V3 (All Versions) Collection Deepseek-V3-0324 and V3, available in original and Dynamic GGUF formats, with support for 2- to 8-bit quantized versions. • 7 items • Updated 3 days ago • 39