LFM2.5 1.2B Thinking WebGPU: Run LFM2.5-1.2B-Thinking directly in your browser on WebGPU • 92
Article • Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI • 3 days ago • 47
Article • From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels • Aug 18, 2025 • 95
Mamba-3: Improved Sequence Modeling using State Space Principles Paper • 2603.15569 • Published 4 days ago • 4