docs: improve model card with quickstart, benchmarks, Apache-2.0 f80c9ca verified Carmenest committed on 26 days ago
Remove F16 GGUF (redundant, keep Q4_K_M + Q8_0) 1c65114 verified Carmenest committed on about 1 month ago
Update model card with inter-step cache benchmark results (v0.2.0) 9dc8866 verified Carmenest committed on Mar 22
Update benchmarks with rigorous real-prompt results (buffer 1.5x fix) deb1e7f verified Carmenest committed on Mar 19