embedl
/

Llama-3.2-1B-Instruct-FlashHead-W4A16

text-generation-inference

compressed-tensors

Model card Files Files and versions

Llama-3.2-1B-Instruct-FlashHead-W4A16

1.6 GB

2 contributors

History: 7 commits

WilhelmT's picture

Delete files flash_head_assets/clustering_cache.pt with huggingface_hub

9f8fd4e verified 4 days ago