MobileLLM 80M (Replication)

Model Size: 80M parameters
Architecture: Llama-based (Deep &amp; Thin)
Status: Research / Testing

This is a replicated version of the MobileLLM 80M model, based on the paper MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases.