MobileLLM 60M (Replication)

This is a replication of the MobileLLM 60M model, based on the paper "MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases".

  • Model Size: 60M parameters
  • Architecture: Llama-based (Deep & Thin)
  • Status: Research / Testing
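
A minimal loading sketch is shown below. It assumes the checkpoint is published under the repository id Yangyang1205/MobileLLM-60M and is compatible with the standard Hugging Face transformers auto classes; the actual tokenizer configuration and generation settings for this replication may differ.

```python
# Sketch: load the replicated MobileLLM 60M checkpoint and generate a short
# completion. Assumes the repo id below and AutoModel compatibility.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Yangyang1205/MobileLLM-60M"  # assumed repository id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

prompt = "On-device language models are"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```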