MobileLLM is a lightweight large language model (LLM) framework developed by Facebook Research, optimized for on-device deployment where computational and memory efficiency are critical. Introduced in the ICML 2024 paper “MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases”, it focuses on delivering strong reasoning and generalization capabilities in models under one billion parameters. The framework integrates several architectural innovations—SwiGLU activation, deep and thin network design, embedding sharing, and grouped-query attention (GQA)—to achieve a superior trade-off between model size, inference speed, and accuracy. MobileLLM demonstrates remarkable performance, with the 125M and 350M variants outperforming previous state-of-the-art models of the same scale by up to 4.3% on zero-shot commonsense reasoning tasks.

Features

  • Optimized transformer architecture for sub-billion parameter LLMs
  • Combines SwiGLU activation, embedding sharing, and grouped-query attention
  • Supports distributed multi-node pretraining with PyTorch ≥ 2.0
  • Delivers state-of-the-art zero-shot reasoning results across multiple tasks
  • Includes reproducible training and evaluation pipelines for multiple model sizes
  • Scalable design philosophy extending from 125M to 1.5B parameters

Project Samples

Project Activity

See All Activity >

License

Fair License

Follow MobileLLM

MobileLLM Web Site

You Might Also Like
Gen AI apps are built with MongoDB Atlas Icon
Gen AI apps are built with MongoDB Atlas

Build gen AI apps with an all-in-one modern database: MongoDB Atlas

MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of MobileLLM!

Additional Project Details

Operating Systems

Linux

Programming Language

Python, Unix Shell

Related Categories

Unix Shell Large Language Models (LLM), Python Large Language Models (LLM)

Registered

2025-10-08