Mellum-4b-base is JetBrains’ first open-source large language model designed and optimized for code-related tasks. Built with 4 billion parameters and a LLaMA-style architecture, it was trained on over 4.2 trillion tokens across multiple programming languages, including datasets such as The Stack, StarCoder, and CommitPack. With a context window of 8,192 tokens, it excels at code completion, fill-in-the-middle tasks, and intelligent code suggestions for professional developer tools and IDEs. The model is efficient for both cloud inference with vLLM and local deployment using llama.cpp or Ollama, thanks to its bf16 precision and AMP training. While the base model is not fine-tuned for downstream tasks, it is designed to be easily adapted through supervised fine-tuning (SFT) or reinforcement learning (RL). Benchmarks on RepoBench, SAFIM, and HumanEval demonstrate its competitive performance, with specialized fine-tuned versions for Python already showing strong improvements.

Features

  • 4B parameter LLaMA-style architecture optimized for coding tasks
  • Trained on 4.2T tokens from The Stack, StarCoder, CommitPack, and Wikipedia
  • 8,192-token context window for handling larger codebases
  • Efficient for both cloud inference (vLLM) and local use (llama.cpp, Ollama)
  • Base model with support for SFT and RL fine-tuning for specific applications
  • Strong benchmark results on RepoBench, SAFIM, and HumanEval tasks
  • Includes Python SFT variant with superior performance over the base model
  • Licensed under Apache 2.0 for open and flexible use

Project Samples

Project Activity

See All Activity >

Categories

AI Models

Follow Mellum-4b-base

Mellum-4b-base Web Site

You Might Also Like
MongoDB Atlas runs apps anywhere Icon
MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Mellum-4b-base!

Additional Project Details

Programming Language

Python

Related Categories

Python AI Models

Registered

2025-09-11