MaskFormer

MaskFormer is an image segmentation framework developed by Facebook Research that handles semantic, instance, and panoptic segmentation within a single architecture. Instead of classifying every pixel independently, as traditional semantic segmentation pipelines do, MaskFormer reformulates segmentation as a mask classification problem: the model predicts a set of binary masks, each paired with a single class label, so the same model, loss, and training procedure apply across segmentation tasks. Built on top of Detectron2, it supports datasets including ADE20K, Cityscapes, COCO-Stuff, and Mapillary Vistas, and provides pretrained baselines for each. Its successor, Mask2Former, extends the same meta-architecture and reported state-of-the-art results on semantic, instance, and panoptic segmentation benchmarks at the time of its release. MaskFormer's modular design, dataset integration, and compatibility with existing Detectron2 models make it a practical base for segmentation research.
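The mask classification formulation can be illustrated with a small NumPy sketch. It is not the project's code: the function names, array shapes, and toy values are assumptions made for illustration. It assumes a model has already produced, for each of Q queries, class logits over K classes plus a "no object" slot, and one mask logit map per query. A per-pixel semantic map is then recovered by weighting each mask with its class probabilities and summing over queries.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def semantic_inference(class_logits, mask_logits):
    """Combine per-query predictions into a per-pixel label map.

    class_logits: (Q, K + 1) array; the last column is assumed to be "no object".
    mask_logits:  (Q, H, W) array of per-query mask logits.
    Returns an (H, W) array of class indices in [0, K).
    """
    # Class probabilities per query, dropping the "no object" column.
    class_probs = softmax(class_logits, axis=-1)[:, :-1]   # (Q, K)
    # Independent per-pixel mask probabilities per query.
    mask_probs = sigmoid(mask_logits)                      # (Q, H, W)
    # Sum over queries of class probability times mask probability.
    scores = np.einsum("qc,qhw->chw", class_probs, mask_probs)  # (K, H, W)
    return scores.argmax(axis=0)

# Toy input (hypothetical values): 2 queries, 2 classes, a 2x4 image.
class_logits = np.array([[5.0, 0.0, 0.0],    # query 0 favors class 0
                         [0.0, 5.0, 0.0]])   # query 1 favors class 1
mask_logits = np.full((2, 2, 4), -10.0)
mask_logits[0, :, :2] = 10.0   # query 0 covers the left half
mask_logits[1, :, 2:] = 10.0   # query 1 covers the right half

labels = semantic_inference(class_logits, mask_logits)
print(labels)
```

In this toy case the left half of the image is assigned class 0 and the right half class 1. Because each query carries its own mask, the same pair of outputs can also be read as separate segments rather than merged per class, which is what lets one formulation serve instance-level tasks as well.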

Features

Unified architecture for semantic, instance, and panoptic segmentation
Built on Detectron2 and compatible with its existing models and datasets
Supports ADE20K, Cityscapes, COCO-Stuff, and Mapillary Vistas datasets
Reformulates segmentation as a mask classification task for efficiency
Includes pretrained baselines and a comprehensive model zoo
Foundation for Mask2Former, which reported state-of-the-art segmentation results

License

Creative Commons Attribution License

Additional Project Details

Operating Systems

Linux, Mac

Programming Language

Python

Related Categories

Python AI Models

Registered

2025-10-08

Per-Pixel Classification is Not All You Need for Semantic Segmentation
