Navigate:

All ReposMask2Former

~$MASK2↑0.1%

Mask2Former: Transformer-based universal image segmentation

Unified transformer architecture for multi-task image segmentation.

LIVE RANKINGS • 01:44 PM • STEADY

OVERALL

#365

184

AI & ML

#99

30 DAY RANKING TREND

ovr#365

·AI#99

STARS

3.3K

FORKS

500

7D STARS

7D FORKS

Tags:

AI & ML

See Repo:

Learn more about Mask2Former

Mask2Former is a computer vision model that performs image segmentation using transformer-based architecture with masked attention mechanisms. The system processes images through a backbone encoder and applies attention operations constrained by learned masks to generate segmentation outputs. It handles three segmentation task types (panoptic, instance, and semantic) through a single unified model architecture rather than task-specific variants. The codebase supports training and inference on major segmentation benchmarks including ADE20K, Cityscapes, COCO, and Mapillary Vistas, with additional support for video instance segmentation.

Unified multi-task architecture

A single model handles panoptic, instance, and semantic segmentation without task-specific modifications. This contrasts with prior approaches that typically required separate models or significant architectural changes per task.

Masked attention mechanism

The transformer uses learned masks to constrain attention operations, reducing computational overhead compared to full attention while maintaining segmentation quality. This design choice improves efficiency during both training and inference.

Multi-dataset support

The framework includes implementations for multiple major segmentation datasets and benchmarks, with pre-trained models available in the Model Zoo. Video instance segmentation is also supported through an accompanying technical report.

Top in AI & ML

Trending Repos

Pi Mono

17,222#1

OpenClaw

233,443#2

Zvec

8,089#3

Claude Code

70,649#4

Heretic

9,761#5

See all →

LIVE RANKINGS • 01:44 PM • STEADY

OVERALL

#365

184

AI & ML

#99

30 DAY RANKING TREND

ovr#365

·AI#99

STARS

3.3K

FORKS

500

7D STARS

7D FORKS

[ EXPLORE MORE ]

Related Repositories

Discover similar tools and frameworks used by developers

Mask2Former: Transformer-based universal image segmentation

Learn more about Mask2Former

What is Mask2Former for?

What makes Mask2Former different?

Unified multi-task architecture

Masked attention mechanism

Multi-dataset support

Top in AI & ML

Pi Mono

OpenClaw

Claude Code

Heretic

Rowboat

Trending Repos

Pi Mono

OpenClaw

Zvec

Claude Code

Heretic

Related Repositories

ALLWEONE Presentation AI

YOLOv7

whisper.cpp

Claude Code

ComfyUI-Manager

Product

Company

Helpful Links