← all datasets

ADE20K

Canonical

51papers using it

2016first seen

Papers using ADE20K (43)

Masked-attention Mask Transformer For Universal Image Segmentation2021 · 2,664 cites

Context Encoding For Semantic Segmentation2018 · 1,460 cites

Semantic Understanding Of Scenes Through The ADE20K Dataset2016 · 1,348 cites

Segnext: Rethinking Convolutional Attention Design For Semantic Segmentation2022 · 483 cites

Topformer: Token Pyramid Transformer For Mobile Semantic Segmentation2022 · 315 cites

Multi-scale High-resolution Vision Transformer For Semantic Segmentation2021 · 238 cites

K-net: Towards Unified Image Segmentation2021 · 197 cites

Seaformer++: Squeeze-enhanced Axial Transformer For Mobile Visual Recognition2023 · 82 cites

Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition2022 · 73 cites

You Only Segment Once: Towards Real-time Panoptic Segmentation2023 · 60 cites

MVP: Multimodality-guided Visual Pre-training2022 · 56 cites

Dsnet: A Novel Way To Use Atrous Convolutions In Semantic Segmentation2024 · 55 cites

Semantic Segmentation Via Highly Fused Convolutional Network With Multiple Soft Cost Functions2018 · 37 cites

Content-aware Token Sharing For Efficient Semantic Segmentation With Vision Transformers2023 · 31 cites

In Defense Of Lazy Visual Grounding For Open-vocabulary Semantic Segmentation2024 · 15 cites

A Unified View of Masked Image Modeling2022 · 14 cites

Rest V2: Simpler, Faster And Stronger2022 · 14 cites

Full Contextual Attention For Multi-resolution Transformers In Semantic Segmentation2022 · 12 cites

Decoder Denoising Pretraining for Semantic Segmentation2022 · 10 cites

Understanding Gaussian Attention Bias of Vision Transformers Using Effective Receptive Fields2023 · 7 cites

Remax: Relaxing For Better Training On Efficient Panoptic Segmentation2023 · 7 cites

Feature Selective Transformer for Semantic Image Segmentation2022 · 6 cites

Skip-attention: Improving Vision Transformers By Paying Less Attention2023 · 6 cites

Incepformer: Efficient Inception Transformer With Pyramid Pooling For Semantic Segmentation2022 · 5 cites

HCFormer: Unified Image Segmentation with Hierarchical Clustering2022 · 3 cites

Diffusion For Out-of-distribution Detection On Road Scenes And Beyond2024 · 3 cites

Enhancing Transformer-based Vision Models: Addressing Feature Map Anomalies Through Novel Optimization Strategies2025 · 3 cites

Low-Resolution Self-Attention for Semantic Segmentation2023 · 2 cites

Token Cropr: Faster Vits For Quite A Few Tasks2024 · 2 cites

Dmformer: Closing The Gap Between CNN And Vision Transformers2022 · 2 cites

SOS: Segment Object System For Open-world Instance Segmentation With Object Priors2024 · 2 cites

A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting2024 · 1 cites

PAUMER: Patch Pausing Transformer For Semantic Segmentation2023 · 1 cites

Structtoken : Rethinking Semantic Segmentation With Structural Prior2022 · 1 cites

Transformer Scale Gate for Semantic Segmentation2022

Seeing Through Clutter: Structured 3D Scene Reconstruction via Iterative Object Removal2026

Locality-Attending Vision Transformer2026

Exploring Open-Vocabulary Object Recognition in Images using CLIP2026

ARTA: Adaptive Mixed-resolution Token Allocation For Efficient Dense Feature Extraction2026

Mambavision: A Hybrid Mamba-transformer Vision Backbone2024

Cross-domain Semantic Segmentation With Large Language Model-assisted Descriptor Generation2025

Spiralmlp: A Lightweight Vision MLP Architecture2024

PNM: Pixel Null Model For General Image Segmentation2022

ADE20K — datasets — computer-vision