Awesome Segmentation
Segmentation is one of the most active areas in Awesome Computer Vision, with 2,014 papers in this collection evaluated on datasets such as COCO, Cityscapes, and ImageNet. A strong starting point is "Dual Attention Network For Scene Segmentation".
Datasets & benchmarks
Key papers
- Dual Attention Network For Scene Segmentation (2018). Jun Fu, Jing Liu, Haijie Tian, et al. 34.87
- Pyramid Vision Transformer: A Versatile Backbone For Dense Prediction Without Convolutions (2021). Wenhai Wang, Enze Xie, Xiang Li, et al. 33.76
- Vision Transformers For Dense Prediction (2021). René Ranftl, Alexey Bochkovskiy, Vladlen Koltun. 31.27
- PointRend: Image Segmentation As Rendering (2019). Alexander Kirillov, Yuxin Wu, Kaiming He, et al. 31.23
- Unified Perceptual Parsing For Scene Understanding (2018). Tete Xiao, Yingcheng Liu, Bolei Zhou, et al. 29.22
- Real-time Scene Text Detection With Differentiable Binarization (2019). Minghui Liao, Zhaoyi Wan, Cong Yao, et al. 28.03
- Res2Net: A New Multi-scale Backbone Architecture (2019). Shang-Hua Gao, Ming-Ming Cheng, Kai Zhao, et al. 25.82
- Masked-attention Mask Transformer For Universal Image Segmentation (2021). Bowen Cheng, Ishan Misra, Alexander G. Schwing, et al. 25.69
- Prior Guided Feature Enrichment Network For Few-shot Segmentation (2020). Zhuotao Tian, Hengshuang Zhao, Michelle Shu, et al. 25.54
- Involution: Inverting The Inherence Of Convolution For Visual Recognition (2021). Duo Li, Jie Hu, Changhu Wang, et al. 25.47
- CCNet: Criss-cross Attention For Semantic Segmentation (2018). Zilong Huang, Xinggang Wang, Yunchao Wei, et al. 25.44
- SCAN: Learning To Classify Images Without Labels (2020). Wouter van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, et al. 25.15
- Rethinking RGB-D Salient Object Detection: Models, Data Sets, And Large-scale Benchmarks (2019). Deng-Ping Fan, Zheng Lin, Jia-Xing Zhao, et al. 24.88
- UPSNet: A Unified Panoptic Segmentation Network (2019). Yuwen Xiong, Renjie Liao, Hengshuang Zhao, et al. 24.62
- Structure-measure: A New Way To Evaluate Foreground Maps (2017). Deng-Ping Fan, Ming-Ming Cheng, Yun Liu, et al. 24.07
- ACNet: Attention Based Network To Exploit Complementary Features For RGBD Semantic Segmentation (2019). Xinxin Hu, Kailun Yang, Lei Fei, et al. 23.95
- Semantic Understanding Of Scenes Through The ADE20K Dataset (2016). Bolei Zhou, Hang Zhao, Xavier Puig, et al. 23.48
- Fine-grained Visual Classification Via Progressive Multi-granularity Training Of Jigsaw Patches (2020). Ruoyi Du, Dongliang Chang, Ayan Kumar Bhunia, et al. 23.39
- Panoptic Feature Pyramid Networks (2019). Alexander Kirillov, Ross Girshick, Kaiming He, et al. 22.98
- Bottleneck Transformers For Visual Recognition (2021). Aravind Srinivas, Tsung-Yi Lin, Niki Parmar, et al. 22.72
- Weakly-supervised Salient Object Detection Via Scribble Annotations (2020). Jing Zhang, Xin Yu, Aixuan Li, et al. 22.59
- Multi-scale Vision Longformer: A New Vision Transformer For High-resolution Image Encoding (2021). Pengchuan Zhang, Xiyang Dai, Jianwei Yang, et al. 22.55
- EGNet: Edge Guidance Network For Salient Object Detection (2019). Jia-Xing Zhao, Jiangjiang Liu, Deng-Ping Fan, et al. 22.41
- Simple Copy-paste Is A Strong Data Augmentation Method For Instance Segmentation (2020). Golnaz Ghiasi, Yin Cui, Aravind Srinivas, et al. 22.19
- Mask TextSpotter v3: Segmentation Proposal Network For Robust Scene Text Spotting (2020). Minghui Liao, Guan Pang, Jing Huang, et al. 22.16
- Distribution Alignment: A Unified Framework For Long-tail Visual Recognition (2021). Songyang Zhang, Zeming Li, Shipeng Yan, et al. 22.08
- DetectoRS: Detecting Objects With Recursive Feature Pyramid And Switchable Atrous Convolution (2020). Siyuan Qiao, Liang-Chieh Chen, Alan Yuille. 22.04
- Collaborative Video Object Segmentation By Foreground-background Integration (2020). Zongxin Yang, Yunchao Wei, Yi Yang. 22.02
- RANet: Ranking Attention Network For Fast Video Object Segmentation (2019). Ziqin Wang, Jun Xu, Li Liu, et al. 21.97
- Hierarchical Dynamic Filtering Network For RGB-D Salient Object Detection (2020). Youwei Pang, Lihe Zhang, Xiaoqi Zhao, et al. 21.50
- Cross-X Learning For Fine-grained Visual Categorization (2019). Wei Luo, Xitong Yang, Xianjie Mo, et al. 21.36
- EDN: Salient Object Detection Via Extremely-downsampled Network (2020). Yu-Huan Wu, Yun Liu, Le Zhang, et al. 21.36
- Anabranch Network For Camouflaged Object Segmentation (2021). Trung-Nghia Le, Tam V. Nguyen, Zhongliang Nie, et al. 21.04
- Cars Can't Fly Up In The Sky: Improving Urban-scene Segmentation Via Height-driven Attention Networks (2020). Sungha Choi, Joanne T. Kim, Jaegul Choo. 20.96
- Multi-interactive Dual-decoder For RGB-thermal Salient Object Detection (2020). Zhengzheng Tu, Zhun Li, Chenglong Li, et al. 20.82
- Augmentation For Small Object Detection (2019). Mate Kisantal, Zbigniew Wojna, Jakub Murawski, et al. 20.78
- Video Object Segmentation Using Space-time Memory Networks (2019). Seoung Wug Oh, Joon-Young Lee, Ning Xu, et al. 20.78
- Bilateral Reference For High-resolution Dichotomous Image Segmentation (2024). Peng Zheng, Dehong Gao, Deng-Ping Fan, et al. 20.61
- SwiftNet: Real-time Video Object Segmentation (2021). Haochen Wang, Xiaolong Jiang, Haibing Ren, et al. 20.57
- MOTS: Multi-object Tracking And Segmentation (2019). Paul Voigtlaender, Michael Krause, Aljosa Osep, et al. 20.41
- 'Squeeze & Excite' Guided Few-shot Segmentation Of Volumetric Images (2019). Abhijit Guha Roy, Shayan Siddiqui, Sebastian Pölsterl, et al. 20.37
- Weakly Supervised Learning Of Instance Segmentation With Inter-pixel Relations (2019). Jiwoon Ahn, Sunghyun Cho, Suha Kwak. 20.27
- Reverse Attention For Salient Object Detection (2018). Shuhan Chen, Xiuli Tan, Ben Wang, et al. 20.26
- SipMask: Spatial Information Preservation For Fast Image And Video Instance Segmentation (2020). Jiale Cao, Rao Muhammad Anwer, Hisham Cholakkal, et al. 20.22
- YouTube-VOS: Sequence-to-sequence Video Object Segmentation (2018). Ning Xu, Linjie Yang, Yuchen Fan, et al. 20.04
- Multi-branch And Multi-scale Attention Learning For Fine-grained Visual Categorization (2020). Fan Zhang, Meng Li, Guisheng Zhai, et al. 20.02
- Revisiting Weak-to-strong Consistency In Semi-supervised Semantic Segmentation (2022). Lihe Yang, Lei Qi, Litong Feng, et al. 20.00
- Image Segmentation Using Text And Image Prompts (2021). Timo Lüddecke, Alexander S. Ecker. 19.96
- SG-One: Similarity Guidance Network For One-shot Semantic Segmentation (2018). Xiaolin Zhang, Yunchao Wei, Yi Yang, et al. 19.74
- Open-vocabulary Semantic Segmentation With Mask-adapted CLIP (2022). Feng Liang, Bichen Wu, Xiaoliang Dai, et al. 19.49
- PiCIE: Unsupervised Semantic Segmentation Using Invariance And Equivariance In Clustering (2021). Jang Hyun Cho, Utkarsh Mall, Kavita Bala, et al. 19.39
- Real-time Scene Text Detection With Differentiable Binarization And Adaptive Scale Fusion (2022). Minghui Liao, Zhisheng Zou, Zhaoyi Wan, et al. 19.30
- Stronger, Fewer, & Superior: Harnessing Vision Foundation Models For Domain Generalized Semantic Segmentation (2023). Zhixiang Wei, Lin Chen, Yi Jin, et al. 19.28
- YOLACT++: Better Real-time Instance Segmentation (2019). Daniel Bolya, Chong Zhou, Fanyi Xiao, et al. 19.23
- Point-set Anchors For Object Detection, Instance Segmentation And Pose Estimation (2020). Fangyun Wei, Xiao Sun, Hongyang Li, et al. 19.22
- Scaling Local Self-attention For Parameter Efficient Visual Backbones (2021). Ashish Vaswani, Prajit Ramachandran, Aravind Srinivas, et al. 19.20
- Action Segmentation With Joint Self-supervised Temporal Domain Adaptation (2020). Min-Hung Chen, Baopu Li, Yingze Bao, et al. 18.98
- Rethinking Semantic Segmentation: A Prototype View (2022). Tianfei Zhou, Wenguan Wang, Ender Konukoglu, et al. 18.97
- LAVT: Language-aware Vision Transformer For Referring Image Segmentation (2021). Zhao Yang, Jiaqi Wang, Yansong Tang, et al. 18.95
- Mixed Transformer U-Net For Medical Image Segmentation (2021). Hongyi Wang, Shiao Xie, Lanfen Lin, et al. 18.70