Mask R-CNN

被引:301
|
作者
He, Kaiming [1 ]
Gkioxari, Georgia [1 ]
Dollar, Piotr [1 ]
Girshick, Ross [1 ]
机构
[1] Facebook AI Res, Menlo Pk, CA 94025 USA
关键词
Task analysis; Semantics; Feature extraction; Object detection; Proposals; Image segmentation; Quantization (signal); Instance segmentation; object detection; pose estimation; convolutional neural network;
D O I
10.1109/TPAMI.2018.2844175
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a conceptually simple, flexible, and general framework for object instance segmentation. Our approach efficiently detects objects in an image while simultaneously generating a high-quality segmentation mask for each instance. The method, called Mask R-CNN, extends Faster R-CNN by adding a branch for predicting an object mask in parallel with the existing branch for bounding box recognition. Mask R-CNN is simple to train and adds only a small overhead to Faster R-CNN, running at 5 fps. Moreover, Mask R-CNN is easy to generalize to other tasks, e.g., allowing us to estimate human poses in the same framework. We show top results in all three tracks of the COCO suite of challenges, including instance segmentation, bounding-box object detection, and person keypoint detection. Without bells and whistles, Mask R-CNN outperforms all existing, single-model entries on every task, including the COCO 2016 challenge winners. We hope our simple and effective approach will serve as a solid baseline and help ease future research in instance-level recognition. Code has been made available at: https://github.com/facebookresearch/Detectron.
引用
收藏
页码:386 / 397
页数:12
相关论文
共 50 条
  • [1] Mask R-CNN
    He, Kaiming
    Gkioxari, Georgia
    Dollar, Piotr
    Girshick, Ross
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2980 - 2988
  • [2] Mask Scoring R-CNN
    Huang, Zhaojin
    Huang, Lichao
    Gong, Yongchao
    Huang, Chang
    Wang, Xinggang
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6402 - 6411
  • [3] IEMask R-CNN: Information-Enhanced Mask R-CNN
    Bi, Xiuli
    Hu, Jinwu
    Xiao, Bin
    Li, Weisheng
    Gao, Xinbo
    IEEE TRANSACTIONS ON BIG DATA, 2023, 9 (02) : 688 - 700
  • [4] Nuclei R-CNN: Improve Mask R-CNN for Nuclei Segmentation
    Lv, Guofeng
    Wen, Ke
    Wu, Zheng
    Jin, Xu
    An, Hong
    He, Jie
    2019 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP), 2019, : 357 - 362
  • [5] SE-Mask R-CNN: An improved Mask R-CNN for apple detection and segmentation
    Liu, Yikun
    Yang, Gongping
    Huang, Yuwen
    Yin, Yilong
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 6715 - 6725
  • [6] Mask R-CNN for Ear Detection
    Bizjak, Matic
    Peer, Peter
    Emersic, Ziga
    2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 1624 - 1628
  • [7] Crack Detection and Comparison Study Based on Faster R-CNN and Mask R-CNN
    Xu, Xiangyang
    Zhao, Mian
    Shi, Peixin
    Ren, Ruiqi
    He, Xuhui
    Wei, Xiaojun
    Yang, Hao
    SENSORS, 2022, 22 (03)
  • [8] Hangar Detection with Mask R-CNN Algorithm
    Omeroglu, Asli Nur
    Kumbasar, Nida
    Oral, Emin Argun
    Ozbek, I. Yucel
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [9] Multiple Barcode Detection with Mask R-CNN
    Polat, Enes
    Mohammed, Hussein Mahmood Abdo
    Omeroglu, Asli Nur
    Kumbasar, Nida
    Ozbek, IYucel
    Oral, Emin Argun
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [10] Gastric Cancer Diagnosis with Mask R-CNN
    Cao, Guitao
    Song, Wenli
    Zhao, Zhenwei
    2019 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2019), VOL 1, 2019, : 60 - 63