Rich feature hierarchies for accurate object detection and semantic segmentation

Cited by: 12683
Authors
Girshick, Ross [1]
Donahue, Jeff [1]
Darrell, Trevor [1]
Malik, Jitendra [1]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
Source
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014
DOI
10.1109/CVPR.2014.81
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Object detection performance, as measured on the canonical PASCAL VOC dataset, has plateaued in the last few years. The best-performing methods are complex ensemble systems that typically combine multiple low-level image features with high-level context. In this paper, we propose a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous best result on VOC 2012, achieving a mAP of 53.3%. Our approach combines two key insights: (1) one can apply high-capacity convolutional neural networks (CNNs) to bottom-up region proposals in order to localize and segment objects and (2) when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost. Since we combine region proposals with CNNs, we call our method R-CNN: Regions with CNN features. Source code for the complete system is available at http://www.cs.berkeley.edu/~rbg/rcnn.
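The relative-improvement figure in the abstract can be unpacked with a line of arithmetic. The sketch below is illustrative only and not part of the paper: the function name `implied_baseline` is hypothetical, and the 30% figure is treated as a lower bound because the abstract says "more than 30%". Under that reading, the previous best result on VOC 2012 would have been below roughly 41% mAP.

```python
def implied_baseline(new_map: float, relative_gain: float) -> float:
    """Return the baseline mAP implied by a new mAP and a relative gain.

    Hypothetical helper: solves new_map = baseline * (1 + relative_gain).
    """
    return new_map / (1.0 + relative_gain)


new_map = 53.3            # mAP (%) on VOC 2012, taken from the abstract
min_relative_gain = 0.30  # "more than 30%", used here as a lower bound

# A larger relative gain implies a smaller baseline, so this is an upper bound.
upper_bound = implied_baseline(new_map, min_relative_gain)

print(f"previous best mAP is below about {upper_bound:.1f}%")
print(f"absolute gain exceeds about {new_map - upper_bound:.1f} points")
```

The distinction matters when comparing papers: a 30% relative gain here corresponds to an absolute gain of roughly 12 mAP points, not 30.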
Pages: 580-587
Number of pages: 8
Related papers
50 items in total
  • [31] Semantic Guided Feature Aggregation Network for Salient Object Detection
    Wang Z.-W.
    Song H.-H.
    Fan J.-Q.
    Liu Q.-S.
    Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (11) : 2386 - 2395
  • [32] SAFPN: a full semantic feature pyramid network for object detection
    Wang, Gaihua
    Li, Qi
    Wang, Nengyuan
    Liu, Hong
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (04) : 1729 - 1739
  • [33] Object Detection Oriented Feature Pooling for Video Semantic Indexing
    Ueki, Kazuya
    Kobayashi, Tetsunori
    PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2017), VOL 5, 2017 : 44 - 51
  • [34] Leveraging Spatial-semantic Information in Object Detection and Segmentation
    Guo Q.-Z.
    Yuan C.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (06) : 2776 - 2788
  • [35] Contrastive and consistent feature learning for weakly supervised object localization and semantic segmentation
    Ki, Minsong
    Uh, Youngjung
    Lee, Wonyoung
    Byun, Hyeran
    NEUROCOMPUTING, 2021, 445 : 244 - 254
  • [36] SSFENET: SPATIAL AND SEMANTIC FEATURE ENHANCEMENT NETWORK FOR OBJECT DETECTION
    Wang, Tianyuan
    Ma, Can
    Su, Haoshan
    Wang, Weiping
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021 : 1500 - 1504
  • [37] Qualitative multiscale feature hierarchies for object tracking
    Bretzner, L
    Lindeberg, T
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2000, 11 (02) : 115 - 129
  • [38] A feature enriching object detection framework with weak segmentation loss
    Zhang, Tianqi
    Hao, Li-Ying
    Guo, Ge
    NEUROCOMPUTING, 2019, 335 : 72 - 80
  • [39] SCAN: Semantic Context Aware Network for Accurate Small Object Detection
    Guan, Linting
    Wu, Yan
    Zhao, Junqiao
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2018, 11 (01) : 951 - 961
  • [40] Efficient and Accurate Text Detection Combining Differentiable Binarization with Semantic Segmentation
    Liu, Yue
    Shi, Ying
    Lin, Chaojun
    Hua, Jie
    Huang, Ziqi
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT III, 2022, 13531 : 630 - 642