Rich feature hierarchies for accurate object detection and semantic segmentation

Cited by: 12683

Authors:

Girshick, Ross [1]

Donahue, Jeff [1]

Darrell, Trevor [1]

Malik, Jitendra [1]

Affiliation:

[1] Univ Calif Berkeley, Berkeley, CA 94720 USA

Source:

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014

Keywords:

DOI:

10.1109/CVPR.2014.81

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Object detection performance, as measured on the canonical PASCAL VOC dataset, has plateaued in the last few years. The best-performing methods are complex ensemble systems that typically combine multiple low-level image features with high-level context. In this paper, we propose a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous best result on VOC 2012, achieving a mAP of 53.3%. Our approach combines two key insights: (1) one can apply high-capacity convolutional neural networks (CNNs) to bottom-up region proposals in order to localize and segment objects and (2) when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost. Since we combine region proposals with CNNs, we call our method R-CNN: Regions with CNN features. Source code for the complete system is available at http://www.cs.berkeley.edu/~rbg/rcnn.
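
The first insight in the abstract (score each bottom-up region proposal using features computed from that region) can be illustrated with a toy sketch. This is not the authors' implementation: the function names `warp_region`, `extract_features`, and `score_regions` are hypothetical, the boxes are hand-written rather than produced by a region-proposal method, and a flatten-the-pixels step stands in for the CNN, purely to show the per-region control flow.

```python
def warp_region(image, box, size=8):
    """Crop box (x0, y0, x1, y1) and resize it to size x size.

    Nearest-neighbour sampling is an assumption made for brevity; the point is
    only that every proposal ends up with the same fixed input shape.
    """
    x0, y0, x1, y1 = box
    h, w = y1 - y0, x1 - x0
    return [[image[y0 + r * h // size][x0 + c * w // size] for c in range(size)]
            for r in range(size)]


def extract_features(patch):
    """Stand-in for the CNN: flatten the fixed-size patch into a vector."""
    return [float(v) for row in patch for v in row]


def score_regions(image, boxes, weights, size=8):
    """Score every proposed box with one linear classifier over its features."""
    scores = []
    for box in boxes:
        feats = extract_features(warp_region(image, box, size))
        scores.append(sum(f * w for f, w in zip(feats, weights)))
    return scores


# Toy 32x32 image with a bright 16x16 square as the "object".
image = [[1.0 if 8 <= y < 24 and 8 <= x < 24 else 0.0 for x in range(32)]
         for y in range(32)]

# Hand-written proposals (hypothetical; a real system would generate these).
boxes = [(0, 0, 8, 8), (8, 8, 24, 24), (16, 16, 32, 32)]

# Hypothetical classifier weights: the score is the mean brightness of the patch.
weights = [1.0 / 64] * 64

scores = score_regions(image, boxes, weights)
best_box = boxes[scores.index(max(scores))]
```

In this toy setup the box that exactly covers the square gets the highest score, the partially overlapping box a lower one, and the empty corner scores zero. Replacing `extract_features` with a trained network and `weights` with learned per-class classifiers would be the step that corresponds to the abstract's description.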

Citation

Pages: 580-587

Number of pages: 8

50 items in total

[31] Semantic Guided Feature Aggregation Network for Salient Object Detection
Wang Z.-W.
Song H.-H.
Fan J.-Q.
Liu Q.-S.
Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (11) : 2386 - 2395
[32] SAFPN: a full semantic feature pyramid network for object detection
Wang, Gaihua
Li, Qi
Wang, Nengyuan
Liu, Hong
PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (04) : 1729 - 1739
[33] Object Detection Oriented Feature Pooling for Video Semantic Indexing
Ueki, Kazuya
Kobayashi, Tetsunori
PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2017), VOL 5, 2017 : 44 - 51
[34] Leveraging Spatial-semantic Information in Object Detection and Segmentation
Guo Q.-Z.
Yuan C.
Ruan Jian Xue Bao/Journal of Software, 2023, 34 (06) : 2776 - 2788
[35] Contrastive and consistent feature learning for weakly supervised object localization and semantic segmentation
Ki, Minsong
Uh, Youngjung
Lee, Wonyoung
Byun, Hyeran
NEUROCOMPUTING, 2021, 445 : 244 - 254
[36] SSFENET: SPATIAL AND SEMANTIC FEATURE ENHANCEMENT NETWORK FOR OBJECT DETECTION
Wang, Tianyuan
Ma, Can
Su, Haoshan
Wang, Weiping
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021 : 1500 - 1504
[37] Qualitative multiscale feature hierarchies for object tracking
Bretzner, L
Lindeberg, T
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2000, 11 (02) : 115 - 129
[38] A feature enriching object detection framework with weak segmentation loss
Zhang, Tianqi
Hao, Li-Ying
Guo, Ge
NEUROCOMPUTING, 2019, 335 : 72 - 80
[39] SCAN: Semantic Context Aware Network for Accurate Small Object Detection
Guan, Linting
Wu, Yan
Zhao, Junqiao
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2018, 11 (01) : 951 - 961
[40] Efficient and Accurate Text Detection Combining Differentiable Binarization with Semantic Segmentation
Liu, Yue
Shi, Ying
Lin, Chaojun
Hua, Jie
Huang, Ziqi
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT III, 2022, 13531 : 630 - 642
