Count- and Similarity-Aware R-CNN for Pedestrian Detection

被引:29
作者
Xie, Jin [1 ]
Cholakkal, Hisham [2 ,3 ]
Anwer, Rao Muhammad [2 ,3 ]
Khan, Fahad Shahbaz [2 ,3 ]
Pang, Yanwei [1 ]
Shao, Ling [2 ,3 ]
Shah, Mubarak [4 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin Key Lab Brain Inspired Artificial Intelli, Tianjin, Peoples R China
[2] Mohamed bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates
[3] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
[4] Univ Cent Florida, Orlando, FL USA
来源
COMPUTER VISION - ECCV 2020, PT XVII | 2020年 / 12362卷
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Pedestrian detection; Human instance segmentation; NMS;
D O I
10.1007/978-3-030-58520-4_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent pedestrian detection methods generally rely on additional supervision, such as visible bounding-box annotations, to handle heavy occlusions. We propose an approach that leverages pedestrian count and proposal similarity information within a two-stage pedestrian detection framework. Both pedestrian count and proposal similarity are derived from standard full-body annotations commonly used to train pedestrian detectors. We introduce a count-weighted detection loss function that assigns higher weights to the detection errors occurring at highly overlapping pedestrians. The proposed loss function is utilized at both stages of the two-stage detector. We further introduce a count-and-similarity branch within the two-stage detection framework, which predicts pedestrian count as well as proposal similarity. Lastly, we introduce a count and similarity-aware NMS strategy to identify distinct proposals. Our approach requires neither part information nor visible bounding-box annotations. Experiments are performed on the CityPersons and CrowdHuman datasets. Our method sets a new state-of-the-art on both datasets. Further, it achieves an absolute gain of 2.4% over the current state-of-the-art, in terms of log-average miss rate, on the heavily occluded (HO) set of CityPersons test set. Finally, we demonstrate the applicability of our approach for the problem of human instance segmentation. Code and models are available at: https://github.com/Leotju/CaSe.
引用
收藏
页码:88 / 104
页数:17
相关论文
共 37 条
[1]   Soft-NMS - Improving Object Detection With One Line of Code [J].
Bodla, Navaneeth ;
Singh, Bharat ;
Chellappa, Rama ;
Davis, Larry S. .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5562-5570
[2]   Illuminating Pedestrians via Simultaneous Detection & Segmentation [J].
Brazil, Garrick ;
Yin, Xi ;
Liu, Xiaoming .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4960-4969
[3]  
Cai ZW, 2019, Arxiv, DOI [arXiv:1906.09756, DOI 10.48550/ARXIV.1906.09756, 10.48550/arXiv.1906.09756]
[4]   A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection [J].
Cai, Zhaowei ;
Fan, Quanfu ;
Feris, Rogerio S. ;
Vasconcelos, Nuno .
COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 :354-370
[5]   High-Level Semantic Networks for Multi-Scale Object Detection [J].
Cao, Jiale ;
Pang, Yanwei ;
Zhao, Shengjie ;
Li, Xuelong .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (10) :3372-3386
[6]   Taking a Look at Small-Scale Pedestrians and Occluded Pedestrians [J].
Cao, Jiale ;
Pang, Yanwei ;
Han, Jungong ;
Gao, Bolin ;
Li, Xuelong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :3143-3152
[7]   Beyond triplet loss: a deep quadruplet network for person re-identification [J].
Chen, Weihua ;
Chen, Xiaotang ;
Zhang, Jianguo ;
Huang, Kaiqi .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1320-1329
[8]  
Chi C, 2020, AAAI CONF ARTIF INTE, V34, P10639
[9]   Pedestrian Detection: An Evaluation of the State of the Art [J].
Dollar, Piotr ;
Wojek, Christian ;
Schiele, Bernt ;
Perona, Pietro .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (04) :743-761
[10]  
He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]