MLPD: Multi-Label Pedestrian Detector in Multispectral Domain

被引:54
作者
Kim, Jiwon [1 ]
Kim, Hyeongjun [1 ]
Kim, Taejoo [1 ]
Kim, Namil [2 ]
Choi, Yukyung [1 ]
机构
[1] Sejong Univ, Sch Intelligent Mechatron Engn, Seoul 05006, South Korea
[2] NAVER LABS, Gyeonggi Do 13638, South Korea
基金
新加坡国家研究基金会;
关键词
Deep learning for visual perception; object detection; multispectral pedestrian detection; FUSION;
D O I
10.1109/LRA.2021.3099870
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Multispectral pedestrian detection has been actively studied as a promising multi-modality solution to handle illumination and weather changes. Most multi-modality approaches carry the assumption that all inputs are fully-overlapped. However, these kinds of data pairs are not common in practical applications due to the complexity of the existing sensor configuration. In this letter, we tackle multispectral pedestrian detection, where all input data are not paired. To this end, we propose a novel single-stage detection framework that leverages multi-label learning to learn input state-aware features by assigning a separate label according to the given state of the input image pair. We also present a novel augmentation strategy by applying geometric transformations to synthesize the unpaired multispectral images. In extensive experiments, we demonstrate the efficacy of the proposed method under various real-world conditions, such as fully-overlapped images and partially-overlapped images, in stereo-vision. Code and a demonstration video are available at https://github.com/sejong-rcv/MLPD-Multi-Label-Pedestrian-Detection.
引用
收藏
页码:7846 / 7853
页数:8
相关论文
共 22 条
  • [1] [Anonymous], 2016, P BRIT MACH VIS C 20
  • [2] Choi H, 2016, INT C PATT RECOG, P621, DOI 10.1109/ICPR.2016.7899703
  • [3] Pedestrian Detection: An Evaluation of the State of the Art
    Dollar, Piotr
    Wojek, Christian
    Schiele, Bernt
    Perona, Pietro
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (04) : 743 - 761
  • [4] Using multi-label classification to improve object detection
    Gong, Tao
    Liu, Bin
    Chu, Qi
    Yu, Nenghai
    [J]. NEUROCOMPUTING, 2019, 370 : 174 - 185
  • [5] Pedestrian Detection at Day/Night Time with Visible and FIR Cameras: A Comparison
    Gonzalez, Alejandro
    Fang, Zhijie
    Socarras, Yainuvis
    Serrat, Joan
    Vazquez, David
    Xu, Jiaolong
    Lopez, Antonio M.
    [J]. SENSORS, 2016, 16 (06)
  • [6] Fusion of multispectral data through illumination-aware deep neural networks for pedestrian detection
    Guan, Dayan
    Cao, Yanpeng
    Yang, Jiangxin
    Cao, Yanlong
    Yang, Michael Ying
    [J]. INFORMATION FUSION, 2019, 50 : 148 - 157
  • [7] Ha Q, 2017, IEEE INT C INT ROBOT, P5108, DOI 10.1109/IROS.2017.8206396
  • [8] Hwang S, 2015, PROC CVPR IEEE, P1037, DOI 10.1109/CVPR.2015.7298706
  • [9] Kim M, 2019, IEEE IMAGE PROC, P1650, DOI [10.1109/icip.2019.8803098, 10.1109/ICIP.2019.8803098]
  • [10] Fully Convolutional Region Proposal Networks for Multispectral Person Detection
    Koenig, Daniel
    Adam, Michael
    Jarvers, Christian
    Layher, Georg
    Neumann, Heiko
    Teutsch, Michael
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 243 - 250