Tunnel Pedestrian Detection Based on Super Resolution and Convolutional Neural Network

被引:0
作者
Zhao Min [1 ,2 ]
Mei Ying [1 ,2 ]
Sun Dihua [1 ,2 ]
机构
[1] Chongqing Univ, Coll Automat, Chongqing 400044, Peoples R China
[2] Minist Educ, Key Lab Dependable Serv Comp Cyber Phys Soc, Chongqing, Peoples R China
来源
PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019) | 2019年
关键词
Pedestrian Detection; Super Resolution; CNN; RPN;
D O I
10.1109/ccdc.2019.8833181
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Tunnel pedestrian detection is of great significance to traffic safety. However, because the camera used to collect road images in the tunnel video surveillance system is often far away from the ground, the size of the pedestrian target is small. What's more, in the special monitoring environment of the tunnel, the video image is blurred, the noise is too much. and the resolution is low, which makes the pedestrian target detection difficult. Aiming at the above problems. this paper proposes a new target detection network which cascades super-resolution and target detection networks. Firstly, for the problem that the convolutional neural network is difficult to extract features of low-resolution image, super-resolution is performed before the target detection of the image according to that the super-resolution network can increase the image resolution and enrich the image information. Then, according to the characteristics of the pedestrian target detection task and the relatively fixed size and aspect ratio of pedestrian target in the video image of tunnel, the original RPN network is improved, and a candidate box which is more suitable for the pedestrian target detection task is designed. The experimental results show that the proposed method achieves better detection results in the tunnel pedestrian target detection problem.
引用
收藏
页码:4635 / 4640
页数:6
相关论文
共 14 条
[1]  
Davis JE, 2006, DEAF WAY II READER: PERSPECTIVES FROM THE SECOND INTERNATIONAL CONFERENCE ON DEAF CULTURE, P233
[2]   Image Super-Resolution Using Deep Convolutional Networks [J].
Dong, Chao ;
Loy, Chen Change ;
He, Kaiming ;
Tang, Xiaoou .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02) :295-307
[3]  
Girshick R., 2014, IEEE COMP SOC C COMP, DOI [10.1109/CVPR.2014.81, DOI 10.1109/CVPR.2014.81]
[4]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[5]  
Guo A X, 2018, INFORM TECHNOLOGY NE, V37
[6]   Caffe: Convolutional Architecture for Fast Feature Embedding [J].
Jia, Yangqing ;
Shelhamer, Evan ;
Donahue, Jeff ;
Karayev, Sergey ;
Long, Jonathan ;
Girshick, Ross ;
Guadarrama, Sergio ;
Darrell, Trevor .
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :675-678
[7]  
Li Y L, 2016, J CHANGZHI U, V33, P31
[8]   Example-based object detection in images by components [J].
Mohan, A ;
Papageorgiou, C ;
Poggio, T .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (04) :349-361
[9]   Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].
Ren, Shaoqing ;
He, Kaiming ;
Girshick, Ross ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149
[10]   ImageNet Large Scale Visual Recognition Challenge [J].
Russakovsky, Olga ;
Deng, Jia ;
Su, Hao ;
Krause, Jonathan ;
Satheesh, Sanjeev ;
Ma, Sean ;
Huang, Zhiheng ;
Karpathy, Andrej ;
Khosla, Aditya ;
Bernstein, Michael ;
Berg, Alexander C. ;
Fei-Fei, Li .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 115 (03) :211-252