D-STC: Deep learning with spatio-temporal constraints for train drivers detection from videos

被引:11
作者
Xu, Mingliang [1 ]
Fang, Hao [1 ]
Lv, Pei [1 ]
Cui, Lisha [1 ]
Zhang, Shuo [1 ]
Zhou, Bing [1 ]
机构
[1] Zhengzhou Univ, 100 Kexue Rd, Zhengzhou 450000, Henan, Peoples R China
关键词
Deep learning; Train driver detection; Spatio-temporal constraints; Dynamic adjustment;
D O I
10.1016/j.patrec.2017.09.040
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video-based train driver operation monitoring is one of the emerging requirements for train safety management and driving operation regularization. Recent years, deep learning methods such as Faster R-CNN have achieved excellent detection performance on images. However, they are not specially designed for object detection from videos, especially for those train drivers who often perform tiny moving in the monitoring video. Spatial and temporal information of videos are not fully explored together to solve this problem. In this paper, a new framework D-STC is proposed to handles the complex situations in train cab and detect train drivers from videos in a more reliable way. The proposed framework first utilizes fine tuning Faster R-CNN framework to detect the train drivers as the initial detection results. Then, the initial detection results of each frame is processed further to suppress false detection results by using the customized spatial constraints. Finally, an optimal threshold adjustment mechanism is presented to improve detection accuracy for the whole video sequence. The D-STC framework improves the accuracy of train driver detection and fully guarantees the detection speed for videos. Experimental results demonstrate the effectiveness of the proposed framework. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:222 / 228
页数:7
相关论文
共 33 条
[1]  
[Anonymous], 2016, ADV MAT SCI ENG
[2]  
[Anonymous], 2010, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, DOI DOI 10.1109/CVPR.2010.5539906
[3]   Dynamic Scene Understanding for Behavior Analysis Based on String Kernels [J].
Brun, Luc ;
Saggese, Alessia ;
Vento, Mario .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2014, 24 (10) :1669-1681
[4]  
Chen Yan-ping, 2011, 2011 IEEE International Conference on Computer Science and Automation Engineering (CSAE), P748, DOI 10.1109/CSAE.2011.5952952
[5]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[6]  
Dan O., 2014, SPATIO TEMPORAL OBJE
[7]  
Divvala SK, 2009, PROC CVPR IEEE, P1271, DOI 10.1109/CVPRW.2009.5206532
[8]   Fast Feature Pyramids for Object Detection [J].
Dollar, Piotr ;
Appel, Ron ;
Belongie, Serge ;
Perona, Pietro .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (08) :1532-1545
[9]   Scalable Object Detection using Deep Neural Networks [J].
Erhan, Dumitru ;
Szegedy, Christian ;
Toshev, Alexander ;
Anguelov, Dragomir .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :2155-2162
[10]  
Everingham M., 2010, INT C MACHINE LEARNI, P117