Multi-View Indoor Human Detection Neural Network Based on Joint Learning

被引:0
作者
Wang X. [1 ,2 ]
Zhang W. [1 ,2 ]
机构
[1] School of Electrical Automation and Information Engineering, Tianjin University, Tianjin
[2] School of Microelectronics, Tianjin University, Tianjin
来源
Guangxue Xuebao/Acta Optica Sinica | 2019年 / 39卷 / 02期
关键词
Convolutional neural network; Image processing; Indoor human detection; Joint learning; Multi-view; Video surveillance;
D O I
10.3788/AOS201939.0210002
中图分类号
学科分类号
摘要
An indoor human detection dataset (IHDD) is established, and a novel multi-view indoor human detection neural network (MVNN) based on joint learning is proposed. The model consists of input data layer, feature extraction layer, deformation layer, visibility reasoning layer and classification layer, and the proposed MVNN algorithm can improve the detection performance when combined with the region proposal model and the multi-view model. Experimental results on the self-built IHDD show that compared with other existing detection algorithms, the proposed MVNN algorithm has a higher detection rate. It can still obtain good detection results even in the case of difficult situations such as various views, changing poses and occlusion for human targets, which indicates certain theoretical research value and practical value. © 2019, Chinese Lasers Press. All right reserved.
引用
收藏
相关论文
共 35 条
  • [1] Zou J.H., Zhao Q.C., Yang W., Et al., Occupancy detection in the office by analyzing surveillance videos and its application to building energy conservation, Energy and Buildings, 152, pp. 385-398, (2017)
  • [2] Benenson R., Omran M., Hosang J., Et al., Ten years of pedestrian detection, what have we learned?, European Conference on Computer Vision, pp. 613-627, (2015)
  • [3] Dalal N., Triggs B., Histograms of oriented gradients for human detection, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 886-893, (2005)
  • [4] Wang X.Y., Han T.X., Yan S.C., An HOG-LBP human detector with partial occlusion handling, IEEE 12th International Conference on Computer Vision, pp. 32-39, (2009)
  • [5] Zhang J.G., Huang K.Q., Yu Y.N., Et al., Boosted local structured HOG-LBP for object localization, IEEE International Conference on Computer Vision and Pattern Recognition, pp. 1393-1400, (2011)
  • [6] Lowe D.G., Distinctiveimage features from scale-invariant keypoints, International Journal of Computer Vision, 60, 2, pp. 91-110, (2004)
  • [7] Lienhart R., Maydt J., An extended set of Haar-like features for rapid object detection, International Conference on Image Processing, pp. 900-903, (2002)
  • [8] Dollar P., Tu Z.W., Perona P., Et al., Integral channel features, Proceedings of the British Machine Vision Conference, (2009)
  • [9] Dollar P., Appel R., Belongie S., Et al., Fast feature pyramids for object detection, IEEE Transactions on Pattern Analysis and Machine Intelligence, 36, 8, pp. 1532-1545, (2014)
  • [10] Zhang S.S., Bauckhage C., Cremers A.B., Informed haar-like features improve pedestrian detection, IEEE Conference on Computer Vision and Pattern Recognition, pp. 947-954, (2014)