Battlefield Target Aggregation Behavior Recognition Model Based on Multi-Scale Feature Fusion

被引:6
作者
Jiang, Haiyang [1 ]
Pan, Yaozong [1 ]
Zhang, Jian [1 ]
Yang, Haitao [1 ]
机构
[1] Space Engn Univ, 81 Rd, Beijing 101400, Peoples R China
来源
SYMMETRY-BASEL | 2019年 / 11卷 / 06期
关键词
machine vision; aggregation behavior; convolutional neural network; video; action recognition;
D O I
10.3390/sym11060761
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper, our goal is to improve the recognition accuracy of battlefield target aggregation behavior while maintaining the low computational cost of spatio-temporal depth neural networks. To this end, we propose a novel 3D-CNN (3D Convolutional Neural Networks) model, which extends the idea of multi-scale feature fusion to the spatio-temporal domain, and enhances the feature extraction ability of the network by combining feature maps of different convolutional layers. In order to reduce the computational complexity of the network, we further improved the multi-fiber network, and finally established an architecture-3D convolution Two-Stream model based on multi-scale feature fusion. Extensive experimental results on the simulation data show that our network significantly boosts the efficiency of existing convolutional neural networks in the aggregation behavior recognition, achieving the most advanced performance on the dataset constructed in this paper.
引用
收藏
页数:12
相关论文
共 20 条
[1]  
[Anonymous], 2013, IEEE I CONF COMP VIS, DOI DOI 10.1109/ICCV.2013.441
[2]   Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].
Carreira, Joao ;
Zisserman, Andrew .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733
[3]  
Chen Y., 2018, Proceedings of the European conference on computer vision (ECCV)
[4]  
Christoph R., 2016, Advances in neural information processing systems, P3476
[5]   Long-Term Recurrent Convolutional Networks for Visual Recognition and Description [J].
Donahue, Jeff ;
Hendricks, Lisa Anne ;
Rohrbach, Marcus ;
Venugopalan, Subhashini ;
Guadarrama, Sergio ;
Saenko, Kate ;
Darrell, Trevor .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) :677-691
[6]  
Duta I.C., 2017, P INT C MULTIMEDIA M
[7]   Spatiotemporal Multiplier Networks for Video Action Recognition [J].
Feichtenhofer, Christoph ;
Pinz, Axel ;
Wildes, Richard P. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :7445-7454
[8]   Meta-action descriptor for action recognition in RGBD video [J].
Huang, Min ;
Su, Song-Zhi ;
Cai, Guo-Rong ;
Zhang, Hong-Bo ;
Cao, Donglin ;
Li, Shao-Zi .
IET COMPUTER VISION, 2017, 11 (04) :301-308
[9]  
Li C., COLLABORATIVE SPATIO
[10]  
Lin T.-Y., 2016, Feature Pyramid Networks for Object Detection