Action Recognition Using Temporal Partitioning of Motion Information

被引:0
作者
Amirjan, Pouria [1 ]
Mansouri, Azadeh [1 ]
机构
[1] Kharazmi Univ, Fac Elect & Comp Engn, Dept Engn, Tehran, Iran
来源
2019 27TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2019) | 2019年
关键词
component; Action Recognition; First-person Video; Third Person Video; Sub-events; Pyramid Pooling;
D O I
10.1109/iraniancee.2019.8786379
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, a temporal representation method for video action recognition is proposed. Since the intrinsic property of the video stream is its temporal variation, the optical flow images are calculated to show the short-term motion. In order to avoid training a complex network from scratch, a pre-trained network is utilized for frame-level feature extraction. For video level representation, pyramidal pooled time series is considered since the short-term variation can be captured in order to represent fixed-size long-term features. In addition, to solve the information missing problem through long videos, a simple video level representation using temporal partitioning is proposed too. The experimental results of the proposed method illustrates an acceptable performance both in first and third-person action recognition.
引用
收藏
页码:1946 / 1950
页数:5
相关论文
共 21 条
[11]  
Liu C., 2009, Ph.D. thesis
[12]  
Moreira TP, 2017, INT CONF ACOUST SPEE, P2627, DOI 10.1109/ICASSP.2017.7952632
[13]  
Purwanto D, 2017, IEEE INT CON MULTI, P895, DOI 10.1109/ICME.2017.8019520
[14]  
Ryoo MS, 2015, PROC CVPR IEEE, P896, DOI 10.1109/CVPR.2015.7298691
[15]  
Ryoo M. S., 2016, VIDEO BASED CONVOLUT
[16]  
Simonyan Karen, 2014, ADV NEURAL INFORM PR, DOI DOI 10.1002/14651858.CD001941.PUB3
[17]  
Takamine A, 2015, 2015 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), P619, DOI 10.1109/SII.2015.7405050
[18]  
Tran D., 2014, ARXIV14120767
[19]   Action Recognition with Improved Trajectories [J].
Wang, Heng ;
Schmid, Cordelia .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :3551-3558
[20]   Temporal Segment Networks: Towards Good Practices for Deep Action Recognition [J].
Wang, Limin ;
Xiong, Yuanjun ;
Wang, Zhe ;
Qiao, Yu ;
Lin, Dahua ;
Tang, Xiaoou ;
Van Gool, Luc .
COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 :20-36