Human action recognition using bag of global and local Zernike moment features

被引:21
作者
Aly, Saleh [1 ,2 ]
Sayed, Asmaa [1 ]
机构
[1] Aswan Univ, Fac Engn, Dept Elect Engn, Aswan 81542, Egypt
[2] Majmaah Univ, Coll Comp & Informat Sci, Dept Informat Technol, Al Majmaah 11952, Saudi Arabia
关键词
Human action recognition; Global Zernike moments; Local Zernike moments; K-means; Bag-of-features; INVARIANTS; NETWORK; HISTORY;
D O I
10.1007/s11042-019-7674-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human action recognition is a fundamental and challenging building block for many computer vision applications. It has been included in many applications such as: video surveillance, human computer interaction and multimedia retrieval systems. Various approaches have been proposed to solve human action recognition problem. Among others, moment-based methods considered as one of the most simple and successful approach. However, moment-based methods take into consideration only global features while neglect the discriminative properties of local features. In this paper, we propose a new efficient method which combine both Global and Local Zernike Moment (GLZM) features based on Bag-of-Features (BoF) technique. Since using only global features are not sufficient to discriminate similar actions like running, walking and jogging, augmenting these features with localized features helps to improve the recognition accuracy. The proposed method first calculate local temporal Motion Energy Images (MEI) by accumulating frame differences of short time consecutive frames. Then, global and local features are calculated using Zernike moments with different polynomial orders to represent global and local motion patterns respectively. Global features are calculated from the whole region of the human performing action while local features focused on localized regions of the human in order to represent local motion information. Both local and global features are preprocessed using whitening transformation, then bag-of-features algorithm is employed to combine those pool of features and represent each action using new GLZM feature descriptor. Finally, we use multi-class Support Vector Machine (SVM) classifier to recognize human actions. In order to validate the proposed method, we perform a set of experiments using three publicly available datasets: Weizmann, KTH and UCF sports action. Experimental results using leave-one-out strategy show that proposed method achieves promising results compared with other state-of-the-art methods.
引用
收藏
页码:24923 / 24953
页数:31
相关论文
共 55 条
[1]   Action recognition based on binary patterns of action-history and histogram of oriented gradient [J].
Ahad, Md. Atiqur Rahman ;
Islam, Md. Nazmul ;
Jahan, Israt .
JOURNAL ON MULTIMODAL USER INTERFACES, 2016, 10 (04) :335-344
[2]   Motion history image: its variants and applications [J].
Ahad, Md. Atiqur Rahman ;
Tan, J. K. ;
Kim, H. ;
Ishikawa, S. .
MACHINE VISION AND APPLICATIONS, 2012, 23 (02) :255-281
[3]   Variable silhouette energy image representations for recognizing human actions [J].
Ahmad, Mohiuddin ;
Lee, Seong-Whan .
IMAGE AND VISION COMPUTING, 2010, 28 (05) :814-824
[4]  
Al-Azzo F, 2017, INT J ADV COMPUT SC, V8, P13
[5]  
Aly S, 2019, PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON INNOVATIVE TRENDS IN COMPUTER ENGINEERING (ITCE 2019), P52, DOI [10.1109/itce.2019.8646504, 10.1109/ITCE.2019.8646504]
[6]  
[Anonymous], 2013, BMVC
[7]  
[Anonymous], MULTIMED TOOLS APPL
[8]  
[Anonymous], INT J COMPUTER ENG A
[9]  
[Anonymous], 2014 11 INT MULT SYS
[10]  
[Anonymous], ACM T INFORM SYST