MLRMV: Multi-layer representation for multi-view action recognition

被引:4
作者
Liu, Zhigang [1 ]
Yin, Ziyang [1 ]
Wu, Yin [1 ]
机构
[1] Northeastern Univ, Sch Comp & Commun Engn, Qinhuangdao 066004, Hebei, Peoples R China
关键词
Multi-layer representation; Multi-view action recognition; Motion atom; Motion phrase; JOINT; SEGMENTATION; TEMPLATES;
D O I
10.1016/j.imavis.2021.104333
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Daily action recognition has gained much interest in computer vision. However, viewpoint changes will lead to sizable intra-class differences in the same action. To deal with this problem, we propose a novel multi-view daily action recognition approach based on the multi-layer representation. In use of motion atoms and motion phrases, we construct the middle-level feature representations in multi-view daily actions. A multi-view unsupervised discriminative clustering method is proposed for constructing motion atoms, and the classification accuracy of motion atoms is improved by jointly learning atom dictionaries and the classifier. Moreover, we present discontinuous temporal scale motion phrases and a grading mechanism of motion phrases to strengthen the representative ability of motion phrases and the final recognition accuracy. Finally, the experimental results based on the WVU dataset, the NTU RGB-D dataset, and N-UCLA dataset show that the proposed methods have the state-of-the-art performance, compared with the classic methods such as IDT, MoFAP, JLMF, and so on. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 39 条
[1]   Human action recognition using short-time motion energy template images and PCANet features [J].
Abdelbaky, Amany ;
Aly, Saleh .
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (16) :12561-12574
[2]  
Amer MR, 2012, LECT NOTES COMPUT SC, V7575, P187, DOI 10.1007/978-3-642-33765-9_14
[3]  
[Anonymous], 2016, WVU MULTIVIEW ACTION
[4]   On Temporal Order Invariance for View-Invariant Action Recognition [J].
Anwaar-ul-Haq ;
Gondal, Iqbal ;
Murshed, Manzur .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (02) :203-211
[5]   The recognition of human movement using temporal templates [J].
Bobick, AF ;
Davis, JW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (03) :257-267
[6]   A High-Throughput FPGA Accelerator for Short-Read Mapping of the Whole Human Genome [J].
Chen, Yen-Lung ;
Chang, Bo-Yi ;
Yang, Chia-Hsiang ;
Chiueh, Tzi-Dar .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (06) :1465-1478
[7]  
Duan H., 2021, ARXIV PREPRINT ARXIV
[8]   Clustering by passing messages between data points [J].
Frey, Brendan J. ;
Dueck, Delbert .
SCIENCE, 2007, 315 (5814) :972-976
[9]   Adaptive Fusion and Category-Level Dictionary Learning Model for Multiview Human Action Recognition [J].
Gao, Zan ;
Xuan, Hai-Zhen ;
Zhang, Hua ;
Wan, Shaohua ;
Choo, Kim-Kwang Raymond .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (06) :9280-9293
[10]   Simultaneous joint and object trajectory templates for human activity recognition from 3-D data [J].
Ghodsi, Saeed ;
Mohammadzade, Hoda ;
Korki, Erfan .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 55 :729-741