Unsupervised video segmentation for multi-view daily action recognition

被引:2
作者
Liu, Zhigang [1 ]
Wu, Yin [1 ]
Yin, Ziyang [1 ]
Gao, Chunlei [1 ]
机构
[1] Northeastern Univ, Sch Comp & Commun Engn, Qinhuangdao 066004, Peoples R China
关键词
Video segmentation; Multi-layer representation; Multi-view action recognition; Motion atom; Motion phrase; VECTOR; REPRESENTATION; FRAMEWORK; NETWORK;
D O I
10.1016/j.imavis.2023.104687
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-layer representations have achieved outstanding performances in the complex daily action recognition. However, the fixed length of the sliding window (SW) leads to the segmented motion atoms incomplete or non-unique. To deal with this problem, we propose unsupervised video segmentation (UVS) for multi-view daily action recognition. Firstly, the average cosine similarity is designed to ensure the integrity of motion atoms. Secondly, we utilize the ordered combination of motion atoms to construct the table of multi-scale motion phrases in a top-down manner, instead of the fixed-scale traditional motion phrases. Finally, the experimental results based on the WVU dataset, the NTU RGB-D 120 dataset, and the N-UCLA dataset show that the proposed UVS method has state-of-the-art performance, compared with the classic methods such as IDT, MoFAP, JLMF, FGCN, and MVMLR.& COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 49 条
[1]   Improving Human Action Recognition Using Hierarchical Features And Multiple Classifier Ensembles [J].
Bulbul, Mohammad Farhad ;
Islam, Saiful ;
Zhou, Yatong ;
Ali, Hazrat .
COMPUTER JOURNAL, 2021, 64 (11) :1633-1655
[2]   Multi-View Super Vector for Action Recognition [J].
Cai, Zhuowei ;
Wang, Limin ;
Peng, Xiaojiang ;
Qiao, Yu .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :596-603
[3]  
Cheng ZW, 2012, LECT NOTES COMPUT SC, V7584, P52, DOI 10.1007/978-3-642-33868-7_6
[4]  
Dhiman C, 2019, 2019 IEEE FIFTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2019), P225, DOI [10.1109/BigMM.2019.00041, 10.1109/BigMM.2019.00-21]
[5]   View-Invariant Deep Architecture for Human Action Recognition Using Two-Stream Motion and Shape Temporal Dynamics [J].
Dhiman, Chhavi ;
Vishwakarma, Dinesh Kumar .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :3835-3844
[6]   A Robust Framework for Abnormal Human Action Recognition Using R-Transform and Zernike Moments in Depth Videos [J].
Dhiman, Chhavi ;
Vishwakarma, Dinesh Kumar .
IEEE SENSORS JOURNAL, 2019, 19 (13) :5195-5203
[7]  
Ding L, 2017, Arxiv, DOI arXiv:1705.07818
[8]   Hierarchical Attention Network for Action Segmentation [J].
Gammulle, Harshala ;
Denman, Simon ;
Sridharan, Sridha ;
Fookes, Clinton .
PATTERN RECOGNITION LETTERS, 2020, 131 :442-448
[9]   Adaptive Fusion and Category-Level Dictionary Learning Model for Multiview Human Action Recognition [J].
Gao, Zan ;
Xuan, Hai-Zhen ;
Zhang, Hua ;
Wan, Shaohua ;
Choo, Kim-Kwang Raymond .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (06) :9280-9293
[10]   Incremental human action recognition with dual memory [J].
Gutoski, Matheus ;
Lazzaretti, Andre Eugenio ;
Lopes, Heitor Silverio .
IMAGE AND VISION COMPUTING, 2021, 116