Encoding learning network combined with feature similarity constraints for human action recognition

被引:0
作者
Wu, Chao [1 ]
Gao, Yakun [1 ]
Li, Guang [1 ]
Shi, Chunfeng [1 ]
机构
[1] Henan Inst Technol, Sch Elect Engn & Automat, Xinxiang 453003, Peoples R China
基金
英国科研创新办公室;
关键词
Extreme learning machine (ELM); Feature encoding; Similarity constraint; Human action recognition; MACHINE;
D O I
10.1007/s11042-023-17424-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extreme learning machine (ELM) is a fast and efficient classifier. Due to the inability to process descriptor-level features extracted from video sequences, the networks based on ELM cannot be directly used to recognize human actions. Encoding learning network (ELN) is proposed to solve this problem. The network is composed of feature encoding module and double similarity-constrained extreme learning machine (DS-ELM). In feature encoding module, the sparse mapping weight matrix is combined with pyramid pooling to generate representation-level features. DS-ELM is used to classify generated features. In order to utilize the similarity information between the features of each layer, different weight matrices in ELN are separately trained to improve the recognition ability. In the training of sparse mapping weight matrix, the auto-encoded dictionary and similarity constrained linear coding (SCLC) method are proposed to encode the desired output. The sparse mapping weight matrix is trained by using partial descriptor features and corresponding desired outputs. In the training of the classification weights, the ELM objective function is updated by similarity relationship between hidden layer features to derive the training formula of DS-ELM, which improves the classification performance while avoiding iterative training. To verify the feasibility of the ELN, experiments are conducted on Olympic Sports, UCF11, Hollywood2, UCF101, and Self-collection databases. Experimental results show that the proposed ELN is able to directly process descriptor features. And, the similarity information between the features of each layer can be further utilized by ELN to obtain excellent recognition performance compared with other improved methods based on ELM.
引用
收藏
页码:48631 / 48658
页数:28
相关论文
共 66 条
[1]   A framework of human action recognition using length control features fusion and weighted entropy-variances based feature selection [J].
Afza, Farhat ;
Khan, Muhammad Attique ;
Sharif, Muhammad ;
Kadry, Seifedine ;
Manogaran, Gunasekaran ;
Saba, Tanzila ;
Ashraf, Imran ;
Damasevicius, Robertas .
IMAGE AND VISION COMPUTING, 2021, 106
[2]   Optimized deep learning-based cricket activity focused network and medium scale benchmark [J].
Ahmad, Waqas ;
Munsif, Muhammad ;
Ullah, Habib ;
Ullah, Mohib ;
Alsuwailem, Alhanouf Abdulrahman ;
Saudagar, Abdul Khader Jilani ;
Muhammad, Khan ;
Sajjad, Muhammad .
ALEXANDRIA ENGINEERING JOURNAL, 2023, 73 :771-779
[3]   Quick extreme learning machine for large-scale classification [J].
Albtoush, Audi ;
Fernandez-Delgado, Manuel ;
Cernadas, Eva ;
Barro, Senen .
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (08) :5923-5938
[4]   Affinity propagation clustering-aided two-label hierarchical extreme learning machine for Wi-Fi fingerprinting-based indoor positioning [J].
Alitaleshi, Atefe ;
Jazayeriy, Hamid ;
Kazemitabar, Javad .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 13 (6) :3303-3317
[5]   Densely connected convolutional extreme learning machine for hyperspectral image classification [J].
Cai, Yaoming ;
Zhang, Zijia ;
Yan, Qin ;
Zhang, Dongfang ;
Banu, Mst Jainab .
NEUROCOMPUTING, 2021, 434 :21-32
[6]   Maximum Correntropy Criterion-Based Hierarchical One-Class Classification [J].
Cao, Jiuwen ;
Dai, Haozhen ;
Lei, Baiying ;
Yin, Chun ;
Zeng, Huanqiang ;
Kummert, Anton .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (08) :3748-3754
[7]   Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].
Carreira, Joao ;
Zisserman, Andrew .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733
[8]   MARS: Motion-Augmented RGB Stream for Action Recognition [J].
Crasto, Nieves ;
Weinzaepfel, Philippe ;
Alahari, Karteek ;
Schmid, Cordelia .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7874-7883
[9]   Human action recognition using two-stream attention based LSTM networks [J].
Dai, Cheng ;
Liu, Xingang ;
Lai, Jinfeng .
APPLIED SOFT COMPUTING, 2020, 86
[10]   Skeleton-Based Multifeatures and Multistream Network for Real-Time Action Recognition [J].
Deng, Zhiwen ;
Gao, Qing ;
Ju, Zhaojie ;
Yu, Xiang .
IEEE SENSORS JOURNAL, 2023, 23 (07) :7397-7409