Encoding learning network combined with feature similarity constraints for human action recognition

被引:0
作者
Chao Wu
Yakun Gao
Guang Li
Chunfeng Shi
机构
[1] Henan Institute of Technology,School of Electrical Engineering and Automation
来源
Multimedia Tools and Applications | 2024年 / 83卷
关键词
Extreme learning machine (ELM); Feature encoding; Similarity constraint; Human action recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Extreme learning machine (ELM) is a fast and efficient classifier. Due to the inability to process descriptor-level features extracted from video sequences, the networks based on ELM cannot be directly used to recognize human actions. Encoding learning network (ELN) is proposed to solve this problem. The network is composed of feature encoding module and double similarity-constrained extreme learning machine (DS-ELM). In feature encoding module, the sparse mapping weight matrix is combined with pyramid pooling to generate representation-level features. DS-ELM is used to classify generated features. In order to utilize the similarity information between the features of each layer, different weight matrices in ELN are separately trained to improve the recognition ability. In the training of sparse mapping weight matrix, the auto-encoded dictionary and similarity constrained linear coding (SCLC) method are proposed to encode the desired output. The sparse mapping weight matrix is trained by using partial descriptor features and corresponding desired outputs. In the training of the classification weights, the ELM objective function is updated by similarity relationship between hidden layer features to derive the training formula of DS-ELM, which improves the classification performance while avoiding iterative training. To verify the feasibility of the ELN, experiments are conducted on Olympic Sports, UCF11, Hollywood2, UCF101, and Self-collection databases. Experimental results show that the proposed ELN is able to directly process descriptor features. And, the similarity information between the features of each layer can be further utilized by ELN to obtain excellent recognition performance compared with other improved methods based on ELM.
引用
收藏
页码:48631 / 48658
页数:27
相关论文
共 172 条
[1]  
Krizhevsky A(2017)ImageNet classification with deep convolutional neural networks Commun ACM 60 84-90
[2]  
Sutskever I(2006)Extreme learning machine: theory and applications Neurocomputing 70 489-501
[3]  
Hinton GE(2012)Extreme learning machine for regression and multiclass classification IEEE Trans Syst Man Cybern Part B (Cybernetics) 42 513-529
[4]  
Huang GB(2013)Training extreme learning machine via regularized correntropy criterion Neural Comput Appl 23 1977-1986
[5]  
Zhu QY(2018)Improving image classification accuracy with ELM and CSIFT Comput Sci Eng 21 26-34
[6]  
Siew CK(2017)Going deeper into action recognition: a survey Image Vis Comput 60 4-21
[7]  
Huang GB(2017)A comprehensive review on handcrafted and learning-based action representation approaches for human activity recognition Appl Sci 7 110-110
[8]  
Zhou H(2013)Representational learning with extreme learning machine for big data IEEE Intell Syst 28 31-34
[9]  
Ding X(2021)Dynamic hand gesture recognition based on 3D hand pose estimation for human–robot interaction IEEE Sens J 22 17421-17430
[10]  
Zhang R(2023)Skeleton-Based Multifeatures and Multistream Network for Real-Time Action Recognition IEEE Sens J 23 7397-7409