Laplacian eigenmaps based manifold regularized CNN for visual

被引:1
作者
Zong, Ming [1 ]
Ma, Zhizhong [2 ]
Zhu, Fangyi [3 ]
Ma, Yujun [4 ]
Wang, Ruili [2 ]
机构
[1] Shanghai Inst Technol, Sch Comp Sci & Informat Engn, Shanghai, Peoples R China
[2] Wenzhou Univ Technol, Sch Data Sci & Artificial Intelligence, Wenzhou, Peoples R China
[3] Tik Tok Pte Ltd, Singapore, Singapore
[4] Massey Univ, Sch Math & Computat Sci, Auckland, New Zealand
关键词
Action recognition; Manifold learning; Laplacian eigenmaps; CNN; NETWORKS;
D O I
10.1016/j.ins.2024.121503
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The paper proposes a novel Laplacian eigenmaps based manifold regularized CNN (LE-CNN) for action recognition. The proposed LE-CNN model incorporates Laplacian eigenmaps based manifold structure information of training samples into CNN layer by layer, which can keep adjacent samples as close as possible during space transformation. In addition, the Laplacian eigenmaps based manifold structure can accelerate convergence during the training process. Experiments are performed on two standard action datasets, UCF101 and HMDB51, to validate the proposed LE-CNN model. Furthermore, we extend experiments on a large-scale action recognition dataset (i.e. the Kinetics dataset) and compare the LE-CNN model with other advanced models. In addition to action recognition, we also apply the LE-CNN model to an image classification task on the CIFAR-10 dataset, to demonstrate the effectiveness of the LE-CNN model across different classification tasks.
引用
收藏
页数:12
相关论文
共 39 条
[1]   ViViT: A Video Vision Transformer [J].
Arnab, Anurag ;
Dehghani, Mostafa ;
Heigold, Georg ;
Sun, Chen ;
Lucic, Mario ;
Schmid, Cordelia .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :6816-6826
[2]  
Belkin M, 2002, ADV NEUR IN, V14, P585
[3]  
Belkin M, 2006, J MACH LEARN RES, V7, P2399
[4]   Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].
Carreira, Joao ;
Zisserman, Andrew .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733
[5]   Deep Manifold Learning Combined With Convolutional Neural Networks for Action Recognition [J].
Chen, Xin ;
Weng, Jian ;
Lu, Wei ;
Xu, Jiaming ;
Weng, Jiasi .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (09) :3938-3952
[6]   Online Selective Kernel-Based Temporal Difference Learning [J].
Chen, Xingguo ;
Gao, Yang ;
Wang, Ruili .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (12) :1944-1956
[7]   Background-foreground interaction for moving object detection in dynamic scenes [J].
Chen, Zhe ;
Wang, Ruili ;
Zhang, Zhen ;
Wang, Huibin ;
Xu, Lizhong .
INFORMATION SCIENCES, 2019, 483 :65-81
[8]  
Donahue J, 2015, PROC CVPR IEEE, P2625, DOI 10.1109/CVPR.2015.7298878
[9]   Learning Spatiotemporal Features with 3D Convolutional Networks [J].
Du Tran ;
Bourdev, Lubomir ;
Fergus, Rob ;
Torresani, Lorenzo ;
Paluri, Manohar .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4489-4497
[10]   Feature selection for least squares projection twin support vector machine [J].
Guo, Jianhui ;
Yi, Ping ;
Wang, Ruili ;
Ye, Qiaolin ;
Zhao, Chunxia .
NEUROCOMPUTING, 2014, 144 :174-183