Gesture recognition based on multi-modal feature weight

被引:66
作者
Duan, Haojie [1 ]
Sun, Ying [1 ,2 ]
Cheng, Wentao [1 ,3 ]
Jiang, Du [1 ,3 ]
Yun, Juntong [2 ,3 ]
Liu, Ying [4 ]
Liu, Yibo [4 ]
Zhou, Dalin [5 ]
机构
[1] Wuhan Univ Sci & Technol, Key Lab Met Equipment & Control Technol, Minist Educ, Wuhan 430081, Peoples R China
[2] Wuhan Univ Sci & Technol, Inst Precis Mfg, Wuhan, Peoples R China
[3] Wuhan Univ Sci & Technol, Res Ctr Biomimet Robot & Intelligent Measurement, Wuhan, Peoples R China
[4] Wuhan Univ Sci & Technol, Hubei Key Lab Mech Transmiss & Mfg Engn, Wuhan, Peoples R China
[5] Univ Portsmouth, Sch Comp, Portsmouth, Hants, England
基金
中国国家自然科学基金;
关键词
gesture recognition; RGB-D; multi-modal fusion; weight adaptation; NETWORK; ALGORITHM; CNNS;
D O I
10.1002/cpe.5991
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the continuous development of sensor technology, the acquisition cost of RGB-D images is getting lower and lower, and gesture recognition based on depth images and Red-Green-Blue (RGB) images has gradually become a research direction in the field of pattern recognition. However, most of the current processing methods for RGB-D gesture images are relatively simple, ignoring the relationship and influence between its two modes, and unable to make full use of the correlation factors between different modes. In view of the above problems, this paper optimizes the effect of RGB-D information processing by considering the independent features and related features of multi-modal data to construct a weight adaptive algorithm to fuse different features. Simulation experiments show that the method proposed in this paper is better than the traditional RGB-D gesture image processing method and the gesture recognition rate is higher. Comparing the current more advanced gesture recognition methods, the method proposed in this paper also achieves higher recognition accuracy, which verifies the feasibility and robustness of this method.
引用
收藏
页数:13
相关论文
共 74 条
[1]  
[Anonymous], 2012, Advances in Neural Information Processing Systems
[2]  
Blum M, 2012, IEEE INT CONF ROBOT, P1298, DOI 10.1109/ICRA.2012.6225188
[3]  
Bo L., 2013, Experimental Robotics, volume 88 of Springer Tracts in Advanced Robotics, P387
[4]   An under-sampled software defect prediction method based on hybrid multi-objective cuckoo search [J].
Cai, Xingjuan ;
Niu, Yun ;
Geng, Shaojin ;
Zhang, Jiangjiang ;
Cui, Zhihua ;
Li, Jianwei ;
Chen, Jinjun .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (05)
[5]   Multi-Objective Three-Dimensional DV-Hop Localization Algorithm With NSGA-II [J].
Cai, Xingjuan ;
Wang, Penghong ;
Du, Lei ;
Cui, Zhihua ;
Zhang, Wensheng ;
Chen, Jinjun .
IEEE SENSORS JOURNAL, 2019, 19 (21) :10003-10015
[6]   An Interactive Image Segmentation Method in Hand Gesture Recognition [J].
Chen, Disi ;
Li, Gongfa ;
Sun, Ying ;
Kong, Jianyi ;
Jiang, Guozhang ;
Tang, Heng ;
Ju, Zhaojie ;
Yu, Hui ;
Liu, Honghai .
SENSORS, 2017, 17 (02)
[7]   Modeling of Emergency Supply Scheduling Problem Based on Reliability and Its Solution Algorithm under Variable Road Network after Sudden-Onset Disasters [J].
Chen, Tinggui ;
Wu, Shiwen ;
Yang, Jianjun ;
Cong, Guodong ;
Li, Gongfa .
COMPLEXITY, 2020, 2020
[8]   Public Opinion Polarization by Individual Revenue from the Social Preference Theory [J].
Chen, Tinggui ;
Li, Qianqian ;
Fu, Peihua ;
Yang, Jianjun ;
Xu, Chonghuan ;
Cong, Guodong ;
Li, Gongfa .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2020, 17 (03)
[9]   Modeling of the Public Opinion Polarization Process with the Considerations of Individual Heterogeneity and Dynamic Conformity [J].
Chen, Tinggui ;
Li, Qianqian ;
Yang, Jianjun ;
Cong, Guodong ;
Li, Gongfa .
MATHEMATICS, 2019, 7 (10)
[10]   Risk Propagation Model and Its Simulation of Emergency Logistics Network Based on Material Reliability [J].
Chen, Tinggui ;
Wu, Shiwen ;
Yang, Jianjun ;
Cong, Guodong .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2019, 16 (23)