Multi-sensor data fusion for sign language recognition based on dynamic Bayesian networks and convolution neural networks

Cited by: 0

Authors:

Zhao, Y. D. [1]

Xiao, Q. K. [1]

Wang, H. [1]

Affiliations:

[1] Xian Technol Univ, Dept Elect Informat Engn, Xian, Shaanxi, Peoples R China

Source:

AUTOMATIC CONTROL, MECHATRONICS AND INDUSTRIAL ENGINEERING | 2019

Keywords:

sign language recognition; dynamic Bayesian network; convolution neural network; multi-sensor data;

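DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract: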

A new multi-sensor fusion framework for Sign Language Recognition (SLR) is proposed, based on a Convolution Neural Network (CNN) and a Dynamic Bayesian Network (DBN). A Microsoft Kinect, a low-cost RGB-D sensor, serves as the Human-Computer Interaction (HCI) device. First, color and depth videos are collected with the Kinect, and features are extracted from all image sequences using the CNN. The color and depth feature sequences are then input into the DBN as observation data. Using the graphical-model fusion mechanism, the maximum hidden-state probability is computed and taken as the recognition result for dynamic isolated sign language. Existing SLR methods are also tested on the dataset for comparison. With the proposed DBN+CNN SLR framework, the highest recognition rate reaches 99.40%, and the test results show that the approach is effective.
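The fusion step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the DBN's graph structure, so everything here is an assumption made for illustration: one shared hidden chain per sign, diagonal-Gaussian emissions for each stream, and recognition by picking the sign whose model scores highest. The names `sequence_log_likelihood`, `recognize`, and the model dictionary keys are hypothetical, and the inputs stand in for CNN feature sequences that would come from the color and depth videos.

```python
import numpy as np

def log_gauss(x, mean, var):
    """Log density of a diagonal Gaussian. x: (T, D); mean, var: (K, D) -> (T, K)."""
    diff = x[:, None, :] - mean[None, :, :]
    return -0.5 * np.sum(diff ** 2 / var + np.log(2 * np.pi * var), axis=2)

def logsumexp(a, axis):
    """Numerically stable log(sum(exp(a))) along one axis."""
    m = np.max(a, axis=axis, keepdims=True)
    return np.squeeze(m, axis=axis) + np.log(np.sum(np.exp(a - m), axis=axis))

def sequence_log_likelihood(color, depth, model):
    """Forward algorithm for an assumed two-stream model with one hidden chain."""
    # Assumed fusion rule: the two streams are conditionally independent given
    # the hidden state, so their per-frame log emission scores add.
    log_b = (log_gauss(color, model["color_mean"], model["color_var"])
             + log_gauss(depth, model["depth_mean"], model["depth_var"]))
    log_trans = np.log(model["trans"])            # (K, K), row = previous state
    log_alpha = np.log(model["prior"]) + log_b[0]
    for t in range(1, len(log_b)):
        # Sum over the previous state, then weight by the current emission.
        log_alpha = logsumexp(log_alpha[:, None] + log_trans, axis=0) + log_b[t]
    return logsumexp(log_alpha, axis=0)

def recognize(color, depth, models):
    """Return the sign whose (hypothetical) model gives the highest score."""
    scores = {sign: sequence_log_likelihood(color, depth, m)
              for sign, m in models.items()}
    return max(scores, key=scores.get)
```

A call would look like `recognize(color_feats, depth_feats, models)`, where both feature arrays have shape `(T, D)` and `models` maps each sign label to its parameters. Adding log scores across streams is only one possible fusion rule; a coupled structure with one hidden chain per sensor would need a different forward recursion.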


Pages: 329-336

Page count: 8
