Feature extraction in Brazilian Sign Language Recognition based on phonological structure and using RGB-D sensors

Cited by: 71

Authors:

Almeida, Silvia Grasiella Moreira [1,2]

Guimaraes, Frederico Gadelha [3]

Ramirez, Jaime Arturo [3]

Affiliations:

[1] Univ Fed Minas Gerais, Grad Program Elect Engn, BR-31270901 Belo Horizonte, MG, Brazil

[2] Fed Inst Minas Gerais, Ouro Preto, MG, Brazil

[3] Univ Fed Minas Gerais, Dept Elect Engn, Belo Horizonte, MG, Brazil

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2014 / Vol. 41 / Issue 16

Keywords:

Brazilian Sign Language Recognition; RGB-D sensors; Feature extraction; HAND GESTURE RECOGNITION; LOW-COST; TRANSLATION; TRANSFORM; KINECT; SYSTEM; WORDS; MODEL; FIELD

DOI:

10.1016/j.eswa.2014.05.024

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In contrast to speech recognition, whose features have been extensively explored in the research literature, feature extraction in Sign Language Recognition (SLR) remains a very challenging problem. In this paper we present a methodology for feature extraction in Brazilian Sign Language (BSL, or LIBRAS in Portuguese) that explores the phonological structure of the language and relies on an RGB-D sensor to obtain intensity, position and depth data. From the RGB-D images we obtain seven vision-based features. Each feature is related to one, two or three structural elements in BSL. We investigate the relation between the extracted features and the structural elements based on the shape, movement and position of the hands. Finally, we employ Support Vector Machines (SVM) to classify signs based on these features and linguistic elements. The experiments show that the attributes of these elements can be successfully recognized from the features obtained from the RGB-D images, with individual accuracies above 80% on average. The proposed feature extraction methodology and the decomposition of signs into their phonological structure are a promising approach to support expert systems designed for SLR. (C) 2014 Elsevier Ltd. All rights reserved.

Citation:

Pages: 7259-7271

Page count: 13
