DURATION-NORMALIZED FEATURE SELECTION FOR INDIAN SPOKEN LANGUAGE IDENTIFICATION IN UTTERANCE LENGTH MISMATCH

Cited by: 0

Authors:

Bakshi, Aarti M. [1]

Kopparapu, Sunil K. [2]

Affiliations:

[1] UMIT, SNDT, Dept Elect & Commun, Mumbai, Maharashtra, India

[2] TATA Consultancy Serv, TCS Res, Yantra Pk, Thana, India

Source:

JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY | 2022 / Vol. 17 / No. 03

Keywords:

Classifier; Feature selection; Indian language; Spoken language identification

DOI:

Not available

Chinese Library Classification number:

T [Industrial Technology]

Subject classification code:

08

Abstract:

Spoken Indian language identification (SLID) plays a significant role in multilingual call center automation. The ability to identify the language of a very short utterance is crucial in a call center, because it allows the call to be routed to an agent who can communicate in the caller's native language. In this paper, we propose a duration-normalized feature selection technique and show through extensive experimentation that it improves language identification, especially when the length of the spoken utterance is unknown a priori. We show that the proposed duration-normalized feature selection, followed by output fusion of different classifiers, performs best under the utterance length mismatch condition. A relative improvement in accuracy from 31.5% to 99.0% and from 10.4% to 25.9%, when trained with 30 s utterances and tested with 15 s and 0.2 s utterances, is achieved using a set of 150 duration-normalized features.

Citation

Pages: 2120-2134

Page count: 15
