DURATION-NORMALIZED FEATURE SELECTION FOR INDIAN SPOKEN LANGUAGE IDENTIFICATION IN UTTERANCE LENGTH MISMATCH

被引:0
作者
Bakshi, Aarti M. [1 ]
Kopparapu, Sunil K. [2 ]
机构
[1] UMIT, SNDT, Dept Elect & Commun, Mumbai, Maharashtra, India
[2] TATA Consultancy Serv, TCS Res, Yantra Pk, Thana, India
来源
JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY | 2022年 / 17卷 / 03期
关键词
Classifier; Feature selection; Indian language; Spoken language identification;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Spoken Indian language identification (SLID) plays a significant role in multilingual call center automation. The ability to identify the language of a very short-length utterance is crucial in a call center to enable route the call to an agent who can communicate in the native language of the caller. In this paper, we propose a duration normalized feature selection technique and show through extensive experimentation that this helps in improving the language identification, especially when the length of the spoken utterance is unknown a priori. We show that proposed duration normalized feature selection followed by output fusion of different classifiers perform best for utterance length mismatch condition. The relative improvement in accuracy from 31.5% to 99.0% and 10.4% to 25.9%, when trained with 30 s utterances and tested with 15 s and 0.2 s utterances, is achieved using a 150 duration normalized feature set.
引用
收藏
页码:2120 / 2134
页数:15
相关论文
共 21 条
[1]   Native Language Identification in Very Short Utterances Using Bidirectional Long Short-Term Memory Network [J].
Adeeba, Farah ;
Hussain, Sarmad .
IEEE ACCESS, 2019, 7 :17098-17110
[2]  
[Anonymous], 2014, P INTERSPEECH
[3]   Feature Selection for Speech Emotion Recognition in Spanish and Basque: On the Use of Machine Learning to Improve Human-Computer Interaction [J].
Arruti, Andoni ;
Cearreta, Idoia ;
Alvarez, Aitor ;
Lazkano, Elena ;
Sierra, Basilio .
PLOS ONE, 2014, 9 (10)
[4]   A survey on swarm intelligence approaches to feature selection in data mining [J].
Bach Hoai Nguyen ;
Xue, Bing ;
Zhang, Mengjie .
SWARM AND EVOLUTIONARY COMPUTATION, 2020, 54
[5]  
Bakshi A., 2020, SPOKEN INDIAN LANGUA
[6]   Indian language identification using time-frequency image textural descriptors and GWO-based feature selection [J].
Chowdhury, Amit A. ;
Borkar, Vaibhav S. ;
Birajdar, Gajanan K. .
JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2020, 32 (01) :111-132
[7]   A Hybrid Meta-Heuristic Feature Selection Method for Identification of Indian Spoken Languages From Audio Signals [J].
Das, Aankit ;
Guha, Samarpan ;
Singh, Pawan Kumar ;
Ahmadian, Ali ;
Senu, Norazak ;
Sarkar, Ram .
IEEE ACCESS, 2020, 8 :181432-181449
[8]  
Dehak N, 2011, 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, P864
[9]   Metric learning loss functions to reduce domain mismatch in the x-vector space for language recognition [J].
Duroselle, Raphael ;
Jouvet, Denis ;
Illina, Irina .
INTERSPEECH 2020, 2020, :447-451
[10]  
Eyben F., 2010, P 18 ACM INT C MULT, P1459