DURATION-NORMALIZED FEATURE SELECTION FOR INDIAN SPOKEN LANGUAGE IDENTIFICATION IN UTTERANCE LENGTH MISMATCH

被引:0
作者
Bakshi, Aarti M. [1 ]
Kopparapu, Sunil K. [2 ]
机构
[1] UMIT, SNDT, Dept Elect & Commun, Mumbai, Maharashtra, India
[2] TATA Consultancy Serv, TCS Res, Yantra Pk, Thana, India
来源
JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY | 2022年 / 17卷 / 03期
关键词
Classifier; Feature selection; Indian language; Spoken language identification;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Spoken Indian language identification (SLID) plays a significant role in multilingual call center automation. The ability to identify the language of a very short-length utterance is crucial in a call center to enable route the call to an agent who can communicate in the native language of the caller. In this paper, we propose a duration normalized feature selection technique and show through extensive experimentation that this helps in improving the language identification, especially when the length of the spoken utterance is unknown a priori. We show that proposed duration normalized feature selection followed by output fusion of different classifiers perform best for utterance length mismatch condition. The relative improvement in accuracy from 31.5% to 99.0% and 10.4% to 25.9%, when trained with 30 s utterances and tested with 15 s and 0.2 s utterances, is achieved using a 150 duration normalized feature set.
引用
收藏
页码:2120 / 2134
页数:15
相关论文
共 21 条
  • [1] Native Language Identification in Very Short Utterances Using Bidirectional Long Short-Term Memory Network
    Adeeba, Farah
    Hussain, Sarmad
    [J]. IEEE ACCESS, 2019, 7 : 17098 - 17110
  • [2] Feature Selection for Speech Emotion Recognition in Spanish and Basque: On the Use of Machine Learning to Improve Human-Computer Interaction
    Arruti, Andoni
    Cearreta, Idoia
    Alvarez, Aitor
    Lazkano, Elena
    Sierra, Basilio
    [J]. PLOS ONE, 2014, 9 (10):
  • [3] A survey on swarm intelligence approaches to feature selection in data mining
    Bach Hoai Nguyen
    Xue, Bing
    Zhang, Mengjie
    [J]. SWARM AND EVOLUTIONARY COMPUTATION, 2020, 54
  • [4] Bakshi A., 2020, SPOKEN INDIAN LANGUA
  • [5] Indian language identification using time-frequency image textural descriptors and GWO-based feature selection
    Chowdhury, Amit A.
    Borkar, Vaibhav S.
    Birajdar, Gajanan K.
    [J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2020, 32 (01) : 111 - 132
  • [6] A Hybrid Meta-Heuristic Feature Selection Method for Identification of Indian Spoken Languages From Audio Signals
    Das, Aankit
    Guha, Samarpan
    Singh, Pawan Kumar
    Ahmadian, Ali
    Senu, Norazak
    Sarkar, Ram
    [J]. IEEE ACCESS, 2020, 8 : 181432 - 181449
  • [7] Dehak N, 2011, 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, P864
  • [8] Metric learning loss functions to reduce domain mismatch in the x-vector space for language recognition
    Duroselle, Raphael
    Jouvet, Denis
    Illina, Irina
    [J]. INTERSPEECH 2020, 2020, : 447 - 451
  • [9] Eyben F., 2010, P 18 ACM INT C MULT, P1459, DOI DOI 10.1145/1873951.1874246
  • [10] Sub-band Envelope Features using Frequency Domain Linear Prediction for Short Duration Language Identification
    Fernando, Sarith
    Sethu, Vidhyasaharan
    Ambikairajah, Eliathamby
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1818 - 1822