Identification of Language using Mel-Frequency Cepstral Coefficients (MFCC)

被引:44
|
作者
Koolagudi, Shashidhar G. [1 ]
Rastogi, Deepika [1 ]
Rao, K. Sreenivasa [2 ]
机构
[1] Graph Era Univ, Sch Comp, Dehra Dun 248002, Uttarakhand, India
[2] Indian Inst Technol, Kharagpur 721302, W Bengal, India
关键词
Gaussian Mixture Model; Language identification; Mel-frequency Cepstral Coefficient; Spectral features;
D O I
10.1016/j.proeng.2012.06.392
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper focuses on the task of identifying a language from speech signal. In this paper, we have use Mel-frequency cepstral coefficient as features. Language identification models are developed for fifteen Indian languages namely Assamese, Bangla, Guajarati, Hindi, Kannada, Kashmiri, Malayalam, Marathi, Nepali, Oriya, Punjabi, Rajasthani, Tamil, Telugu and Urdu using these spectral features. The identification of above mentioned languages is carried out using Gaussian mixture model. A Semi natural read database is used for obtaining the language specific information. MFCC is obtained by using linear cosine transform of log power spectrum on a nonlinear mel-frequency scale. This paper shows that the performance of Language identification system is better when trained and tested with twenty nine features as compared to six, eight, thirteen, nineteen and twenty one MECC features. It means more the number of features we use better the result we get. The average language recognition rate over fifteen Indian languages is around 88\%. (C) 2012 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of Noorul Islam Centre for Higher Education
引用
收藏
页码:3391 / 3398
页数:8
相关论文
共 50 条
  • [41] Speech Based Arithmetic Calculator Using Mel-Frequency Cepstral Coefficients and Gaussian Mixture Models
    Husain, Moula
    Meena, S. M.
    Gonal, Manjunath K.
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND INFORMATICS (ICACNI 2015), VOL 1, 2016, 43 : 209 - 218
  • [42] Clean speech reconstruction from noisy MEL-frequency cepstral coefficients using a sinusoidal model
    Shao, X
    Milner, B
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 704 - 707
  • [43] UNDERSTANDING SARCASM IN SPEECH USING MEL-FREQUENCY CEPSTRAL COEFFICENT
    Mathur, Abhinav
    Saxena, Vikas
    Singh, Sandeep K.
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE AND ENGINEERING (CONFLUENCE 2017), 2017, : 728 - 732
  • [44] Prediction of fundamental frequency and voicing from mel-frequency cepstral coefficients for unconstrained speech reconstruction
    Milner, Ben
    Shao, Xu
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 24 - 33
  • [45] Integration of Mel-frequency Cepstral Coefficients with Log Energy and Temporal Derivatives for Text-Independent Speaker Identification
    Dhonde, S. B.
    Chaudhari, Amol
    Jagade, S. M.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 1, 2017, 468 : 791 - 797
  • [46] Drive-by bridge damage detection using Mel-frequency cepstral coefficients and support vector machine
    Li, Zhenkun
    Lin, Weiwei
    Zhang, Youqi
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2023, 22 (05): : 3302 - 3319
  • [47] Wind Turbine Gearbox Early Fault Detection Using Mel-Frequency Cepstral Coefficients of Vibration Data
    Velandia-Cardenas, Cristian
    Vidal, Yolanda
    Pozo, Francesc
    STRUCTURAL CONTROL & HEALTH MONITORING, 2024, 2024
  • [48] FET SMALL-SIGNAL MODELING USING MEL-FREQUENCY CEPSTRAL COEFFICIENTS AND THE DISCRETE COSINE TRANSFORM
    Elsharkawy, R. R.
    Hindy, M.
    El-Rabaie, S.
    Dessouky, M. I.
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2010, 19 (08) : 1835 - 1846
  • [49] Modelling and Characterization of an Artificial Neural Network for Infant Cry Recognition Using Mel-Frequency Cepstral Coefficients
    Bandala, Argel A.
    Lim, Allimzon M.
    Cai, Mark Anthony D.
    Bacar, Allan Jeffrey C.
    Manosca, Aynna Claudine G.
    TENCON 2014 - 2014 IEEE REGION 10 CONFERENCE, 2014,
  • [50] Classifying Heart Sound Recordings using Deep Convolutional Neural Networks and Mel-Frequency Cepstral Coefficients
    Rubin, Jonathan
    Abreu, Rui
    Ganguli, Anurag
    Nelaturi, Saigopal
    Matei, Ion
    Sricharan, Kumar
    2016 COMPUTING IN CARDIOLOGY CONFERENCE (CINC), VOL 43, 2016, 43 : 813 - 816