A lazy learning-based language identification from speech using MFCC-2 features

被引:0
|
作者
Himadri Mukherjee
Sk Md Obaidullah
K. C. Santosh
Santanu Phadikar
Kaushik Roy
机构
[1] West Bengal State University,Department of Computer Science
[2] Aliah University,Department of Computer Science and Engineering
[3] The University of South Dakota,Department of Computer Science
[4] Maulana Abul Kalam Azad University of Technology,Department of Computer Science and Engineering
关键词
Lazy learning; Speech recognition; Language identification; Mel frequency cepstral coefficient-based features;
D O I
暂无
中图分类号
学科分类号
摘要
Developing an automatic speech recognition system for multilingual countries like India is a challenging task due to the fact that the people are inured to using multiple languages while talking. This makes language identification from speech an important and essential task prior to recognition of the same. In this paper a system is proposed towards language identification from multilingual speech signals. A new second level Mel frequency cepstral coefficient-based feature named MFCC-2 that handles the large and uneven dimensionality of MFCC has been used to characterize languages in the thick of English, Bangla and Hindi. The system has been tested with recordings of as many as 12,000 utterances of numerals and 41,884 clips extracted from YouTube videos considering background music, data from multiple environments, avoidance of noise suppression and use of keywords from different languages in a single phrase. The highest and average accuracies (for Top-3 classifiers from a pool of nine classifiers) of 98.09% and 95.54%, respectively were achieved for YouTube data.
引用
收藏
页码:1 / 14
页数:13
相关论文
共 50 条
  • [41] ELM speaker identification for limited dataset using multitaper based MFCC and PNCC features with fusion score
    Bharath K P
    Rajesh Kumar M
    Multimedia Tools and Applications, 2020, 79 : 28859 - 28883
  • [42] ELM speaker identification for limited dataset using multitaper based MFCC and PNCC features with fusion score
    Bharath, K. P.
    Kumar, Rajesh M.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (39-40) : 28859 - 28883
  • [43] GMM based language identification system using robust features
    Manchala, Sadanandam
    Prasad, V.
    Janaki, V.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (02) : 99 - 105
  • [44] Automated Identification of Heart Failure With Reduced Ejection Fraction Using Deep Learning-Based Natural Language Processing
    Nargesi, Arash A.
    Adejumo, Philip
    Dhingra, Lovedeep Singh
    Rosand, Benjamin
    Hengartner, Astrid
    Coppi, Andreas
    Benigeri, Simon
    Sen, Sounok
    Ahmad, Tariq
    Nadkarni, Girish N.
    Lin, Zhenqiu
    Ahmad, Faraz S.
    Krumholz, Harlan M.
    Khera, Rohan
    JACC-HEART FAILURE, 2025, 13 (01) : 75 - 87
  • [45] Reading functional requirements using machine learning-based language processing
    Akay, Haluk
    Kim, Sang-Gook
    CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2021, 70 (01) : 139 - 142
  • [46] Deep Learning-Based End-to-End Speaker Identification Using Time-Frequency Representation of Speech Signal
    Saritha, Banala
    Laskar, Mohammad Azharuddin
    Kirupakaran, Anish Monsley
    Laskar, Rabul Hussain
    Choudhury, Madhuchhanda
    Shome, Nirupam
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2023, 43 (3) : 1839 - 1861
  • [47] Transfer learning-based electrocardiogram classification using wavelet scattered features
    Sabeenian, R.
    Janani, K. Sree
    BIOMEDICAL AND BIOTECHNOLOGY RESEARCH JOURNAL, 2023, 7 (01): : 52 - 59
  • [48] Plant disease and pest detection using deep learning-based features
    Turkoglu, Muammer
    Hanbay, Davut
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (03) : 1636 - 1651
  • [49] Predicting river water height using deep learning-based features
    Borwarnginn, Punyanuch
    Haga, Jason H.
    Kusakunniran, Worapan
    ICT EXPRESS, 2022, 8 (04): : 588 - 594
  • [50] Learning-based license plate detection using global and local features
    Zhang, Huaifeng
    Jia, Wenjing
    He, Xiangjian
    Wu, Qiang
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 1102 - +