A review into deep learning techniques for spoken language identification

被引:0
|
作者
Irshad Ahmad Thukroo
Rumaan Bashir
Kaiser J. Giri
机构
[1] Islamic University of Science & Technology,Department of Computer Science
来源
Multimedia Tools and Applications | 2022年 / 81卷
关键词
Spoken language identification; Gaussian mixture model; Support vector machine; Hidden Markov model; Deep neural networks; Artificial neural network; Feed-forward neural network; Recurrent neural network; Convolutional neural network; Ensemble learning; hybridization approaches; Mel frequency cepstral coefficient features;
D O I
暂无
中图分类号
学科分类号
摘要
Information Technology has touched new vistas for a couple of decades mostly to simplify the day-to-day life of the humans. One of the key contributions of Information Technology is the application of Artificial Intelligence to achieve better results. The advent of artificial intelligence has given rise to a new branch of Natural Language Processing (NLP) called Computational Linguistics, which generates frameworks for intelligently manipulating spoken language knowledge and has brought human-machine onto a new stage. In this context, speech has arisen to be one of the imperative forms of interfaces, which is the basic mode of communication for us, and generally the most preferred one. Language identification, being the front-end for various natural language processing tasks, plays an important role in language translation. Owing to this, the focus has been given on the field of speech recognition involving the identification & recognition of languages by a machine. Spoken language identification is the identification of language present in a speech segment despite its size (duration & speed), ambiance (topic & emotion), and moderator (gender, age, demographic region). This paper has investigated various existing spoken language identification models implemented using different deep learning approaches, datasets, and performance measures utilized for their analysis. It also highlights the main features and challenges faced by these models. A comprehensive comparative study of deep learning techniques has been carried out for spoken language identification. Moreover, this review analyzes the efficiency of the spoken language models that can help the researchers to propose new language identification models for speech signals.
引用
收藏
页码:32593 / 32624
页数:31
相关论文
共 50 条
  • [1] A review into deep learning techniques for spoken language identification
    Thukroo, Irshad Ahmad
    Bashir, Rumaan
    Giri, Kaiser J.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (22) : 32593 - 32624
  • [2] Deep learning for spoken language identification: Can we visualize speech signal patterns?
    Mukherjee, Himadri
    Ghosh, Subhankar
    Sen, Shibaprasad
    Obaidullah, Sk Md
    Santosh, K. C.
    Phadikar, Santanu
    Roy, Kaushik
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (12) : 8483 - 8501
  • [3] Deep learning for spoken language identification: Can we visualize speech signal patterns?
    Himadri Mukherjee
    Subhankar Ghosh
    Shibaprasad Sen
    Obaidullah Sk Md
    K. C. Santosh
    Santanu Phadikar
    Kaushik Roy
    Neural Computing and Applications, 2019, 31 : 8483 - 8501
  • [4] FuzzyGCP: A deep learning architecture for automatic spoken language identification from speech signals
    Garain, Avishek
    Singh, Pawan Kumar
    Sarkar, Ram
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 168
  • [5] Performance Evaluation of Deep Bottleneck Features for Spoken Language Identification
    Jiang, Bing
    Song, Yan
    Wei, Si
    Wang, Meng-Ge
    McLoughlin, Ian
    Dai, Li-Rong
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 143 - +
  • [6] Spoken Language Identification Using Prosody, Phonotactics, and Acoustics: A Review
    Thukroo, Irshad Ahmad
    Bashir, Rumaan
    Giri, Kaiser J.
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2022, 21 (04)
  • [7] Spoken Language Identification with Deep Convolutional Neural Network and Data Augmentation
    Korkut, Can
    Haznedaroglu, Ali
    Arslan, Levent M.
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [8] A Systematic Review of Recent Machine Learning Techniques for Plant Disease Identification and Classification
    Goel, Lavika
    Nagpal, Jyoti
    IETE TECHNICAL REVIEW, 2023, 40 (03) : 423 - 439
  • [9] Spoken language understanding software for language learning
    Alam, Hassan
    Kumar, Aman
    Rahman, Fuad
    Hartono, Rachmat
    Tarnikova, Yuliya
    INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL II, 2007, : 107 - +
  • [10] Speech Emotion Recognition Using Deep Learning Techniques: A Review
    Khalil, Ruhul Amin
    Jones, Edward
    Babar, Mohammad Inayatullah
    Jan, Tariqullah
    Zafar, Mohammad Haseeb
    Alhussain, Thamer
    IEEE ACCESS, 2019, 7 : 117327 - 117345