A review into deep learning techniques for spoken language identification

被引：0

作者：

Irshad Ahmad Thukroo

Rumaan Bashir

Kaiser J. Giri

机构：

[1] Islamic University of Science & Technology,Department of Computer Science

来源：

Multimedia Tools and Applications | 2022年 / 81卷

关键词：

Spoken language identification; Gaussian mixture model; Support vector machine; Hidden Markov model; Deep neural networks; Artificial neural network; Feed-forward neural network; Recurrent neural network; Convolutional neural network; Ensemble learning; hybridization approaches; Mel frequency cepstral coefficient features;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Information Technology has touched new vistas for a couple of decades mostly to simplify the day-to-day life of the humans. One of the key contributions of Information Technology is the application of Artificial Intelligence to achieve better results. The advent of artificial intelligence has given rise to a new branch of Natural Language Processing (NLP) called Computational Linguistics, which generates frameworks for intelligently manipulating spoken language knowledge and has brought human-machine onto a new stage. In this context, speech has arisen to be one of the imperative forms of interfaces, which is the basic mode of communication for us, and generally the most preferred one. Language identification, being the front-end for various natural language processing tasks, plays an important role in language translation. Owing to this, the focus has been given on the field of speech recognition involving the identification & recognition of languages by a machine. Spoken language identification is the identification of language present in a speech segment despite its size (duration & speed), ambiance (topic & emotion), and moderator (gender, age, demographic region). This paper has investigated various existing spoken language identification models implemented using different deep learning approaches, datasets, and performance measures utilized for their analysis. It also highlights the main features and challenges faced by these models. A comprehensive comparative study of deep learning techniques has been carried out for spoken language identification. Moreover, this review analyzes the efficiency of the spoken language models that can help the researchers to propose new language identification models for speech signals.

引用

页码：32593 / 32624

页数：31

共 50 条

[1] A review into deep learning techniques for spoken language identification
Thukroo, Irshad Ahmad
Bashir, Rumaan
Giri, Kaiser J.
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (22) : 32593 - 32624
[2] Deep learning for spoken language identification: Can we visualize speech signal patterns?
Mukherjee, Himadri
Ghosh, Subhankar
Sen, Shibaprasad
Obaidullah, Sk Md
Santosh, K. C.
Phadikar, Santanu
Roy, Kaushik
NEURAL COMPUTING & APPLICATIONS, 2019, 31 (12) : 8483 - 8501
[3] Deep learning for spoken language identification: Can we visualize speech signal patterns?
Himadri Mukherjee
Subhankar Ghosh
Shibaprasad Sen
Obaidullah Sk Md
K. C. Santosh
Santanu Phadikar
Kaushik Roy
Neural Computing and Applications, 2019, 31 : 8483 - 8501
[4] FuzzyGCP: A deep learning architecture for automatic spoken language identification from speech signals
Garain, Avishek
Singh, Pawan Kumar
Sarkar, Ram
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 168
[5] Performance Evaluation of Deep Bottleneck Features for Spoken Language Identification
Jiang, Bing
Song, Yan
Wei, Si
Wang, Meng-Ge
McLoughlin, Ian
Dai, Li-Rong
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 143 - +
[6] Spoken Language Identification Using Prosody, Phonotactics, and Acoustics: A Review
Thukroo, Irshad Ahmad
Bashir, Rumaan
Giri, Kaiser J.
JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2022, 21 (04)
[7] Spoken Language Identification with Deep Convolutional Neural Network and Data Augmentation
Korkut, Can
Haznedaroglu, Ali
Arslan, Levent M.
2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
[8] A Systematic Review of Recent Machine Learning Techniques for Plant Disease Identification and Classification
Goel, Lavika
Nagpal, Jyoti
IETE TECHNICAL REVIEW, 2023, 40 (03) : 423 - 439
[9] Spoken language understanding software for language learning
Alam, Hassan
Kumar, Aman
Rahman, Fuad
Hartono, Rachmat
Tarnikova, Yuliya
INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL II, 2007, : 107 - +
[10] Speech Emotion Recognition Using Deep Learning Techniques: A Review
Khalil, Ruhul Amin
Jones, Edward
Babar, Mohammad Inayatullah
Jan, Tariqullah
Zafar, Mohammad Haseeb
Alhussain, Thamer
IEEE ACCESS, 2019, 7 : 117327 - 117345

← 1 2 3 4 5 →