Automatic Gender Authentication from Arabic Speech Using Hybrid Learning

Times cited: 0

Authors:

Khan, Amjad Rehman [1]

Affiliations:

[1] Prince Sultan University, College of Computer and Information Sciences, Artificial Intelligence & Data Analytics Lab (AIDA), Riyadh 11586, Saudi Arabia

Source:

Journal of Advances in Information Technology | 2024 / Vol. 15 / No. 04

Keywords:

speech recognition; Arabic language; gender classification; hybrid learning; technological development; RECOGNITION; VOICE; CLASSIFICATION;

DOI:

10.12720/jait.15.4.532-543

Chinese Library Classification number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Speech recognition is increasingly used in practical applications. Automatic gender identification is one of its most intriguing applications, since it distinguishes female from male speech in short spoken recordings. This is useful in a range of applications, including automated conversation systems, system verification, demographic attribute prediction, and assessment of a speaker's expressions. Speech is a natural mode of communication, and the pitch variation of a gender-specific speech signal is often used to identify a speaker as male or female. This paper presents a model that identifies gender from Arabic speech by combining audio pre-processing with three feature extraction methods: Mel-Frequency Cepstral Coefficients (MFCC), Delta MFCC, and Log Filter Bank features. Pre-processing involves pre-emphasis, framing, windowing, and the Fast Fourier Transform. Features are then extracted from the processed audio using the three methods. Feed Forward Neural Networks and Keras-based Neural Networks are employed as classifier models. In terms of accuracy and simplicity, the proposed hybrid method surpasses most previous approaches to gender classification from Arabic speech discussed in the literature. The proposed model achieved an average classification accuracy of 93.09%.
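The pre-processing chain named in the abstract (pre-emphasis, framing, windowing, Fourier transform) can be sketched as follows. This is a minimal illustration only, not the paper's implementation: the pre-emphasis coefficient 0.97, the frame length, the hop size, the Hamming window, and all function names are assumptions chosen for the example, and a naive DFT stands in for the FFT so that the sketch needs only the Python standard library. The mel filter bank, MFCC, and Delta MFCC stages are omitted.

```python
import cmath
import math


def pre_emphasis(signal, alpha=0.97):
    """Boost high frequencies: y[n] = x[n] - alpha * x[n-1] (alpha is an assumed value)."""
    return [signal[0]] + [signal[n] - alpha * signal[n - 1] for n in range(1, len(signal))]


def frame_signal(signal, frame_len, hop):
    """Split the signal into overlapping frames, dropping any incomplete tail."""
    return [signal[s:s + frame_len] for s in range(0, len(signal) - frame_len + 1, hop)]


def hamming(frame):
    """Apply a Hamming window to taper the frame edges."""
    n = len(frame)
    return [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1))) for i, x in enumerate(frame)]


def power_spectrum(frame):
    """Naive DFT power spectrum for bins 0..N/2 (O(N^2); a real system would use an FFT)."""
    n = len(frame)
    spec = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(frame))
        spec.append(abs(acc) ** 2 / n)
    return spec


def preprocess(signal, frame_len=64, hop=32):
    """Run pre-emphasis, framing, windowing, and the transform; returns one spectrum per frame."""
    emphasized = pre_emphasis(signal)
    return [power_spectrum(hamming(f)) for f in frame_signal(emphasized, frame_len, hop)]


# Hypothetical test tone: a 1 kHz sine sampled at 8 kHz, 256 samples long.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000 * n / sample_rate) for n in range(256)]
spectra = preprocess(tone)

# Locate the strongest frequency bin in the first frame and convert it to Hz.
peak_bin = max(range(len(spectra[0])), key=lambda k: spectra[0][k])
peak_hz = peak_bin * sample_rate / 64
```

For the synthetic tone above, the strongest bin of the first frame maps back to 1000 Hz, which is a quick sanity check that the framing and transform steps line up. In a full pipeline, each power spectrum would next pass through a mel filter bank to obtain the log filter bank and MFCC features the abstract mentions.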

Pages: 532-543

Page count: 12
