Automatic Gender Authentication from Arabic Speech Using Hybrid Learning

被引:0
作者
Khan, Amjad Rehman [1 ]
机构
[1] Prince Sultan Univ, Coll Comp & Informat Sci, Artificial Intelligence & Data Analyt Lab AIDA, Riyadh 11586, Saudi Arabia
关键词
speech recognition; Arabic language; gender classification; hybrid learning; technological development; RECOGNITION; VOICE; CLASSIFICATION;
D O I
10.12720/jait.15.4.532-543
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Speech recognition is progressively being utilized in practical applications with time. Automatic gender identification is one of the most intriguing applications since it distinguishes female and male speeches from briefly spoken communication records. This is advantageous in various applications, including automated conversation systems, system verification, demographic attribute prediction and assessing speaker's expressions. Speech is a natural mode of communication, and pitch variation of a gender-specific speech signal is often used to identify a person as male or female. This paper presents a model to identify gender from Arabic speech by integrating audio preprocessing, Mel-Frequency Cepstral Coefficients (MFCC), Delta MFCC, and Log Filter bank feature extraction. Pre-processing involves testing pre-emphasis, framing, windowing, and Fast Fourier Transform. Finally, features are extracted using three feature extraction methods from the processed audios. Feed Forward Neural Networks and Keras-based Neural Networks are employed as classifier models. Regarding accuracy and simplicity, the proposed hybrid method surpasses most previous approaches discussed in the literature for gender categorization from Arabic speech. The proposed model achieved an average classification accuracy of 93.09%.
引用
收藏
页码:532 / 543
页数:12
相关论文
共 36 条
  • [1] A framework of human action recognition using length control features fusion and weighted entropy-variances based feature selection
    Afza, Farhat
    Khan, Muhammad Attique
    Sharif, Muhammad
    Kadry, Seifedine
    Manogaran, Gunasekaran
    Saba, Tanzila
    Ashraf, Imran
    Damasevicius, Robertas
    [J]. IMAGE AND VISION COMPUTING, 2021, 106
  • [2] Albaraq M. O. A., 2020, International Journal of Advanced Research in Computer Science, V11
  • [3] Harris Hawks Sparse Auto-Encoder Networks for Automatic Speech Recognition System
    Ali, Mohammed Hasan
    Jaber, Mustafa Musa
    Abd, Sura Khalil
    Rehman, Amjad
    Awan, Mazhar Javed
    Vitkute-Adzgauskiene, Daiva
    Damasevicius, Robertas
    Bahaj, Saeed Ali
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (03):
  • [4] Unsupervised learning blocking keys technique for indexing Arabic entity resolution
    Alian, Marwah
    Awajan, Arafat
    Ramadan, Bandan
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (03) : 621 - 628
  • [5] DGR: Gender Recognition of Human Speech Using One-Dimensional Conventional Neural Network
    Alkhawaldeh, Rami S.
    [J]. SCIENTIFIC PROGRAMMING, 2019, 2019
  • [6] Alrajhi Khwlah, 2019, International Journal of Computing and Digital Systems, V8, P307, DOI 10.12785/ijcds/080310
  • [7] Investigating the effects of gender, dialect, and training size on the performance of Arabic speech recognition
    Alsharhan, Eiman
    Ramsay, Allan
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2020, 54 (04) : 975 - 998
  • [8] Comparative Study of Fingerprint-Based Gender Identification
    Berriche, Lamia
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [9] Gender identification for Egyptian Arabic dialect in twitter using deep learning models
    ElSayed, Shereen
    Farouk, Mona
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2020, 21 (03) : 159 - 167
  • [10] An effective gender recognition approach using voice data via deeper LSTM networks
    Ertam, Fatih
    [J]. APPLIED ACOUSTICS, 2019, 156 : 351 - 358