A focus module-based lightweight end-to-end CNN framework for voiceprint recognition

被引:9
作者
Velayuthapandian, Karthikeyan [1 ]
Subramoniam, Suja Priyadharsini [2 ]
机构
[1] Mepco Schlenk Engn Coll, Dept Elect & Commun Engn, Sivakasi, Tamil Nadu, India
[2] Anna Univ Reg Campus, Dept Elect & Commun Engn, Tirunelveli, Tamil Nadu, India
关键词
Speaker recognition; Deep neural network; Spectrogram; 1-D CNN; Focus module; SUPPORT VECTOR MACHINES; SPEAKER; SYSTEM;
D O I
10.1007/s11760-023-02500-7
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The process of identifying a spokesperson from a collection of subsequent time series data is referred to as speaker identification. Convolutional neural networks (CNNs) and deep neural networks are the two types of neural networks that are used in the majority of modern experimental approaches. This work presents a CNN model for speaker identification using a jump-connected one-dimensional convolutional neural network (1-D CNN) with a focus module (FM). The 1-D convolutional layer integrated with FM is employed in the presented model for speaker characteristic extraction and lessens heterogeneity in the temporal and spatial domains, allowing for quicker layer processing. Furthermore, the layered CNN hopping interconnection is employed to overcome the connectivity glitches, and a solution based on softmax loss and smooth L1-norm combined regulation is presented to increase efficiency. The recommended network model was evaluated using the ELSDSR, TIMIT, NIST, 16,000 PCM, and experimental audio datasets. According to experimental data, the equal error rate (EER) of end-to-end CNN for voiceprint identification is 9.02% higher than baseline approaches. In experiments, our proposed speaker recognition (SR) model, which we refer to as the deep FM-1D CNN, had a high recognition accuracy of 99.21%. Moreover, the observations demonstrate that the proposed network model is more robust than other models.
引用
收藏
页码:2817 / 2825
页数:9
相关论文
共 50 条
  • [31] End-to-End Calcification Distribution Pattern Recognition for Mammograms: An Interpretable Approach with GNN
    Yao, Melissa Min-Szu
    Du, Hao
    Hartman, Mikael
    Chan, Wing P.
    Feng, Mengling
    DIAGNOSTICS, 2022, 12 (06)
  • [32] ResSKNet-SSDP: Effective and Light End-To-End Architecture for Speaker Recognition
    Deng, Fei
    Deng, Lihong
    Jiang, Peifan
    Zhang, Gexiang
    Yang, Qiang
    SENSORS, 2023, 23 (03)
  • [33] An End-to-End Network for Continuous Human Motion Recognition via Radar Radios
    Zhao, Running
    Ma, Xiaolin
    Liu, Xinhua
    Liu, Jian
    IEEE SENSORS JOURNAL, 2021, 21 (05) : 6487 - 6496
  • [34] An end-to-end framework for real-time automatic sleep stage classification
    Patanaik, Amiya
    Ong, Ju Lynn
    Gooley, Joshua J.
    Ancoli-Israel, Sonia
    Chee, Michael W. L.
    SLEEP, 2018, 41 (05)
  • [35] A Multiscale Autoencoder (MSAE) Framework for End-to-End Neural Network Speech Enhancement
    Borgstrom, Bengt J.
    Brandstein, Michael S.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 2418 - 2431
  • [36] Multi-objective optimization based multi-task learning for end-to-end license plates recognition
    Zhou X.-J.
    Gao Y.
    Li C.-J.
    Yang C.-H.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2021, 38 (05): : 676 - 688
  • [37] END-TO-END NEURAL NETWORK BASED AUTOMATED SPEECH SCORING
    Chen, Lei
    Tao, Jidong
    Ghaffarzadegan, Shabnam
    Qian, Yao
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6234 - 6238
  • [38] Energy aware Clustered blockchain data for IoT: An end-to-end lightweight secure & Enroute filtering approach
    Ramamoorthi, S.
    Kumar, B. Muthu
    Appathurai, Ahilan
    COMPUTER COMMUNICATIONS, 2023, 202 : 166 - 182
  • [39] PLDPNet: End-to-end hybrid deep learning framework for potato leaf disease prediction
    Arshad, Fizzah
    Mateen, Muhammad
    Hayat, Shaukat
    Wardah, Maryam
    Al-Huda, Zaid
    Gu, Yeong Hyeon
    Al-antari, Mugahed A.
    ALEXANDRIA ENGINEERING JOURNAL, 2023, 78 : 406 - 418
  • [40] A Hand Gesture-Operated System for Rehabilitation Using an End-to-End Detection Framework
    Dutta H.P.J.
    Bhuyan M.K.
    Neog D.R.
    Macdorman K.F.
    Laskar R.H.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (02): : 698 - 708