A focus module-based lightweight end-to-end CNN framework for voiceprint recognition

Cited by: 9

Authors:

Velayuthapandian, Karthikeyan [1]

Subramoniam, Suja Priyadharsini [2]

Affiliations:

[1] Mepco Schlenk Engn Coll, Dept Elect & Commun Engn, Sivakasi, Tamil Nadu, India

[2] Anna Univ Reg Campus, Dept Elect & Commun Engn, Tirunelveli, Tamil Nadu, India

Source:

SIGNAL IMAGE AND VIDEO PROCESSING | 2023 / Vol. 17 / Issue 06

Keywords:

Speaker recognition; Deep neural network; Spectrogram; 1-D CNN; Focus module; SUPPORT VECTOR MACHINES; SPEAKER; SYSTEM;

DOI:

10.1007/s11760-023-02500-7

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

The process of identifying a spokesperson from a collection of subsequent time series data is referred to as speaker identification. Convolutional neural networks (CNNs) and deep neural networks are the two types of neural networks that are used in the majority of modern experimental approaches. This work presents a CNN model for speaker identification using a jump-connected one-dimensional convolutional neural network (1-D CNN) with a focus module (FM). The 1-D convolutional layer integrated with FM is employed in the presented model for speaker characteristic extraction and lessens heterogeneity in the temporal and spatial domains, allowing for quicker layer processing. Furthermore, the layered CNN hopping interconnection is employed to overcome the connectivity glitches, and a solution based on softmax loss and smooth L1-norm combined regulation is presented to increase efficiency. The recommended network model was evaluated using the ELSDSR, TIMIT, NIST, 16,000 PCM, and experimental audio datasets. According to experimental data, the equal error rate (EER) of end-to-end CNN for voiceprint identification is 9.02% higher than baseline approaches. In experiments, our proposed speaker recognition (SR) model, which we refer to as the deep FM-1D CNN, had a high recognition accuracy of 99.21%. Moreover, the observations demonstrate that the proposed network model is more robust than other models.

Pages: 2817-2825

Page count: 9
