A robust pathological voices recognition system based on DCNN and scattering transform

被引：12

作者：

Souli, Sameh ^{[1
,2
]}

Amami, Rimah ^{[3
]}

Ben Yahia, Sadok ^{[1
,4
]}

机构：

[1] Univ Tunis El Manar, Fac Sci Tunis, Tunis 2092, Tunisia

[2] Private Int Polytech Sch Tunis, Polytech Innovat Lab PI LAB, Tunis, Tunisia

[3] Imam AbdulRahman Bin Faisal Univ, Comp Sci Dept, Deanship Preparatory Year & Supporting Studies, Dammam, Saudi Arabia

[4] Tallinn Univ Technol, Dept Software Sci, Tallinn, Estonia

来源：

APPLIED ACOUSTICS | 2021年 / 177卷

关键词：

DCNN; Scattering transform; Pathology recognition; Deep Learning; NEURAL-NETWORKS; FEATURES;

D O I：

10.1016/j.apacoust.2020.107854

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The Deep Neural Networks (DNNs) have recently shown a high performance applied to speech classification tasks. In this paper, we argue that the improved accuracy generated by the Deep Convolutional Neural Network (DCNN) classifier is the result of their ability to extract discriminative representations. They are efficient to the different sources of variability in speech signals. We propose, in this study, a new algorithm, called ST-DCNN in order to classify normal and pathological voices. We demonstrate the improvement of recognizing voices theory with advances in speech features in order to improve the identification pathological voices. The proposed approach operates in two steps: First, we extract scatter wavelet features. Then, we introduce the DCNN for voices classification. The performance of the proposed system is evaluated based on silent and noisy environments using various Signal-to-Noise Ratio (SNR) levels. The results underscore that our proposed system shows better performance using scattering wavelet and DCNN in a silent environment with 99.62% of recognition rate. (C) 2020 Elsevier Ltd. All rights reserved.

引用

页数：7

共 50 条

[21] Iris Based Recognition System Using Wavelet Transform
Narote, Sandipan P.
Narotte, Abhilasha S.
Waghmare, Laxman M.
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2009, 9 (11): : 101 - 104
[22] Speaker Recognition Based on 3DCNN-LSTM
Hu, ZhangFang
Si, XingTong
Luo, Yuan
Tang, ShanShan
Jian, Fang
ENGINEERING LETTERS, 2021, 29 (02) : 463 - 470
[23] Attributed Scattering Center Guided Adversarial Attack for DCNN SAR Target Recognition
Zhou, Junfan
Feng, Sijia
Sun, Hao
Zhang, Linbin
Kuang, Gangyao
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
[24] Attributed Scattering Center Guided Adversarial Attack for DCNN SAR Target Recognition
Zhou, Junfan
Feng, Sijia
Sun, Hao
Zhang, Linbin
Kuang, Gangyao
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
[25] Eisoc with ifodpso and dcnn classifier for diabetic retinopathy recognition system
Thomas, Neetha Merin
Jerome, S. Albert
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (14) : 42561 - 42583
[26] Eisoc with ifodpso and dcnn classifier for diabetic retinopathy recognition system
Neetha Merin Thomas
S. Albert Jerome
Multimedia Tools and Applications, 2024, 83 : 42561 - 42583
[27] A robust 2D-Cochlear transform-based palmprint recognition
Chaudhary, Gopal
Srivastava, Smriti
SOFT COMPUTING, 2020, 24 (03) : 2311 - 2328
[28] A robust 2D-Cochlear transform-based palmprint recognition
Gopal Chaudhary
Smriti Srivastava
Soft Computing, 2020, 24 : 2311 - 2328
[29] Robust face recognition based on illumination invariant in nonsubsampled contourlet transform domain
Cheng, Yong
Hou, Yingkun
Zhao, Chunxia
Li, Zuoyong
Hu, Yong
Wang, Cailing
NEUROCOMPUTING, 2010, 73 (10-12) : 2217 - 2224
[30] Robust Iris Recognition Based on Statistical Properties of Walsh Hadamard Transform Domain
Dhavale, Sunita V.
International Journal of Computer Science Issues, 2012, 9 (1 1-2): : 118 - 123

← 1 2 3 4 5 →