A robust pathological voices recognition system based on DCNN and scattering transform

被引:12
|
作者
Souli, Sameh [1 ,2 ]
Amami, Rimah [3 ]
Ben Yahia, Sadok [1 ,4 ]
机构
[1] Univ Tunis El Manar, Fac Sci Tunis, Tunis 2092, Tunisia
[2] Private Int Polytech Sch Tunis, Polytech Innovat Lab PI LAB, Tunis, Tunisia
[3] Imam AbdulRahman Bin Faisal Univ, Comp Sci Dept, Deanship Preparatory Year & Supporting Studies, Dammam, Saudi Arabia
[4] Tallinn Univ Technol, Dept Software Sci, Tallinn, Estonia
关键词
DCNN; Scattering transform; Pathology recognition; Deep Learning; NEURAL-NETWORKS; FEATURES;
D O I
10.1016/j.apacoust.2020.107854
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The Deep Neural Networks (DNNs) have recently shown a high performance applied to speech classification tasks. In this paper, we argue that the improved accuracy generated by the Deep Convolutional Neural Network (DCNN) classifier is the result of their ability to extract discriminative representations. They are efficient to the different sources of variability in speech signals. We propose, in this study, a new algorithm, called ST-DCNN in order to classify normal and pathological voices. We demonstrate the improvement of recognizing voices theory with advances in speech features in order to improve the identification pathological voices. The proposed approach operates in two steps: First, we extract scatter wavelet features. Then, we introduce the DCNN for voices classification. The performance of the proposed system is evaluated based on silent and noisy environments using various Signal-to-Noise Ratio (SNR) levels. The results underscore that our proposed system shows better performance using scattering wavelet and DCNN in a silent environment with 99.62% of recognition rate. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:7
相关论文
共 50 条
  • [21] Iris Based Recognition System Using Wavelet Transform
    Narote, Sandipan P.
    Narotte, Abhilasha S.
    Waghmare, Laxman M.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2009, 9 (11): : 101 - 104
  • [22] Speaker Recognition Based on 3DCNN-LSTM
    Hu, ZhangFang
    Si, XingTong
    Luo, Yuan
    Tang, ShanShan
    Jian, Fang
    ENGINEERING LETTERS, 2021, 29 (02) : 463 - 470
  • [23] Attributed Scattering Center Guided Adversarial Attack for DCNN SAR Target Recognition
    Zhou, Junfan
    Feng, Sijia
    Sun, Hao
    Zhang, Linbin
    Kuang, Gangyao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [24] Attributed Scattering Center Guided Adversarial Attack for DCNN SAR Target Recognition
    Zhou, Junfan
    Feng, Sijia
    Sun, Hao
    Zhang, Linbin
    Kuang, Gangyao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [25] Eisoc with ifodpso and dcnn classifier for diabetic retinopathy recognition system
    Thomas, Neetha Merin
    Jerome, S. Albert
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (14) : 42561 - 42583
  • [26] Eisoc with ifodpso and dcnn classifier for diabetic retinopathy recognition system
    Neetha Merin Thomas
    S. Albert Jerome
    Multimedia Tools and Applications, 2024, 83 : 42561 - 42583
  • [27] A robust 2D-Cochlear transform-based palmprint recognition
    Chaudhary, Gopal
    Srivastava, Smriti
    SOFT COMPUTING, 2020, 24 (03) : 2311 - 2328
  • [28] A robust 2D-Cochlear transform-based palmprint recognition
    Gopal Chaudhary
    Smriti Srivastava
    Soft Computing, 2020, 24 : 2311 - 2328
  • [29] Robust face recognition based on illumination invariant in nonsubsampled contourlet transform domain
    Cheng, Yong
    Hou, Yingkun
    Zhao, Chunxia
    Li, Zuoyong
    Hu, Yong
    Wang, Cailing
    NEUROCOMPUTING, 2010, 73 (10-12) : 2217 - 2224
  • [30] Robust Iris Recognition Based on Statistical Properties of Walsh Hadamard Transform Domain
    Dhavale, Sunita V.
    International Journal of Computer Science Issues, 2012, 9 (1 1-2): : 118 - 123