A robust pathological voices recognition system based on DCNN and scattering transform

被引:12
|
作者
Souli, Sameh [1 ,2 ]
Amami, Rimah [3 ]
Ben Yahia, Sadok [1 ,4 ]
机构
[1] Univ Tunis El Manar, Fac Sci Tunis, Tunis 2092, Tunisia
[2] Private Int Polytech Sch Tunis, Polytech Innovat Lab PI LAB, Tunis, Tunisia
[3] Imam AbdulRahman Bin Faisal Univ, Comp Sci Dept, Deanship Preparatory Year & Supporting Studies, Dammam, Saudi Arabia
[4] Tallinn Univ Technol, Dept Software Sci, Tallinn, Estonia
关键词
DCNN; Scattering transform; Pathology recognition; Deep Learning; NEURAL-NETWORKS; FEATURES;
D O I
10.1016/j.apacoust.2020.107854
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The Deep Neural Networks (DNNs) have recently shown a high performance applied to speech classification tasks. In this paper, we argue that the improved accuracy generated by the Deep Convolutional Neural Network (DCNN) classifier is the result of their ability to extract discriminative representations. They are efficient to the different sources of variability in speech signals. We propose, in this study, a new algorithm, called ST-DCNN in order to classify normal and pathological voices. We demonstrate the improvement of recognizing voices theory with advances in speech features in order to improve the identification pathological voices. The proposed approach operates in two steps: First, we extract scatter wavelet features. Then, we introduce the DCNN for voices classification. The performance of the proposed system is evaluated based on silent and noisy environments using various Signal-to-Noise Ratio (SNR) levels. The results underscore that our proposed system shows better performance using scattering wavelet and DCNN in a silent environment with 99.62% of recognition rate. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] On the use of Deep Learning and Scattering Transform for Pathological voices recognition
    Souli, S.
    Amami, R.
    Soltani, A.
    Ben Yahia, S.
    2022 8TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT'22), 2022, : 1055 - 1058
  • [2] Robust Face Recognition Based on DCNN and CRC
    Yuan, Li-Na
    Cen, Feng
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND INFORMATION SYSTEMS, 2016, 52 : 117 - 122
  • [3] Recognition of pathological voices
    Salma, Chekili
    Asma, Belhaj
    Aicha, Bouzid
    Noureddine, Ellouze
    2014 11TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2014,
  • [4] Recognition of sound vibration by DCNN based on φ-OTDR system
    Chen, Cong
    Li, Jiamin
    Qin, Zujun
    Xiong, Xianming
    Zhang, Wentao
    AOPC 2021: MICRO-OPTICS AND MOEMS, 2021, 12066
  • [5] A DCNN-Based Fast NIR Face Recognition System Robust to Reflected Light From Eyeglasses
    Kim, Jeyeon
    Ra, Moonsoo
    Kim, Whoi-Yul
    IEEE ACCESS, 2020, 8 : 80948 - 80963
  • [6] Robust movement human actions recognition using DCNN
    Xu, Honghua
    Li, Li
    Fang, Ming
    Zhang, Fengrong
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2018, 124 : 108 - 109
  • [7] Robust Pol-ISAR Target Recognition Based on ST-MC-DCNN
    Bai, Xueru
    Zhou, Xuening
    Zhang, Feng
    Wang, Li
    Xue, Ruihang
    Zhou, Feng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (12): : 9912 - 9927
  • [8] Applying the wavelet transform for the analysis of normal and pathological voices
    Jimenez, Carlos
    Diaz, Jose A.
    Del Pino, Paulino
    Rothman, Howard
    INGENIERIA UC, 2008, 15 (01): : 7 - 13
  • [9] Recognition of Pathological Voices Based on Fractal Theory Using Gaussian Mixture Model
    Gao, Junfen
    Yu, Yanping
    Hu, Weiping
    2010 INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONIC AND COMPUTER SCIENCE, VOLS 1-3, 2010, : 1668 - +
  • [10] Stockwell Transform based Pace Recognition: A Robust and an Accurate Approach
    Shekar, B. H.
    Rajesh, D. S.
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 168 - 174