A robust pathological voices recognition system based on DCNN and scattering transform

被引:12
|
作者
Souli, Sameh [1 ,2 ]
Amami, Rimah [3 ]
Ben Yahia, Sadok [1 ,4 ]
机构
[1] Univ Tunis El Manar, Fac Sci Tunis, Tunis 2092, Tunisia
[2] Private Int Polytech Sch Tunis, Polytech Innovat Lab PI LAB, Tunis, Tunisia
[3] Imam AbdulRahman Bin Faisal Univ, Comp Sci Dept, Deanship Preparatory Year & Supporting Studies, Dammam, Saudi Arabia
[4] Tallinn Univ Technol, Dept Software Sci, Tallinn, Estonia
关键词
DCNN; Scattering transform; Pathology recognition; Deep Learning; NEURAL-NETWORKS; FEATURES;
D O I
10.1016/j.apacoust.2020.107854
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The Deep Neural Networks (DNNs) have recently shown a high performance applied to speech classification tasks. In this paper, we argue that the improved accuracy generated by the Deep Convolutional Neural Network (DCNN) classifier is the result of their ability to extract discriminative representations. They are efficient to the different sources of variability in speech signals. We propose, in this study, a new algorithm, called ST-DCNN in order to classify normal and pathological voices. We demonstrate the improvement of recognizing voices theory with advances in speech features in order to improve the identification pathological voices. The proposed approach operates in two steps: First, we extract scatter wavelet features. Then, we introduce the DCNN for voices classification. The performance of the proposed system is evaluated based on silent and noisy environments using various Signal-to-Noise Ratio (SNR) levels. The results underscore that our proposed system shows better performance using scattering wavelet and DCNN in a silent environment with 99.62% of recognition rate. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] SVM-based identification of pathological voices
    Chen, Wenxi
    Peng, Ce
    Zhu, Xin
    Wan, Baikun
    Wei, Daming
    2007 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-16, 2007, : 3786 - 3789
  • [32] Detection of Pathological Voices Using Discrete Wavelet Transform and Artificial Neural Networks
    Shia, S. Emerald
    Jayasree, T.
    2017 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT TECHNIQUES IN CONTROL, OPTIMIZATION AND SIGNAL PROCESSING (INCOS), 2017,
  • [33] Robust Face Recognition System in Video using Hybrid Scale Invariant Feature Transform
    Mohanraj, V
    Vimalkumar, M.
    Mithila, M.
    Vaidehi, V.
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATIONS, 2016, 93 : 503 - 512
  • [34] A Robust Vein Pattern-based Recognition System
    Soni, Mohit
    Gupta, Phalguni
    JOURNAL OF COMPUTERS, 2012, 7 (11) : 2711 - 2718
  • [35] An Iris Recognition Based Robust Intrusion Detection System
    Joshi, Kavita
    Agrawal, Sunil
    2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [36] PC-based system for robust speaker recognition
    Central Laboratory of Biomedical Engineering, Bulgarian Academy of Sciences, Acad. G. Bonchev Str., Block 105, Sofia
    1113, Bulgaria
    J. Compt. Inf. Technol., 4 (415-423):
  • [37] A robust video based license plate recognition system
    Bremananth, R
    Chitra, A
    2005 International Conference on Intelligent Sensing and Information Processing, Proceedings, 2005, : 175 - 180
  • [38] DCNN and DNN Based Multi-modal Depression Recognition
    Yang, Le
    Jiang, Dongmei
    Han, Wenjing
    Sahli, Hichem
    2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2017, : 484 - 489
  • [39] HRRP image recognition of midcourse ballistic targets based on DCNN
    Xiang Q.
    Wang X.
    Li R.
    Lai J.
    Zhang G.
    1600, Chinese Institute of Electronics (42): : 2426 - 2433
  • [40] Acoustic analysis of pathological voices compressed with MPEG system
    Gonzalez, J
    Cervera, T
    Llau, MJ
    JOURNAL OF VOICE, 2003, 17 (02) : 126 - 139