A robust pathological voices recognition system based on DCNN and scattering transform

被引:12
|
作者
Souli, Sameh [1 ,2 ]
Amami, Rimah [3 ]
Ben Yahia, Sadok [1 ,4 ]
机构
[1] Univ Tunis El Manar, Fac Sci Tunis, Tunis 2092, Tunisia
[2] Private Int Polytech Sch Tunis, Polytech Innovat Lab PI LAB, Tunis, Tunisia
[3] Imam AbdulRahman Bin Faisal Univ, Comp Sci Dept, Deanship Preparatory Year & Supporting Studies, Dammam, Saudi Arabia
[4] Tallinn Univ Technol, Dept Software Sci, Tallinn, Estonia
关键词
DCNN; Scattering transform; Pathology recognition; Deep Learning; NEURAL-NETWORKS; FEATURES;
D O I
10.1016/j.apacoust.2020.107854
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The Deep Neural Networks (DNNs) have recently shown a high performance applied to speech classification tasks. In this paper, we argue that the improved accuracy generated by the Deep Convolutional Neural Network (DCNN) classifier is the result of their ability to extract discriminative representations. They are efficient to the different sources of variability in speech signals. We propose, in this study, a new algorithm, called ST-DCNN in order to classify normal and pathological voices. We demonstrate the improvement of recognizing voices theory with advances in speech features in order to improve the identification pathological voices. The proposed approach operates in two steps: First, we extract scatter wavelet features. Then, we introduce the DCNN for voices classification. The performance of the proposed system is evaluated based on silent and noisy environments using various Signal-to-Noise Ratio (SNR) levels. The results underscore that our proposed system shows better performance using scattering wavelet and DCNN in a silent environment with 99.62% of recognition rate. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Smart Data Driven System for Pathological Voices Classification
    Fernandes, Joana
    Candido Junior, Arnaldo
    Freitas, Diamantino
    Teixeira, Joao Paulo
    OPTIMIZATION, LEARNING ALGORITHMS AND APPLICATIONS, OL2A 2022, 2022, 1754 : 419 - 426
  • [42] Architecture of invariant transform based traffic sign recognition system
    Turan, Jan
    Turan, Jan, Jr.
    Ovsenik, Lubos
    Fifik, Martin
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA 2008, 2008, : 19 - 22
  • [43] An Intelligent System Based on Discrete Cosine Transform for Speech Recognition
    Silva, Washington
    Serra, Ginalber
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2012, 2012, 7637 : 320 - 329
  • [44] Extra Matters Recognition of Transmission System Based on Hough Transform
    Yan, Shujia
    Jin, Lijun
    2011 ASIA-PACIFIC POWER AND ENERGY ENGINEERING CONFERENCE (APPEEC), 2011,
  • [45] Iris recognition based on wavelet neural network transform system
    Wang, Anna
    Chen, Yu
    Zhang, Xinhua
    Wu, Jie
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 115 - +
  • [46] Classification and recognition approaches of tomato main organs based on DCNN
    Zhou Y.
    Xu T.
    Zheng W.
    Deng H.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2017, 33 (15): : 219 - 226
  • [47] Robust face recognition using the modified census transform
    Yun, Woo-han
    Yoon, Ho-Sub
    Kim, Do-Hyung
    Chi, Su-young
    2007 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES, VOLS 1-3, 2007, : 749 - 752
  • [48] Noise-Robust HRRP Target Recognition Based on Residual Scattering Network
    Huang, Pengjun
    Li, Shuai
    Zheng, Muhai
    Xie, Jingyang
    Tian, Biao
    Xu, Shiyou
    2024 9TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, ICSIP, 2024, : 37 - 41
  • [49] Exploring trace transform for robust human action recognition
    Goudelis, Georgios
    Karpouzis, Konstantinos
    Kollias, Stefanos
    PATTERN RECOGNITION, 2013, 46 (12) : 3238 - 3248
  • [50] Intelligent recognition of coal mine microseismic signal based on wavelet scattering decomposition transform
    Fan X.
    Cheng J.
    Wang Y.
    Li S.
    Duan J.
    Wang P.
    Meitan Xuebao/Journal of the China Coal Society, 2022, 47 (07): : 2722 - 2731