A robust pathological voices recognition system based on DCNN and scattering transform

被引：12

作者：

Souli, Sameh ^{[1
,2
]}

Amami, Rimah ^{[3
]}

Ben Yahia, Sadok ^{[1
,4
]}

机构：

[1] Univ Tunis El Manar, Fac Sci Tunis, Tunis 2092, Tunisia

[2] Private Int Polytech Sch Tunis, Polytech Innovat Lab PI LAB, Tunis, Tunisia

[3] Imam AbdulRahman Bin Faisal Univ, Comp Sci Dept, Deanship Preparatory Year & Supporting Studies, Dammam, Saudi Arabia

[4] Tallinn Univ Technol, Dept Software Sci, Tallinn, Estonia

来源：

APPLIED ACOUSTICS | 2021年 / 177卷

关键词：

DCNN; Scattering transform; Pathology recognition; Deep Learning; NEURAL-NETWORKS; FEATURES;

D O I：

10.1016/j.apacoust.2020.107854

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The Deep Neural Networks (DNNs) have recently shown a high performance applied to speech classification tasks. In this paper, we argue that the improved accuracy generated by the Deep Convolutional Neural Network (DCNN) classifier is the result of their ability to extract discriminative representations. They are efficient to the different sources of variability in speech signals. We propose, in this study, a new algorithm, called ST-DCNN in order to classify normal and pathological voices. We demonstrate the improvement of recognizing voices theory with advances in speech features in order to improve the identification pathological voices. The proposed approach operates in two steps: First, we extract scatter wavelet features. Then, we introduce the DCNN for voices classification. The performance of the proposed system is evaluated based on silent and noisy environments using various Signal-to-Noise Ratio (SNR) levels. The results underscore that our proposed system shows better performance using scattering wavelet and DCNN in a silent environment with 99.62% of recognition rate. (C) 2020 Elsevier Ltd. All rights reserved.

引用

页数：7

共 50 条

[1] On the use of Deep Learning and Scattering Transform for Pathological voices recognition
Souli, S.
Amami, R.
Soltani, A.
Ben Yahia, S.
2022 8TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT'22), 2022, : 1055 - 1058
[2] Robust Face Recognition Based on DCNN and CRC
Yuan, Li-Na
Cen, Feng
PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND INFORMATION SYSTEMS, 2016, 52 : 117 - 122
[3] Recognition of pathological voices
Salma, Chekili
Asma, Belhaj
Aicha, Bouzid
Noureddine, Ellouze
2014 11TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2014,
[4] Recognition of sound vibration by DCNN based on φ-OTDR system
Chen, Cong
Li, Jiamin
Qin, Zujun
Xiong, Xianming
Zhang, Wentao
AOPC 2021: MICRO-OPTICS AND MOEMS, 2021, 12066
[5] A DCNN-Based Fast NIR Face Recognition System Robust to Reflected Light From Eyeglasses
Kim, Jeyeon
Ra, Moonsoo
Kim, Whoi-Yul
IEEE ACCESS, 2020, 8 : 80948 - 80963
[6] Robust movement human actions recognition using DCNN
Xu, Honghua
Li, Li
Fang, Ming
Zhang, Fengrong
BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2018, 124 : 108 - 109
[7] Robust Pol-ISAR Target Recognition Based on ST-MC-DCNN
Bai, Xueru
Zhou, Xuening
Zhang, Feng
Wang, Li
Xue, Ruihang
Zhou, Feng
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (12): : 9912 - 9927
[8] Applying the wavelet transform for the analysis of normal and pathological voices
Jimenez, Carlos
Diaz, Jose A.
Del Pino, Paulino
Rothman, Howard
INGENIERIA UC, 2008, 15 (01): : 7 - 13
[9] Recognition of Pathological Voices Based on Fractal Theory Using Gaussian Mixture Model
Gao, Junfen
Yu, Yanping
Hu, Weiping
2010 INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONIC AND COMPUTER SCIENCE, VOLS 1-3, 2010, : 1668 - +
[10] Stockwell Transform based Pace Recognition: A Robust and an Accurate Approach
Shekar, B. H.
Rajesh, D. S.
2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 168 - 174

← 1 2 3 4 5 →