A robust pathological voices recognition system based on DCNN and scattering transform

被引：12

作者：

Souli, Sameh ^{[1
,2
]}

Amami, Rimah ^{[3
]}

Ben Yahia, Sadok ^{[1
,4
]}

机构：

[1] Univ Tunis El Manar, Fac Sci Tunis, Tunis 2092, Tunisia

[2] Private Int Polytech Sch Tunis, Polytech Innovat Lab PI LAB, Tunis, Tunisia

[3] Imam AbdulRahman Bin Faisal Univ, Comp Sci Dept, Deanship Preparatory Year & Supporting Studies, Dammam, Saudi Arabia

[4] Tallinn Univ Technol, Dept Software Sci, Tallinn, Estonia

来源：

APPLIED ACOUSTICS | 2021年 / 177卷

关键词：

DCNN; Scattering transform; Pathology recognition; Deep Learning; NEURAL-NETWORKS; FEATURES;

D O I：

10.1016/j.apacoust.2020.107854

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The Deep Neural Networks (DNNs) have recently shown a high performance applied to speech classification tasks. In this paper, we argue that the improved accuracy generated by the Deep Convolutional Neural Network (DCNN) classifier is the result of their ability to extract discriminative representations. They are efficient to the different sources of variability in speech signals. We propose, in this study, a new algorithm, called ST-DCNN in order to classify normal and pathological voices. We demonstrate the improvement of recognizing voices theory with advances in speech features in order to improve the identification pathological voices. The proposed approach operates in two steps: First, we extract scatter wavelet features. Then, we introduce the DCNN for voices classification. The performance of the proposed system is evaluated based on silent and noisy environments using various Signal-to-Noise Ratio (SNR) levels. The results underscore that our proposed system shows better performance using scattering wavelet and DCNN in a silent environment with 99.62% of recognition rate. (C) 2020 Elsevier Ltd. All rights reserved.

引用

页数：7

共 50 条

[31] SVM-based identification of pathological voices
Chen, Wenxi
Peng, Ce
Zhu, Xin
Wan, Baikun
Wei, Daming
2007 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-16, 2007, : 3786 - 3789
[32] Detection of Pathological Voices Using Discrete Wavelet Transform and Artificial Neural Networks
Shia, S. Emerald
Jayasree, T.
2017 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT TECHNIQUES IN CONTROL, OPTIMIZATION AND SIGNAL PROCESSING (INCOS), 2017,
[33] Robust Face Recognition System in Video using Hybrid Scale Invariant Feature Transform
Mohanraj, V
Vimalkumar, M.
Mithila, M.
Vaidehi, V.
PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATIONS, 2016, 93 : 503 - 512
[34] A Robust Vein Pattern-based Recognition System
Soni, Mohit
Gupta, Phalguni
JOURNAL OF COMPUTERS, 2012, 7 (11) : 2711 - 2718
[35] An Iris Recognition Based Robust Intrusion Detection System
Joshi, Kavita
Agrawal, Sunil
2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
[36] PC-based system for robust speaker recognition
Central Laboratory of Biomedical Engineering, Bulgarian Academy of Sciences, Acad. G. Bonchev Str., Block 105, Sofia
1113, Bulgaria
J. Compt. Inf. Technol., 4 (415-423):
[37] A robust video based license plate recognition system
Bremananth, R
Chitra, A
2005 International Conference on Intelligent Sensing and Information Processing, Proceedings, 2005, : 175 - 180
[38] DCNN and DNN Based Multi-modal Depression Recognition
Yang, Le
Jiang, Dongmei
Han, Wenjing
Sahli, Hichem
2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2017, : 484 - 489
[39] HRRP image recognition of midcourse ballistic targets based on DCNN
Xiang Q.
Wang X.
Li R.
Lai J.
Zhang G.
1600, Chinese Institute of Electronics (42): : 2426 - 2433
[40] Acoustic analysis of pathological voices compressed with MPEG system
Gonzalez, J
Cervera, T
Llau, MJ
JOURNAL OF VOICE, 2003, 17 (02) : 126 - 139

← 1 2 3 4 5 →