Voice pathology detection by using the deep network architecture

Cited by: 7
Authors
Ankishan, Haydar [1]
Inam, Sitki Cagdas [2]
Affiliations
[1] Baskent Univ, Vocat Sch Tech Sci, Ankara, Turkey
[2] Baskent Univ, Elect & Elect Engn, Ankara, Turkey
Keywords
Voice disorders; Hybrid feature vector; Voice pathology detection; Deep network architecture; DISORDERS; SIGNAL
DOI
10.1016/j.asoc.2021.107310
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Pathological voice disorders are among the conditions that negatively affect daily life. The aim of this study is to introduce a new feature vector on the hybrid axis and a multi-model in order to diagnose these disorders with more conventional methods. Two different databases are used, and the results are compared with those of previous studies. Two types of fusion (feature-level and decision-level fusion) are used to increase the classification accuracy of the multi-model. The experimental results show that the proposed multi-model gives its highest classification accuracies with decision-level fusion (DLF). Across the two databases, the highest accuracy rate (99.58%) is obtained with DLF. The experiments also show that the proposed feature vector helps to classify pathological data successfully according to their pathological conditions. Within the proposed multi-model architecture, LSTM and CNN are found to be similarly successful in classifying the data. (C) 2021 Elsevier B.V. All rights reserved.
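The two fusion strategies named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state how the feature vectors are combined or which rule merges the model decisions, so concatenation (for feature-level fusion) and probability averaging (for decision-level fusion) are assumptions, and all function names and numeric values below are hypothetical.

```python
from typing import List, Sequence


def feature_level_fusion(feature_sets: Sequence[Sequence[float]]) -> List[float]:
    """Concatenate several feature vectors into one vector for a single classifier.

    Assumption: fusion by simple concatenation; the paper may combine features differently.
    """
    fused: List[float] = []
    for features in feature_sets:
        fused.extend(features)
    return fused


def decision_level_fusion(model_probs: Sequence[Sequence[float]]) -> int:
    """Average per-model class probabilities and return the winning class index.

    Assumption: unweighted averaging; the paper's actual decision rule is not given.
    """
    n_models = len(model_probs)
    n_classes = len(model_probs[0])
    mean_probs = [
        sum(probs[c] for probs in model_probs) / n_models
        for c in range(n_classes)
    ]
    return max(range(n_classes), key=lambda c: mean_probs[c])


# Hypothetical feature vectors from two feature extractors.
acoustic = [0.12, 0.80]
spectral = [0.33, 0.05, 0.41]
fused_vector = feature_level_fusion([acoustic, spectral])

# Hypothetical class probabilities (index 0 = healthy, 1 = pathological)
# produced by two separately trained models, e.g. an LSTM and a CNN.
lstm_probs = [0.40, 0.60]
cnn_probs = [0.70, 0.30]
fused_label = decision_level_fusion([lstm_probs, cnn_probs])
```

In this sketch, feature-level fusion yields one longer input vector for a single classifier, while decision-level fusion leaves each model untouched and combines only their outputs.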
Pages: 11