Bivariate Empirical Mode Decomposition of Speech Signals for Disordered Voices Assessment

被引:0
作者
Boubekiria, Kawther [1 ]
Kacha, Abdellah [1 ]
机构
[1] Univ Mohammed Seddik Benyahia Jijel, Lab Radiat Phys & Applicat, BP 98 Ouled Aissa, Jijel 18000, Jijel, Algeria
关键词
Disordered voices assessment; Vocal dysperiodicities; Bivariate empirical mode decomposition; Intrinsic mode functions; Multivariate analysis; TO-NOISE RATIO; ACOUSTIC ANALYSIS;
D O I
10.1007/s00034-025-03028-6
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The acoustic analysis of the speech signal offers a privileged means for clinical assessment of the quality of the voice with a view to diagnosis and quantitative documentation of pathologies of the vocal folds. The major challenge of the acoustic analysis of the speech signal is to find reliable and accurate acoustic cues in order to determine the characteristics of the voice which provide information on the state of the speaker's larynx. We propose bivariate empirical mode decomposition (BEMD) for multivariate analysis of vocal dysperiodicities for disordered voices assessment. The BEMD is an extension of the conventional empirical mode decomposition devoted to the decomposition of bivariate signals. The BEMD is applied jointly in the time and frequency domains. The acoustic cues computed from the complex intrinsic mode functions (IMFs) in the time and frequency domains are used as predictor variables of the scores of the perceived hoarseness. The proposed method is tested on datasets comprising Spanish sustained vowels /a/, English sustained vowels /a/ and English sentences produced by healthy and pathological speakers. The results indicate that the proposed approach outperforms reference methods which are multi-band generalized variogram-based vocal dysperiodicies analysis method, higher-order statistics-based method and correlation-based method in terms of correlation of the acoustic cue with the scores of perceived hoarseness. The achieved improvements exceed 29% for Spanish sustained vowels, 2.6% for English sustained vowels and 21% for English sentences.
引用
收藏
页码:4423 / 4454
页数:32
相关论文
共 64 条
[1]   A Survey of Voice Pathology Surveillance Systems Based on Internet of Things and Machine Learning Algorithms [J].
Al-Dhief, Fahad Taha ;
Latiff, Nurul Mu'azzah Abdul ;
Abd Malik, Nik Noordini Nik ;
Salim, Naseer Sabri ;
Baki, Marina Mat ;
Albadr, Musatafa Abbas Abbood ;
Mohammed, Mazin Abed .
IEEE ACCESS, 2020, 8 :64514-64533
[2]   Investigation of Voice Pathology Detection and Classification on Different Frequency Regions Using Correlation Functions [J].
Al-nasheri, Ahmed ;
Muhammad, Ghulam ;
Alsulaiman, Mansour ;
Ali, Zulfiqar .
JOURNAL OF VOICE, 2017, 31 (01) :3-15
[3]   An Automatic Health Monitoring System for Patients Suffering From Voice Complications in Smart Cities [J].
Ali, Zulfiqar ;
Muhammad, Ghulam ;
Alhamid, Mohammed F. .
IEEE ACCESS, 2017, 5 :3900-3908
[4]   Multi-band dysperiodicity analyses of disordered connected speech [J].
Alpan, A. ;
Maryn, Y. ;
Kacha, A. ;
Grenez, F. ;
Schoentgen, J. .
SPEECH COMMUNICATION, 2011, 53 (01) :131-141
[5]   A clinical comparison between two acoustic analysis softwares: MDVP and Praat [J].
Amir, Ofer ;
Wolf, Michael ;
Amir, Noam .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2009, 4 (03) :202-205
[6]  
Bermudez de Alvear R. M., 2015, 11 PAN EUR VOIC C PE
[7]  
Bin Altaf MU, 2007, INT CONF ACOUST SPEE, P1009
[8]  
Boersma P., 2023, Praat: doing phonetics by computer
[9]  
Boubekiria K., 2019, P 4 INT C EMB SYST T
[10]   Voice Disorder Detection via an m-Health System: Design and Results of a Clinical Study to Evaluate Vox4Health [J].
Cesari, Ugo ;
De Pietro, Giuseppe ;
Marciano, Elio ;
Niri, Ciro ;
Sannino, Giovanna ;
Verde, Laura .
BIOMED RESEARCH INTERNATIONAL, 2018, 2018