Exploring the feasibility of smart phone microphone for measurement of acoustic voice parameters and voice pathology screening

被引:58
作者
Uloza, Virgilijus [1 ]
Padervinskis, Evaldas [1 ]
Vegiene, Aurelija [1 ]
Pribuisiene, Ruta [1 ]
Saferis, Viktoras [2 ]
Vaiciukynas, Evaldas [3 ]
Gelzinis, Adas [3 ]
Verikas, Antanas [3 ,4 ]
机构
[1] Lithuanian Univ Hlth Sci, Dept Otolaryngol, LT-50009 Kaunas, Lithuania
[2] Lithuanian Univ Hlth Sci, Dept Phys Math & Biophys, LT-50009 Kaunas, Lithuania
[3] Kaunas Univ Technol, Dept Elect Power Syst, Kaunas, Lithuania
[4] Halmstad Univ, Dept Intelligent Syst, Halmstad, Sweden
关键词
Acoustic analysis; Voice screening; Smart phone; CLASSIFICATION; SPEECH; QUESTIONNAIRE; RELIABILITY; PREVALENCE; EFFICACY; PROGRAM;
D O I
10.1007/s00405-015-3708-4
中图分类号
R76 [耳鼻咽喉科学];
学科分类号
100213 ;
摘要
The objective of this study is to evaluate the reliability of acoustic voice parameters obtained using smart phone (SP) microphones and investigate the utility of use of SP voice recordings for voice screening. Voice samples of sustained vowel/a/obtained from 118 subjects (34 normal and 84 pathological voices) were recorded simultaneously through two microphones: oral AKG Perception 220 microphone and SP Samsung Galaxy Note3 microphone. Acoustic voice signal data were measured for fundamental frequency, jitter and shimmer, normalized noise energy (NNE), signal to noise ratio and harmonic to noise ratio using Dr. Speech software. Discriminant analysis-based Correct Classification Rate (CCR) and Random Forest Classifier (RFC) based Equal Error Rate (EER) were used to evaluate the feasibility of acoustic voice parameters classifying normal and pathological voice classes. Lithuanian version of Glottal Function Index (LT_GFI) questionnaire was utilized for self-assessment of the severity of voice disorder. The correlations of acoustic voice parameters obtained with two types of microphones were statistically significant and strong (r = 0.73-1.0) for the entire measurements. When classifying into normal/pathological voice classes, the Oral-NNE revealed the CCR of 73.7 % and the pair of SP-NNE and SP-shimmer parameters revealed CCR of 79.5 %. However, fusion of the results obtained from SP voice recordings and GFI data provided the CCR of 84.60 % and RFC revealed the EER of 7.9 %, respectively. In conclusion, measurements of acoustic voice parameters using SP microphone were shown to be reliable in clinical settings demonstrating high CCR and low EER when distinguishing normal and pathological voice classes, and validated the suitability of the SP microphone signal for the task of automatic voice analysis and screening.
引用
收藏
页码:3391 / 3399
页数:9
相关论文
共 37 条
[1]  
[Anonymous], 2013, ARXIV13042865
[2]   Validity and reliability of the Glottal Function Index [J].
Bach, KK ;
Belafsky, PC ;
Wasylik, K ;
Postma, GN ;
Koufman, JA .
ARCHIVES OF OTOLARYNGOLOGY-HEAD & NECK SURGERY, 2005, 131 (11) :961-964
[3]   Reliability of OperaVOX against Multidimensional Voice Program (MDVP) [J].
Baki, Mat M. ;
Wood, G. ;
Alston, M. ;
Ratcliffe, P. ;
Sandhu, G. ;
Rubin, J. S. ;
Birchall, M. A. .
CLINICAL OTOLARYNGOLOGY, 2015, 40 (01) :22-28
[4]   The Prevalence of Voice Problems Among Adults in the United States [J].
Bhattacharyya, Neil .
LARYNGOSCOPE, 2014, 124 (10) :2359-2362
[5]   STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT [J].
BLAND, JM ;
ALTMAN, DG .
LANCET, 1986, 1 (8476) :307-310
[6]   Measuring Quality of Life in Dysphonic Patients: A Systematic Review of Content Development in Patient-Reported Outcomes Measures [J].
Branski, Ryan C. ;
Cukier-Blaj, Sabrina ;
Pusic, Andrea ;
Cano, Stefan J. ;
Klassen, Anne ;
Mener, David ;
Patel, Snehal ;
Kraus, Dennis H. .
JOURNAL OF VOICE, 2010, 24 (02) :193-198
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]  
Cohen SM, 2014, AM J MED, V128, P11
[9]   A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques - Guideline elaborated by the Committee on Phoniatrics of the European Laryngological Society (ELS) [J].
Dejonckere, PH ;
Bradley, P ;
Clemente, P ;
Cornut, G ;
Crevier-Buchman, L ;
Friedrich, G ;
Van de Heyning, P ;
Remacle, M ;
Woisard, V .
EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2001, 258 (02) :77-82
[10]   Classification of dysphonic voice: Acoustic and auditory-perceptual measures [J].
Eadie, TL ;
Doyle, TC .
JOURNAL OF VOICE, 2005, 19 (01) :1-14