Toward Improved Ecological Validity in the Acoustic Measurement of Overall Voice Quality: Combining Continuous Speech and Sustained Vowels

Cited by: 276
Authors
Maryn, Youri [1, 2, 3]
Corthals, Paul [2, 3]
Van Cauwenberge, Paul [3]
Roy, Nelson [4]
De Bodt, Marc [3, 5]
Affiliations
[1] Sint Jan Gen Hosp, Dept Otorhinolaryngol Head & Neck Surg Speech Lan, B-8000 Brugge, Belgium
[2] Univ Coll Ghent, Fac Hlth Care Vesalius, Ghent, Belgium
[3] Univ Ghent, Dept Otorhinolaryngol & Head & Neck Surg & Speech, Fac Med & Hlth Sci, B-9000 Ghent, Belgium
[4] Univ Utah, Dept Commun Sci & Disorders, Salt Lake City, UT USA
[5] Univ Hosp, Dept Otorhinolaryngol & Head & Neck Surg & Commun, Antwerp, Belgium
Keywords
Overall voice quality; Multivariate acoustic measurement; Perceptual rating; Cepstral measure; Amplitude perturbation; Spectral measure; Harmonics-to-noise ratio; Sustained vowel; Continuous speech; CEPSTRAL PEAK PROMINENCE; PERCEPTUAL EVALUATION; VOCAL QUALITY; DYSPHONIA SEVERITY; GRBAS SCALE; RELIABILITY; DISCRIMINATION; EXPERIENCE; FREQUENCY; RATINGS
DOI
10.1016/j.jvoice.2008.12.014
Chinese Library Classification (CLC) number
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification code
100104; 100213
Abstract
To improve ecological validity, perceptual and instrumental assessment of disordered voice, including overall voice quality, should ideally sample both sustained vowels and continuous speech. This investigation assessed the utility of combining both voice contexts for the purpose of auditory-perceptual ratings as well as acoustic measurement of overall voice quality. Sustained vowel and continuous speech samples from 251 subjects with (n = 229) or without (n = 22) various voice disorders were concatenated and perceptually rated on overall voice quality by five experienced voice clinicians. After removing the nonvoiced segments within the continuous speech samples, the concatenated samples were analyzed using 13 acoustic measures based on fundamental frequency perturbation, amplitude perturbation, spectral and cepstral analyses. Stepwise multiple regression analysis yielded a six-variable acoustic model for the multiparametric measurement of overall voice quality of the concatenated samples (with a cepstral measure as the main contributor to the prediction of overall voice quality). The correlation of this model with mean ratings of overall voice quality resulted in r_s = 0.78. A cross-validation approach involving the iterated internal cross-correlations with 30 subgroups of 100, 50, and 10 samples confirmed a comparable degree of association. Furthermore, the ability of the model to distinguish voice-disordered from vocally normal participants was assessed using estimates of diagnostic precision including receiver operating characteristic (ROC) curve analysis, sensitivity, and specificity, as well as likelihood ratios (LRs), which adjust for base-rate differences between the groups. Depending on the cutoff criteria employed, the analyses revealed an impressive area under the ROC curve of 0.895 as well as respectable sensitivity, specificity, and LR.
The results support the diagnostic utility of combining voice samples from both continuous speech and sustained vowels in acoustic and perceptual analysis of disordered voice. The findings are discussed in relation to the extant literature and the need for further refinement of the acoustic algorithm.
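The diagnostic-precision estimates named in the abstract (sensitivity, specificity, likelihood ratios, and area under the ROC curve) can be illustrated with a minimal Python sketch. This is not the authors' analysis: the function names, the cutoff, and the eight toy scores are hypothetical and chosen only to show the arithmetic. It assumes that a higher acoustic score indicates a more disordered voice.

```python
def diagnostic_summary(scores, labels, cutoff):
    """Sensitivity, specificity, and likelihood ratios at one cutoff.

    scores: acoustic index values (assumed: higher = more disordered)
    labels: 1 = voice-disordered, 0 = vocally normal
    """
    pairs = list(zip(scores, labels))
    tp = sum(1 for s, y in pairs if y == 1 and s >= cutoff)
    fn = sum(1 for s, y in pairs if y == 1 and s < cutoff)
    tn = sum(1 for s, y in pairs if y == 0 and s < cutoff)
    fp = sum(1 for s, y in pairs if y == 0 and s >= cutoff)

    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    # Likelihood ratios are built from sensitivity and specificity only,
    # so they do not depend on how many subjects are in each group.
    lr_pos = sensitivity / (1 - specificity) if specificity < 1 else float("inf")
    lr_neg = (1 - sensitivity) / specificity if specificity > 0 else float("inf")
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "lr_pos": lr_pos,
        "lr_neg": lr_neg,
    }


def roc_auc(scores, labels):
    """Area under the ROC curve via pairwise comparison (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data, NOT taken from the study.
toy_scores = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
toy_labels = [0, 0, 1, 0, 1, 1, 1, 1]

summary = diagnostic_summary(toy_scores, toy_labels, cutoff=3.5)
auc = roc_auc(toy_scores, toy_labels)
print(summary, auc)
```

Because the likelihood ratios use only within-group proportions, they stay interpretable when the groups are as unbalanced as the 229 disordered and 22 normal subjects described above.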
Pages: 540-555
Page count: 16