Leveraging the temporal dynamics of anticipatory vowel-to-vowel coarticulation in linguistic prediction: A statistical modeling approach

被引:2
作者
Flego, Stefon [1 ]
Forrest, Jon [2 ]
机构
[1] Indiana Univ, Dept Linguist, Bloomington, IN 47405 USA
[2] Univ Georgia, Dept Linguist, Athens, GA 30602 USA
关键词
Anticipatory coarticulation; Linguistic prediction; Spectral change; Formant dynamics; Curve fitting; Bayesian modeling; TIME-COURSE; SPEECH-PERCEPTION; AMERICAN ENGLISH; VERTICAL-BAR; ACOUSTIC ANALYSIS; SPOKEN-LANGUAGE; TRACKING; SPEAKERS; IDENTIFICATION; ARTICULATION;
D O I
10.1016/j.wocn.2021.101093
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Previous research has shown that coarticulatory information in the signal orients listeners in spoken word recognition, and that articulatory and perceptual dynamics closely parallel one another. The current study uses statistical classification to test the power of time-varying anticipatory coarticulatory information present in the acoustic signal for predicting upcoming sounds in the speech stream. Bayesian mixed-effects multinomial logistic regression models were trained on several different representations of spectral variation present in V-1 in order to predict the identity of V-2 in naturally coarticulated transconsonantal V-1...V-2 sequences. Models trained on simple measures of spectral variation (e.g. formant measures taken at V-1 midpoint) were compared with models trained on more sophisticated time-varying representations (e.g. the estimated coefficients of polynomial curves fit to whole formant trajectories of V-1). Accuracy in predicting V-2 was greater when models were trained on dynamic representations of spectral variation in V-1, and those trained on quadratic and cubic polynomial representations achieved the greatest accuracy, with more than 15 percentage points in correct classification over using midpoint formant frequencies alone. The results demonstrate that spectral representations with high temporal resolution capture more disambiguating anticipatory information available in the signal than representations with lower temporal resolution. (C) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:19
相关论文
共 88 条