Testing acoustic voice quality classification across languages and speech styles

Cited by: 1
Authors
Braun, Bettina [1]
Dehe, Nicole [1]
Einfeldt, Marieke [1]
Wochner, Daniela [1]
Zahner-Ritter, Katharina [2]
Affiliations
[1] Univ Konstanz, Dept Linguist, Constance, Germany
[2] Univ Trier, Dept 2, Phonet, Trier, Germany
Source
INTERSPEECH 2021 | 2021
Keywords
voice quality; phonation type; acoustic measures; random forest; cross-linguistic generalization; infant-directed speech; German; Chinese; Icelandic; INFANT-DIRECTED SPEECH; PERCEPTION; EMOTION; BREATHY; FEMALE;
DOI
10.21437/Interspeech.2021-315
Chinese Library Classification
R36 [Pathology]; R76 [Otorhinolaryngology]
Subject classification codes
100104; 100213
Abstract
Many studies relate acoustic voice quality measures to perceptual classification. We extend this line of research by training a classifier on a balanced set of perceptually annotated voice quality categories with high inter-rater agreement and testing it on speech samples from a different language and in a different speech style. Annotations were made on continuous speech from different laboratory settings. In Experiment 1, we trained a random forest on Standard Chinese and German recordings labelled as modal, breathy, or glottalized. The model reached an accuracy of 78.7% on unseen data from the same sample; the most important variables were harmonics-to-noise ratio, cepstral peak prominence, and H1-A2. This model was then used to classify data from a different language (Icelandic, Experiment 2) and from a different speech style (German infant-directed speech (IDS), Experiment 3). Generalizability was high for Icelandic (78.6% accuracy) but lower for German IDS (71.7% accuracy). Accuracy on recordings of adult-directed speech from the same speakers as in Experiment 3 (77%, Experiment 4) suggests that the special speech style of IDS, rather than the recording setting, led to the lower performance. Results are discussed in terms of efficiency of coding and generalizability across languages and speech styles.
Pages: 3920-3924
Number of pages: 5