Testing acoustic voice quality classification across languages and speech styles

被引:1
|
作者
Braun, Bettina [1 ]
Dehe, Nicole [1 ]
Einfeldt, Marieke [1 ]
Wochner, Daniela [1 ]
Zahner-Ritter, Katharina [2 ]
机构
[1] Univ Konstanz, Dept Linguist, Constance, Germany
[2] Univ Trier, Dept 2, Phonet, Trier, Germany
来源
INTERSPEECH 2021 | 2021年
关键词
voice quality; phonation type; acoustic measures; random forest; cross-linguistic generalization; infant-directed speech; German; Chinese; Icelandic; INFANT-DIRECTED SPEECH; PERCEPTION; EMOTION; BREATHY; FEMALE;
D O I
10.21437/Interspeech.2021-315
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Many studies relate acoustic voice quality measures to perceptual classification. We extend this line of research by training a classifier on a balanced set of perceptually annotated voice quality categories with high inter-rater agreement, and test it on speech samples from a different language and on a different speech style. Annotations were done on continuous speech from different laboratory settings. In Experiment 1, we trained a random forest with Standard Chinese and German recordings labelled as modal, breathy, or glottalized. The model had an accuracy of 78.7% on unseen data from the same sample (most important variables were harmonics-to-noise ratio, cepstral-peak prominence, and H1-A2). This model was then used to classify data from a different language (Icelandic, Experiment 2) and to classify a different speech style (German infant-directed speech (IDS), Experiment 3). Cross-linguistic generalizability was high for Icelandic (78.6% accuracy), but lower for German IDS (71.7% accuracy). Accuracy of recordings of adult-directed speech from the same speakers as in Experiment 3 (77%, Experiment 4) suggests that it is the special speech style of IDS, rather than the recording setting that led to lower performance. Results are discussed in terms of efficiency of coding and generalizability across languages and speech styles.
引用
收藏
页码:3920 / 3924
页数:5
相关论文
共 50 条
  • [31] Automatic classification of phonation types in spontaneous speech: towards a new workflow for the characterization of speakers' voice quality
    Chanclu, Anais
    Ben Amor, Imen
    Gendrot, Cedric
    Ferragne, Emmanuel
    Bonastre, Jean-Francois
    INTERSPEECH 2021, 2021, : 1015 - 1018
  • [32] Is this my voice or yours? The role of emotion and acoustic quality in self-other voice discrimination in schizophrenia
    Pinheiro, Ana P.
    Rezaii, Neguine
    Rauber, Andreia
    Niznikiewicz, Margaret
    COGNITIVE NEUROPSYCHIATRY, 2016, 21 (04) : 335 - 353
  • [33] Voice Quality Evaluation in Patients With COVID-19: An Acoustic Analysis
    Asiaee, Maral
    Vahedian-azimi, Amir
    Atashi, Seyed Shahab
    Keramatfar, Abdalsamad
    Nourbakhsh, Mandana
    JOURNAL OF VOICE, 2022, 36 (06) : 879.e13 - 879.e19
  • [34] Accuracy of Acoustic Voice Quality Index and Its Isolated Acoustic Measures to Discriminate the Severity of Voice Disorders
    Englert, Marina
    Lopes, Leonardo
    Vieira, Vinicius
    Behlau, Mara
    JOURNAL OF VOICE, 2022, 36 (04) : 582.e1 - 582.e10
  • [35] Acoustic measures of dysphonic severity across and within voice types
    Wolfe, V
    Fitch, J
    Martin, D
    FOLIA PHONIATRICA ET LOGOPAEDICA, 1997, 49 (06) : 292 - 299
  • [36] Anatomical and functional correlates of voice quality in tracheoesophageal speech
    van As-Brooks, CJ
    Hilgers, FJM
    Koopmans-van Beinum, FJ
    Pols, LCW
    JOURNAL OF VOICE, 2005, 19 (03) : 360 - 372
  • [37] The Role of Voice Quality in the Perception of Prominence in Synthetic Speech
    Murphy, Andy
    Yanushevskaya, Irena
    Chasaide, Ailbhe Ni
    Gobl, Christer
    INTERSPEECH 2019, 2019, : 2543 - 2547
  • [38] Voice quality and gender: some insights on correlations between perceptual and acoustic dimensions
    Camargo, Zuleica
    Madureira, Sandra
    Pessoa, Aline Neves
    Rusilo, Luiz Carlos
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SPEECH PROSODY, VOLS I AND II, 2012, : 115 - 118
  • [39] Computing scores of voice quality and speech intelligibility in tracheoesophageal speech for speech stimuli of varying lengths
    Clapham, Renee P.
    Martens, Jean-Pierre
    van Son, Rob J. J. H.
    Hilgers, Frans J. M.
    van den Brekel, Michiel M. W.
    Middag, Catherine
    COMPUTER SPEECH AND LANGUAGE, 2016, 37 : 1 - 10
  • [40] Clinical Utility and Validation of the Acoustic Voice Quality and Acoustic Breathiness Indexes for Voice Disorder Assessment in English Speakers
    Castillo-Allendes, Adrian
    Codino, Juliana
    Cantor-Cutiva, Lady Catherine
    Nudelman, Charles J.
    Rubin, Adam D.
    Barsties V. Latoszek, Ben
    Hunter, Eric J.
    JOURNAL OF CLINICAL MEDICINE, 2023, 12 (24)