Testing acoustic voice quality classification across languages and speech styles

Cited by: 1
Authors
Braun, Bettina [1]
Dehe, Nicole [1]
Einfeldt, Marieke [1]
Wochner, Daniela [1]
Zahner-Ritter, Katharina [2]
Affiliations
[1] Univ Konstanz, Dept Linguist, Constance, Germany
[2] Univ Trier, Dept 2, Phonet, Trier, Germany
Source
INTERSPEECH 2021 | 2021
Keywords
voice quality; phonation type; acoustic measures; random forest; cross-linguistic generalization; infant-directed speech; German; Chinese; Icelandic; perception; emotion; breathy; female
DOI
10.21437/Interspeech.2021-315
Chinese Library Classification (CLC) number
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification code
100104; 100213
Abstract
Many studies relate acoustic voice quality measures to perceptual classification. We extend this line of research by training a classifier on a balanced set of perceptually annotated voice quality categories with high inter-rater agreement, and testing it on speech samples from a different language and from a different speech style. Annotations were made on continuous speech from different laboratory settings. In Experiment 1, we trained a random forest on Standard Chinese and German recordings labelled as modal, breathy, or glottalized. The model had an accuracy of 78.7% on unseen data from the same sample (the most important variables were harmonics-to-noise ratio, cepstral peak prominence, and H1-A2). This model was then used to classify data from a different language (Icelandic, Experiment 2) and from a different speech style (German infant-directed speech (IDS), Experiment 3). Cross-linguistic generalizability was high for Icelandic (78.6% accuracy), but accuracy was lower for German IDS (71.7%). Accuracy on recordings of adult-directed speech from the same speakers as in Experiment 3 (77%, Experiment 4) suggests that it is the special speech style of IDS, rather than the recording setting, that led to lower performance. Results are discussed in terms of efficiency of coding and generalizability across languages and speech styles.
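The protocol in the abstract (train one random forest, then score it on held-out data from the same sample and on data from other conditions) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not state which software was used, so scikit-learn is an assumption, the feature names `hnr`, `cpp`, and `h1_a2` are hypothetical column names, and all data are synthetic placeholders whose `shift` parameter merely mimics a change of language or speech style.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

LABELS = ["modal", "breathy", "glottalized"]
FEATURES = ["hnr", "cpp", "h1_a2"]  # hypothetical names for the three measures


def make_synthetic(n_per_class, shift, rng):
    """Draw placeholder features for a balanced set of three classes.

    `shift` moves all class centres by a constant and stands in for a
    change of language or speech style. Nothing here is real speech data.
    """
    centers = np.array([[0.0, 0.0, 0.0], [3.0, 0.0, 3.0], [0.0, 3.0, -3.0]])
    X = np.vstack([
        rng.normal(center + shift, 1.0, size=(n_per_class, len(FEATURES)))
        for center in centers
    ])
    y = np.repeat(LABELS, n_per_class)  # same class order as the stacked rows
    return X, y


def run_protocol(seed=0):
    """Train once, then evaluate on held-out and on shifted test sets."""
    rng = np.random.default_rng(seed)

    # Training sample, split into a training part and an unseen part.
    X, y = make_synthetic(200, 0.0, rng)
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, stratify=y, random_state=seed
    )

    model = RandomForestClassifier(n_estimators=200, random_state=seed)
    model.fit(X_train, y_train)

    results = {"held_out": accuracy_score(y_test, model.predict(X_test))}

    # The fitted model is reused unchanged on data from other conditions.
    for name, shift in [("other_language", 0.3), ("other_style", 1.0)]:
        X_new, y_new = make_synthetic(100, shift, rng)
        results[name] = accuracy_score(y_new, model.predict(X_new))

    # Impurity-based importances, one value per feature.
    importances = dict(zip(FEATURES, model.feature_importances_))
    return results, importances


if __name__ == "__main__":
    results, importances = run_protocol()
    for name, acc in results.items():
        print(f"{name}: accuracy = {acc:.3f}")
    for name, value in sorted(importances.items(), key=lambda kv: -kv[1]):
        print(f"{name}: importance = {value:.3f}")
```

The key design point is that the model is fitted only once; each later accuracy figure then measures how well that same fitted model transfers to a new condition, rather than how well a retrained model can fit it.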
Pages: 3920-3924
Number of pages: 5