Testing acoustic voice quality classification across languages and speech styles

被引:1
|
作者
Braun, Bettina [1 ]
Dehe, Nicole [1 ]
Einfeldt, Marieke [1 ]
Wochner, Daniela [1 ]
Zahner-Ritter, Katharina [2 ]
机构
[1] Univ Konstanz, Dept Linguist, Constance, Germany
[2] Univ Trier, Dept 2, Phonet, Trier, Germany
来源
INTERSPEECH 2021 | 2021年
关键词
voice quality; phonation type; acoustic measures; random forest; cross-linguistic generalization; infant-directed speech; German; Chinese; Icelandic; INFANT-DIRECTED SPEECH; PERCEPTION; EMOTION; BREATHY; FEMALE;
D O I
10.21437/Interspeech.2021-315
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Many studies relate acoustic voice quality measures to perceptual classification. We extend this line of research by training a classifier on a balanced set of perceptually annotated voice quality categories with high inter-rater agreement, and test it on speech samples from a different language and on a different speech style. Annotations were done on continuous speech from different laboratory settings. In Experiment 1, we trained a random forest with Standard Chinese and German recordings labelled as modal, breathy, or glottalized. The model had an accuracy of 78.7% on unseen data from the same sample (most important variables were harmonics-to-noise ratio, cepstral-peak prominence, and H1-A2). This model was then used to classify data from a different language (Icelandic, Experiment 2) and to classify a different speech style (German infant-directed speech (IDS), Experiment 3). Cross-linguistic generalizability was high for Icelandic (78.6% accuracy), but lower for German IDS (71.7% accuracy). Accuracy of recordings of adult-directed speech from the same speakers as in Experiment 3 (77%, Experiment 4) suggests that it is the special speech style of IDS, rather than the recording setting that led to lower performance. Results are discussed in terms of efficiency of coding and generalizability across languages and speech styles.
引用
收藏
页码:3920 / 3924
页数:5
相关论文
共 50 条
  • [1] Analysis and classification of phonation types in speech and singing voice
    Kadiri, Sudarsana Reddy
    Alku, Paavo
    Yegnanarayana, B.
    SPEECH COMMUNICATION, 2020, 118 : 33 - 47
  • [2] Analysis of acoustic and voice quality features for the classification of infant and mother vocalizations
    Li, Jialu
    Hasegawa-Johnson, Mark
    McElwain, Nancy L.
    SPEECH COMMUNICATION, 2021, 133 : 41 - 61
  • [3] Bilingual acoustic voice variation is similarly structured across languages
    Johnson, Khia A.
    Babel, Molly
    Fuhrman, Robert A.
    INTERSPEECH 2020, 2020, : 2387 - 2391
  • [4] "Softened" Voice Quality in Poetry Reading An Acoustic Study
    Gafni, Chen
    Tsur, Reuven
    STYLE, 2017, 51 (04) : 456 - 481
  • [5] Acoustic voice variation in spontaneous speech
    Lee, Yoonjeong
    Kreiman, Jody
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 151 (05): : 3462 - 3472
  • [6] VOICE QUALITY AND SPEAKING STYLES
    Madureira, Sandra
    de Souza Fontes, Mario Augusto
    Fonseca, Beatriz Coelho
    DIALECTOLOGIA, 2016, : 171 - 190
  • [7] SPEAKER AND LANGUAGE INDEPENDENT VOICE QUALITY CLASSIFICATION APPLIED TO UNLABELLED CORPORA OF EXPRESSIVE SPEECH
    Kane, John
    Scherer, Stefan
    Aylett, Matthew
    Morency, Louis-Philippe
    Gobl, Christer
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7982 - 7986
  • [8] The Role of Voice Quality in Mandarin Sarcastic Speech: An Acoustic and Electroglottographic Study
    Li, Shanpeng
    Gu, Wentao
    Liu, Lei
    Tang, Ping
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2020, 63 (08): : 2578 - 2588
  • [9] Voice quality in telephone speech: Comparing acoustic measures between VoIP telephone and high-quality recordings
    Xu, Chenzi
    Wormald, Jessica
    Foulkes, Paul
    Harrison, Philip
    Hughes, Vincent
    Welch, Poppy
    Kelly, Finnian
    van de Vloed, David
    INTERSPEECH 2024, 2024, : 1570 - 1574
  • [10] Mel-frequency Cepstral Coefficients of Voice Source Waveforms for Classification of Phonation Types in Speech
    Kadiri, Sudarsana Reddy
    Alku, Paavo
    INTERSPEECH 2019, 2019, : 2508 - 2512