Text-based vs. vowel-based automatic evaluation of tracheoesophageal substitute voice

被引:0
|
作者
Haderlein, Tino [1 ,2 ]
Bocklet, Tobias [1 ,2 ]
Noeth, Elmar [2 ]
Rosanowski, Frank [1 ]
机构
[1] Univ Erlangen Nurnberg, Dept Phoniatr & Pedaudiol, Bohlenpl 21, D-91054 Erlangen, Germany
[2] Univ Erlangen Nurnberg, Chair Pattern Recognit Comp Sci 5, D-91058 Erlangen, Germany
来源
PROCEEDINGS OF IWSSIP 2008: 15TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING | 2008年
关键词
substitute voice; automatic speech recognition; Hoarseness Diagram; prosodic features;
D O I
10.1109/IWSSIP.2008.4604425
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Hoarseness Diagram, a program for voice quality analysis using recordings of sustained vowels, was compared to an automatic speech recognition system with a module for prosodic analysis. The latter computed prosodic features on a text recording. We examined whether the voice analysis of sustained vowel and text analysis correlate on a group of 24 male laryngectomees (average age: 60.6 +/- 8.9 years) using tracheoesophageal substitute speech. Each person read the German version of the text "The North Wind and the Sun" which consists of 108 words. Additionally, 5 sustained vowels were recorded from each patient. The correlation between the measures obtained by the Hoarseness Diagram and the prosodic features from the prosody module was determined. Parameters like jitter, shimmer, F0 and irregularity computed by the Hoarseness Diagram on vowel recordings show correlations of about -0.8 to prosodic features obtained from the text recordings. Hence, voice properties can reliably be evaluated both on a vowel and a text recording. The text analysis, however, offers also possibilities for automatic speech evaluation since it represents a real communication situation better.
引用
收藏
页码:295 / +
页数:3
相关论文
共 50 条
  • [1] Automatic Evaluation of Tracheoesophageal Substitute Voice: Sustained Vowel versus Standard Text
    Bocklet, Tobias
    Toy, Hikmet
    Noeth, Elmar
    Schuster, Maria
    Eysholdt, Ulrich
    Rosanowski, Frank
    Gottwald, Frank
    Haderlein, Tino
    FOLIA PHONIATRICA ET LOGOPAEDICA, 2009, 61 (02) : 112 - 116
  • [2] Automatic evaluation of prosodic features of tracheoesophageal substitute voice
    Tino Haderlein
    Elmar Nöth
    Hikmet Toy
    Anton Batliner
    Maria Schuster
    Ulrich Eysholdt
    Joachim Hornegger
    Frank Rosanowski
    European Archives of Oto-Rhino-Laryngology, 2007, 264 : 1315 - 1321
  • [3] Automatic evaluation of prosodic features of tracheoesophageal substitute voice
    Haderlein, Tino
    Noeth, Elmar
    Toy, Hikmet
    Batliner, Anton
    Schuster, Maria
    Eysholdt, Ulrich
    Hornegger, Joachim
    Rosanowski, Frank
    EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2007, 264 (11) : 1315 - 1321
  • [4] Comparative evaluation of image-based vs. text-based vs. multimodal AI approaches for automatic breast density assessment in mammograms
    Lopez-Ubeda, Pilar
    Martin-Noguerol, Teodoro
    Paulano-Godino, Felix
    Luna, Antonio
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 255
  • [5] Automatic Evaluation of Voice Quality Using Text-Based Laryngograph Measurements and Prosodic Analysis
    Haderlein, Tino
    Schwemmle, Cornelia
    Doellinger, Michael
    Matousek, Vaclav
    Ptok, Martin
    Noeth, Elmar
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2015, 2015
  • [6] A Frame of Mind: Frame-based vs. Text-based Editing
    Brown, Neil
    Kyfonidis, Charalampos
    Weill-Tessier, Pierre
    Becker, Brett
    Dillane, Joe
    Kolling, Michael
    UKICER '21: PROCEEDINGS OF THE 2021 UNITED KINGDOM AND IRELAND COMPUTING EDUCATION RESEARCH CONFERENCE, 2021,
  • [7] A Qualitative Evaluation of User Preference for Link-Based vs. Text-Based Recommendations of Wikipedia Articles
    Ostendorff, Malte
    Breitinger, Corinna
    Gipp, Bela
    TOWARDS OPEN AND TRUSTWORTHY DIGITAL SOCIETIES, ICADL 2021, 2021, 13133 : 63 - 79
  • [8] GUI-Based vs. Text-Based Assignments in CS1
    Ball, Robert
    DuHadway, Linda
    Hilton, Spencer
    Rague, Brian
    SIGCSE'18: PROCEEDINGS OF THE 49TH ACM TECHNICAL SYMPOSIUM ON COMPUTER SCIENCE EDUCATION, 2018, : 1017 - 1022
  • [9] Automatic Rating of Hoarseness by Text-based Cepstral and Prosodic Evaluation
    Haderlein, Tino
    Moers, Cornelia
    Moebius, Bernd
    Noeth, Elmar
    TEXT, SPEECH AND DIALOGUE, TSD 2012, 2012, 7499 : 573 - 580
  • [10] The effects of computer-based vs. text-based instruction on remedial college readers
    Kuehner, AV
    JOURNAL OF ADOLESCENT & ADULT LITERACY, 1999, 43 (02) : 160 - 168