Feedback From Automatic Speech Recognition to Elicit Clear Speech in Healthy Speakers

被引:0
|
作者
Gutz, Sarah E. [1 ,2 ]
Maffei, Marc F. [1 ]
Green, Jordan R. [1 ,2 ]
机构
[1] MGH Inst Hlth Profess, Dept Commun Sci & Disorders, Boston, MA 02129 USA
[2] Harvard Univ, Program Speech & Hearing Biosci & Technol, Cambridge, MA 02138 USA
关键词
PARKINSONS-DISEASE; MOTOR-PERFORMANCE; SPEAKING RATE; ACOUSTIC CHARACTERISTICS; PERCEPTUAL CONSEQUENCES; CONVERSATIONAL SPEECH; MULTIPLE-SCLEROSIS; TALKER DIFFERENCES; INTELLIGIBILITY; DYSARTHRIA;
D O I
10.1044/2023_AJSLP-23-00030
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Purpose: This study assessed the effectiveness of feedback generated by automatic speech recognition (ASR) for eliciting clear speech from young, healthy individuals. As a preliminary step toward exploring a novel method for eliciting clear speech in patients with dysarthria, we investigated the effects of ASR feedback in healthy controls. If successful, ASR feedback has the potential to facilitate independent, at-home clear speech practice.Method: Twenty-three healthy control speakers (ages 23-40 years) read sentences aloud in three speaking modes: Habitual, Clear (over-enunciated), and in response to ASR feedback (ASR). In the ASR condition, we used Mozilla Deep Speech to transcribe speech samples and provide participants with a value indicating the accuracy of the ASR's transcription. For speakers who achieved sufficiently high ASR accuracy, noise was added to their speech at a participant specific signal-to-noise ratio to ensure that each participant had to over enunciate to achieve high ASR accuracy. Results: Compared to habitual speech, speech produced in the ASR and Clear conditions was clearer, as rated by speech-language pathologists, and more intelligible, per speech-language pathologist transcriptions. Speech in the Clear and ASR conditions aligned on several acoustic measures, particularly those associated with increased vowel distinctiveness and decreased speaking rate. However, ASR accuracy, intelligibility, and clarity were each correlated with different speech features, which may have implications for how people change their speech for ASR feedback. Conclusions: ASR successfully elicited outcomes similar to clear speech in healthy speakers. Future work should investigate its efficacy in eliciting clear speech in people with dysarthria.
引用
收藏
页码:2940 / 2959
页数:20
相关论文
共 50 条
  • [1] Effects of altered intensity feedback on speech in healthy speakers
    Senthinathan, Anita
    Adams, Scott
    Canadian Acoustics - Acoustique Canadienne, 2020, 48 (03): : 43 - 52
  • [2] Assessing Automatic Speech Recognition in measuring speech intelligibility: A study of Malay speakers with speech impairments
    Rosdi, Fadhilah
    Mustafa, Mumtaz Begum
    Salim, Siti Salwah
    PROCEEDINGS OF THE 2017 6TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICEEI'17), 2017,
  • [3] Modelling Errors in Automatic Speech Recognition for Dysarthric Speakers
    Santiago Omar Caballero Morales
    Stephen J. Cox
    EURASIP Journal on Advances in Signal Processing, 2009
  • [4] Modelling Errors in Automatic Speech Recognition for Dysarthric Speakers
    Morales, Santiago Omar Caballero
    Cox, Stephen J.
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2009,
  • [5] DEVELOPING SUCCESSFUL SPEAKERS FOR AN AUTOMATIC SPEECH RECOGNITION SYSTEM
    DANIS, CM
    PROCEEDINGS OF THE HUMAN FACTORS SOCIETY 33RD ANNUAL MEETING, VOL 1: PERSPECTIVES, 1989, : 301 - 304
  • [6] USER FEEDBACK REQUIREMENTS WITH AUTOMATIC SPEECH RECOGNITION
    SCHURICK, JM
    WILLIGES, BH
    MAYNARD, JF
    ERGONOMICS, 1985, 28 (11) : 1543 - 1555
  • [7] Corpus Construction for Deaf Speakers and Analysis by Automatic Speech Recognition
    Kobayashi, Akio
    Yasu, Keiichi
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 2294 - 2298
  • [8] Speech production and automatic speech recognition
    Acoustics Bulletin, 2000, 25 (02):
  • [9] AUTOMATIC SPEECH RECOGNITION OF IMPAIRED SPEECH
    CARLSON, GS
    BERNSTEIN, J
    INTERNATIONAL JOURNAL OF REHABILITATION RESEARCH, 1988, 11 (04) : 396 - 398
  • [10] Validity of Off-the-Shelf Automatic Speech Recognition for Assessing Speech Intelligibility and Speech Severity in Speakers With Amyotrophic Lateral Sclerosis
    Gutz, Sarah E.
    Stipancic, Kaila L.
    Yunusova, Yana
    Berry, James D.
    Green, Jordan R.
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2022, 65 (05): : 2128 - 2143