Computer-assisted transcription of speech based on confusion network reordering

被引:0
|
作者
Laurent, Antoine [1 ]
Meignier, Sylvain [1 ]
Merlin, Teva [1 ]
Deleglise, Paul [1 ]
机构
[1] Univ Maine, LIUM, Res Ctr Comp Sci, F-72017 Le Mans, France
来源
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011年
关键词
Speech recognition; Automatic correction; Cache models; Confusion network; TRANSLATION; RECOGNITION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Large vocabulary automatic speech recognition (ASR) technologies perform well in known and controlled contexts. In less controlled conditions, however, human review is often necessary to check and correct the results of such systems in order to ensure that the output of ASR will be understandable. We propose a method for computer-assisted transcription of speech, based on automatic reordering confusion networks. Our method will be evaluated in terms of KSR (Keystroke Saving Rate) and WSR (Word Stroke Ratio). It allows to significantly reduce the number of actions needed to correct ASR outputs. WSR computed before and after every network reordering shows a gain of about 17.7% (3.4 points).
引用
收藏
页码:4884 / 4887
页数:4
相关论文
共 50 条
  • [41] Computer-Assisted Pronunciation Training: From Pronunciation Scoring Towards Spoken Language Learning
    Chen, Nancy F.
    Li, Haizhou
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [42] A computer-assisted tool for automatically measuring non-native Japanese oral proficiency
    Li, Wenchao
    Zhong, Zhentao
    Liu, Haitao
    COMPUTER ASSISTED LANGUAGE LEARNING, 2024,
  • [43] Computer-assisted sign language translation: a study of translators' practice to specify CAT software
    Kaczmarek, Marion
    Filhol, Michael
    MACHINE TRANSLATION, 2021, 35 (03) : 305 - 322
  • [44] Computer based speech prosody teaching system
    Sztaho, David
    Kiss, Gabor
    Vicsi, Klara
    COMPUTER SPEECH AND LANGUAGE, 2018, 50 : 126 - 140
  • [45] Confusion-Based Entropy-Weighted Decoding for Robust Speech Recognition
    Chen, Yi
    Wan, Chia-yu
    Lee, Lin-shan
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1008 - 1011
  • [46] Spanish Phone Confusion Analysis for EMG-Based Silent Speech Interfaces
    Salomons, Inge
    del Blanco, Eder
    Navas, Eva
    Hernaez, Inma
    INTERSPEECH 2023, 2023, : 1179 - 1183
  • [47] Hunter disease eClinic: interactive, computer-assisted, problem-based approach to independent learning about a rare genetic disease
    Al-Jasmi, Fatma
    Moldovan, Laura
    Clarke, Joe T. R.
    BMC MEDICAL EDUCATION, 2010, 10
  • [48] Emotional Speech Recognition Method Based on Word Transcription
    Bekmanova, Gulmira
    Yergesh, Banu
    Sharipbay, Altynbek
    Mukanova, Assel
    SENSORS, 2022, 22 (05)
  • [49] RETRACTED ARTICLE: Speech-assisted intelligent software architecture based on deep game neural network
    Yue Li
    International Journal of Speech Technology, 2021, 24 : 57 - 66
  • [50] Computer-Assisted Photo Identification Outperforms Visible Implant Elastomers in an Endangered Salamander, Eurycea tonkawae
    Bendik, Nathan F.
    Morrison, Thomas A.
    Gluesenkamp, Andrew G.
    Sanders, Mark S.
    O'Donnell, Lisa J.
    PLOS ONE, 2013, 8 (03):