Computer-assisted transcription of speech based on confusion network reordering

被引:0
|
作者
Laurent, Antoine [1 ]
Meignier, Sylvain [1 ]
Merlin, Teva [1 ]
Deleglise, Paul [1 ]
机构
[1] Univ Maine, LIUM, Res Ctr Comp Sci, F-72017 Le Mans, France
来源
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011年
关键词
Speech recognition; Automatic correction; Cache models; Confusion network; TRANSLATION; RECOGNITION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Large vocabulary automatic speech recognition (ASR) technologies perform well in known and controlled contexts. In less controlled conditions, however, human review is often necessary to check and correct the results of such systems in order to ensure that the output of ASR will be understandable. We propose a method for computer-assisted transcription of speech, based on automatic reordering confusion networks. Our method will be evaluated in terms of KSR (Keystroke Saving Rate) and WSR (Word Stroke Ratio). It allows to significantly reduce the number of actions needed to correct ASR outputs. WSR computed before and after every network reordering shows a gain of about 17.7% (3.4 points).
引用
收藏
页码:4884 / 4887
页数:4
相关论文
共 50 条
  • [21] Confusion analysis in phoneme based speech recognition in Hindi
    Bhatt, Shobha
    Dev, Amita
    Jain, Anurag
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (10) : 4213 - 4238
  • [22] Spoken Document Retrieval Based on Confusion Network with Syllable Fragments
    Lei, Zhang
    Gotoh, Yoshihiko
    Khan, Muhammad Usman Ghani
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2012, 9
  • [23] Gold-standard for computer-assisted morphological sperm analysis
    Chang, Violeta
    Garcia, Alejandra
    Hitschfeld, Nancy
    Hartel, Steffen
    COMPUTERS IN BIOLOGY AND MEDICINE, 2017, 83 : 143 - 150
  • [24] An Analysis on Computer-Assisted Translation through Google Translator Toolkit
    张嵩松
    校园英语, 2018, (01) : 212 - 213
  • [25] Development of basic reading skills in Latin: a corpus-based tool for computer-assisted fluency training
    Kuehnast, Milena
    Schulz, Konstantin
    Luedeling, Anke
    COGENT EDUCATION, 2024, 11 (01):
  • [26] EFFECTS ON LEARNING LOGOGRAPHIC CHARACTER FORMATION IN COMPUTER-ASSISTED HANDWRITING INSTRUCTION
    Tsai, Chen-hui
    Kuo, Chin-Hwa
    Horng, Wen-Bing
    Chen, Chun-Wen
    LANGUAGE LEARNING & TECHNOLOGY, 2012, 16 (01): : 110 - 130
  • [27] Phone-level Mispronunciation Detection for Computer-Assisted Language Learning
    Feng, Xin
    Wang, Lan
    PROCEEDINGS OF THE 2008 CHINESE CONFERENCE ON PATTERN RECOGNITION (CCPR 2008), 2008, : 396 - +
  • [28] Identification of Erythrocyte Types in Greyscale MGG Images for Computer-Assisted Diagnosis
    Frejlichowski, Dariusz
    PATTERN RECOGNITION AND IMAGE ANALYSIS: 5TH IBERIAN CONFERENCE, IBPRIA 2011, 2011, 6669 : 636 - 643
  • [29] Computer-assisted closed-captioning of live TV broadcasts in French
    Boulianne, G.
    Beaumont, J. -F.
    Boisvert, M.
    Brousseau, J.
    Cardinal, P.
    Chapdelaine, C.
    Comeau, M.
    Ouellet, P.
    Osterrath, F.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 273 - 276
  • [30] ANALYSIS OF PHONE CONFUSION IN EMG-BASED SPEECH RECOGNITION
    Wand, Michael
    Schultz, Tanja
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 757 - 760