Computer-assisted transcription of speech based on confusion network reordering

Times cited: 0
Authors
Laurent, Antoine [1 ]
Meignier, Sylvain [1 ]
Merlin, Teva [1 ]
Deleglise, Paul [1 ]
Affiliations
[1] Univ Maine, LIUM, Res Ctr Comp Sci, F-72017 Le Mans, France
Source
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011
Keywords
Speech recognition; Automatic correction; Cache models; Confusion network; TRANSLATION; RECOGNITION;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Large vocabulary automatic speech recognition (ASR) technologies perform well in known and controlled contexts. In less controlled conditions, however, human review is often necessary to check and correct the output of such systems so that it remains understandable. We propose a method for computer-assisted transcription of speech, based on automatic reordering of confusion networks. The method is evaluated in terms of KSR (Keystroke Saving Rate) and WSR (Word Stroke Ratio). It significantly reduces the number of actions needed to correct ASR outputs. WSR computed before and after every network reordering shows a gain of about 17.7% (3.4 points).
Pages: 4884-4887
Number of pages: 4