Computer-assisted transcription of speech based on confusion network reordering

被引：0

作者：

Laurent, Antoine ^{[1
]}

Meignier, Sylvain ^{[1
]}

Merlin, Teva ^{[1
]}

Deleglise, Paul ^{[1
]}

机构：

[1] Univ Maine, LIUM, Res Ctr Comp Sci, F-72017 Le Mans, France

来源：

2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011年

关键词：

Speech recognition; Automatic correction; Cache models; Confusion network; TRANSLATION; RECOGNITION;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Large vocabulary automatic speech recognition (ASR) technologies perform well in known and controlled contexts. In less controlled conditions, however, human review is often necessary to check and correct the results of such systems in order to ensure that the output of ASR will be understandable. We propose a method for computer-assisted transcription of speech, based on automatic reordering confusion networks. Our method will be evaluated in terms of KSR (Keystroke Saving Rate) and WSR (Word Stroke Ratio). It allows to significantly reduce the number of actions needed to correct ASR outputs. WSR computed before and after every network reordering shows a gain of about 17.7% (3.4 points).

引用

页码：4884 / 4887

页数：4

共 50 条

[41] Computer-Assisted Pronunciation Training: From Pronunciation Scoring Towards Spoken Language Learning
Chen, Nancy F.
Li, Haizhou
2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
[42] A computer-assisted tool for automatically measuring non-native Japanese oral proficiency
Li, Wenchao
Zhong, Zhentao
Liu, Haitao
COMPUTER ASSISTED LANGUAGE LEARNING, 2024,
[43] Computer-assisted sign language translation: a study of translators' practice to specify CAT software
Kaczmarek, Marion
Filhol, Michael
MACHINE TRANSLATION, 2021, 35 (03) : 305 - 322
[44] Computer based speech prosody teaching system
Sztaho, David
Kiss, Gabor
Vicsi, Klara
COMPUTER SPEECH AND LANGUAGE, 2018, 50 : 126 - 140
[45] Confusion-Based Entropy-Weighted Decoding for Robust Speech Recognition
Chen, Yi
Wan, Chia-yu
Lee, Lin-shan
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1008 - 1011
[46] Spanish Phone Confusion Analysis for EMG-Based Silent Speech Interfaces
Salomons, Inge
del Blanco, Eder
Navas, Eva
Hernaez, Inma
INTERSPEECH 2023, 2023, : 1179 - 1183
[47] Hunter disease eClinic: interactive, computer-assisted, problem-based approach to independent learning about a rare genetic disease
Al-Jasmi, Fatma
Moldovan, Laura
Clarke, Joe T. R.
BMC MEDICAL EDUCATION, 2010, 10
[48] Emotional Speech Recognition Method Based on Word Transcription
Bekmanova, Gulmira
Yergesh, Banu
Sharipbay, Altynbek
Mukanova, Assel
SENSORS, 2022, 22 (05)
[49] RETRACTED ARTICLE: Speech-assisted intelligent software architecture based on deep game neural network
Yue Li
International Journal of Speech Technology, 2021, 24 : 57 - 66
[50] Computer-Assisted Photo Identification Outperforms Visible Implant Elastomers in an Endangered Salamander, Eurycea tonkawae
Bendik, Nathan F.
Morrison, Thomas A.
Gluesenkamp, Andrew G.
Sanders, Mark S.
O'Donnell, Lisa J.
PLOS ONE, 2013, 8 (03):

← 1 2 3 4 5 →