Investigation of Efficient Semi-automatic Correction Method Using STD for Automatic Captioning

被引:0
作者
Terada, Yuji [1 ]
Tamiya, Kenta [1 ]
Kai, Atsuhiko [1 ]
机构
[1] Shizuoka Univ, Grad Sch Integrated Sci & Technol, Dept Engn, Naka Ku, 3-5-1 Johoku, Hamamatsu, Shizuoka, Japan
来源
2017 IEEE 6TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE) | 2017年
关键词
Spoken term detection; Human correction; Automatic speech recognition; Language model adaptation;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Captioning lecture speech is very useful for better understanding. However, it takes high cost to do real-time manual captioning or even if we employ automatic speech recognition system and human correction together. In this paper, we propose a method to reduce a cost for human correction as a prerequisite of a framework for captioning using automatic speech recognition system. Specifically, we investigate the effect of incorporating a simple human's feedback which only includes error words such as technical terms and proper nouns, and identifying and correcting the caption text by using spoken term detection system. Moreover, we investigate the method to improve the accuracy of the automatic captioning system by using the corrected text for semi-supervised language model adaptation. Throughout the preliminary experiments, it was found that the proposed caption correcting system could improve the word error rate.
引用
收藏
页数:2
相关论文
共 5 条
[1]   Real-time transcription system for simultaneous subtitling of Japanese broadcast news programs [J].
Ando, A ;
Imai, T ;
Kobayashi, A ;
Isono, H ;
Nakabayashi, K .
IEEE TRANSACTIONS ON BROADCASTING, 2000, 46 (03) :189-196
[2]   Manipulating Word Lattices to Incorporate Human Corrections [J].
Gaur, Yashesh ;
Metze, Florian ;
Bigham, Jeffrey P. .
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, :3062-3065
[3]  
Kawahara T., 2015, PRMU2015111 IEICE
[4]  
Makino Mitsuaki, 2014, P INTERSPEECH, P1732
[5]  
National Institute for Japanese Language, 2004, CORP SPONT JAP CSJ