Automatic Lecture Subtitle Generation and How It Helps

被引:11
作者
Che, Xiaoyin [1 ]
Luo, Sheng [1 ]
Yang, Haojin [1 ]
Meinel, Christoph [1 ]
机构
[1] Hasso Plattner Inst, Prof Dr Helmert Str 2-3, D-14482 Potsdam, Germany
来源
2017 IEEE 17TH INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT) | 2017年
关键词
Automatic Subtitling; Sentence Boundary Detection; Lecture Videos; MOOC;
D O I
10.1109/ICALT.2017.11
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we propose an integrated framework of automatic bilingual subtitle generation for lecture videos, especially for MOOCs. The framework consists of Automatic Speech Recognition (ASR), Sentence Boundary Detection (SBD), and Machine Translation (MT). Then we quantitatively evaluate the auto-generated subtitles, the manually produced subtitles from scratch, and the auto-generated subtitles with manual modification in term of accuracy and time expenditure, in both original and target languages. The result shows that the auto-generated subtitles in the original language (English) are fairly accurate already. By using them as the draft, human subtitle producers can save 54% of the working time and simultaneously reduce the error rate by 54.3%, which is a significant improvement. However, the effectiveness of machine translated subtitles (English to Chinese) is limited. In the end, if the proposed framework is applied, the total working time in preparing bilingual subtitles can be shortened by approximately 1/3, with no decline in quality.
引用
收藏
页码:34 / 38
页数:5
相关论文
共 24 条
[1]  
Aliprandi C., 2014, P NAB BROADC ENG C P
[2]  
Alvarez Aitor, 2010, Proceedings 2010 International Multiconference on Computer Science and Information Technology (IMCSIT 2010), P567
[3]  
Alvarez A, 2016, LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P3049
[4]   Towards customized automatic segmentation of subtitles [J].
Álvarez, Aitor ;
Arzelus, Haritz ;
Etchegoyhen, Thierry .
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8854 :229-238
[5]  
[Anonymous], 2013, P 2013 C N AM CHAPTE
[6]  
[Anonymous], 10 INT C LANG RES EV
[7]  
[Anonymous], 1998, TRANSLATION J
[8]  
Beaven T., 2013, J INTERACTIVE MEDIA, V2013
[9]   A neural probabilistic language model [J].
Bengio, Y ;
Ducharme, R ;
Vincent, P ;
Jauvin, C .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) :1137-1155
[10]   Sentence Boundary Detection Based on Parallel Lexical and Acoustic Models [J].
Che, Xiaoyin ;
Luo, Sheng ;
Yang, Haojin ;
Meinel, Christoph .
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, :2528-2532