Rich Punctuations Prediction Using Large-scale Deep Learning

Citations: 0
Authors
Wu, Xueyang [1 ]
Zhu, Su [1 ]
Wu, Yue [1 ]
Yu, Kai [1 ]
Affiliation
[1] Shanghai Jiao Tong Univ, Key Lab Shanghai Educ Commiss Intelligent Interac, Brain Sci & Technol Res Ctr, SpeechLab, Dept Comp Sci & Engn, Shanghai, Peoples R China
Source
2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2016
Keywords
deep learning; neural networks; punctuation prediction; large-scale
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Punctuation plays an important role in language processing. However, automatic speech recognition systems output only plain word sequences, so it is of interest to predict punctuation on such sequences. Previous work has focused on using lexical features or prosodic cues captured from small corpora to predict simple punctuation. Compared with simple punctuation, rich punctuation provides more meaningful information and is more difficult to predict. In this paper, a multi-view LSTM model is proposed to predict rich punctuation on large-scale corpora. In particular, predictions on both in-domain and out-of-domain datasets are investigated. Experiments show that the LSTM can significantly outperform the traditional CRF-based model. Moreover, large-scale corpora are shown to bring large improvements, and introducing POS tags and chunking information in a multi-view structure improves the performance of the LSTM model on a small corpus.
Pages: 5