A hybrid neural network hidden Markov model approach for automatic story segmentation

Cited by: 0
Authors
Jia Yu
Lei Xie
Xiong Xiao
Eng Siong Chng
Affiliations
[1] Northwestern Polytechnical University, Shaanxi Provincial Key Laboratory of Speech and Image Information Processing, School of Computer Science
[2] School of Computer and Information Engineering, Temasek Laboratories@NTU
[3] Luoyang Institute of Science and Technology
[4] Nanyang Technological University
Source
Journal of Ambient Intelligence and Humanized Computing | 2017 / Vol. 8
Keywords
Neural network; Long short-term memory; Hidden Markov model; Multi-task learning; Story segmentation; Topic modeling
DOI
Not available
Abstract
We propose a hybrid neural network hidden Markov model (NN-HMM) approach for automatic story segmentation. A story is treated as an instance of an underlying topic (a hidden state), and words are generated from the distribution of that topic. A transition from one topic to another indicates a story boundary. Unlike the traditional HMM approach, in which the emission probability of each state is calculated from a topic-dependent language model, we use a deep neural network (DNN) to map the word distribution directly to topic posterior probabilities. DNNs are known to learn meaningful continuous features for words and hence have better discriminative and generalization capability than n-gram models. Specifically, we investigate three neural network structures: a feed-forward neural network, a recurrent neural network with long short-term memory cells (LSTM-RNN), and a modified LSTM-RNN with multi-task learning ability. Experimental results on the TDT2 corpus show that the proposed NN-HMM approach significantly outperforms the traditional HMM approach and achieves state-of-the-art performance in story segmentation.
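The decoding idea in the abstract can be illustrated with a small sketch. The abstract does not give the decoding details, so the following assumes the usual hybrid recipe: network posteriors are divided by topic priors to obtain scaled emission likelihoods, and Viterbi decoding finds the best topic sequence, with a boundary wherever the topic changes. The function name `viterbi_segment`, the `stay_prob` parameter, the uniform initial distribution, and the toy posteriors are all hypothetical and not taken from the paper.

```python
import math

def viterbi_segment(posteriors, priors, stay_prob=0.9):
    """Return (topic path, boundary positions) for a sequence of text chunks.

    posteriors: per-chunk topic posteriors P(topic | words), e.g. network outputs.
    priors:     topic priors P(topic); requires at least two topics.
    stay_prob:  assumed probability of remaining in the same topic (illustrative).
    """
    n_topics = len(priors)
    log_stay = math.log(stay_prob)
    log_switch = math.log((1.0 - stay_prob) / (n_topics - 1))

    def emission(t, k):
        # Scaled likelihood: P(words | topic) is proportional to
        # P(topic | words) / P(topic). A small floor avoids log(0).
        return math.log(max(posteriors[t][k], 1e-12)) - math.log(priors[k])

    def transition(j, k):
        return log_stay if j == k else log_switch

    # Assumed uniform initial topic distribution.
    score = [math.log(1.0 / n_topics) + emission(0, k) for k in range(n_topics)]
    backpointers = []
    for t in range(1, len(posteriors)):
        new_score, pointers = [], []
        for k in range(n_topics):
            best_j = max(range(n_topics), key=lambda j: score[j] + transition(j, k))
            new_score.append(score[best_j] + transition(best_j, k) + emission(t, k))
            pointers.append(best_j)
        score = new_score
        backpointers.append(pointers)

    # Backtrack from the best final state.
    state = max(range(n_topics), key=lambda k: score[k])
    path = [state]
    for pointers in reversed(backpointers):
        state = pointers[state]
        path.append(state)
    path.reverse()

    # A story boundary is placed before every chunk whose topic differs
    # from the previous chunk's topic.
    boundaries = [t for t in range(1, len(path)) if path[t] != path[t - 1]]
    return path, boundaries

# Toy example: two topics, five chunks, posteriors drifting from topic 0 to topic 1.
toy_posteriors = [[0.7, 0.3], [0.8, 0.2], [0.6, 0.4], [0.2, 0.8], [0.1, 0.9]]
path, boundaries = viterbi_segment(toy_posteriors, priors=[0.5, 0.5])
print(path, boundaries)
```

A high `stay_prob` penalizes topic switches, so a single ambiguous chunk does not by itself trigger a boundary; in practice such a value would be tuned on held-out data.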
Pages: 925-936
Page count: 11