A hybrid neural network hidden Markov model approach for automatic story segmentation

被引:0
作者
Jia Yu
Lei Xie
Xiong Xiao
Eng Siong Chng
机构
[1] Northwestern Polytechnical University,Shaanxi Provincial Key Laboratory of Speech and Image Information Processing, School of Computer Science
[2] School of Computer and Information Engineering,Temasek Laboratories@NTU
[3] Luoyang Institute of Science and Technology,undefined
[4] Nanyang Technological University,undefined
来源
Journal of Ambient Intelligence and Humanized Computing | 2017年 / 8卷
关键词
Neural network; Long short-term memory; Hidden Markov model; Multi-task learning; Story segmentation; Topic modeling;
D O I
暂无
中图分类号
学科分类号
摘要
We propose a hybrid neural network hidden Markov model (NN-HMM) approach for automatic story segmentation. A story is treated as an instance of an underlying topic (a hidden state) and words are generated from the distribution of the topic. The transition from one topic to another indicates a story boundary. Different from the traditional HMM approach, in which the emission probability of each state is calculated from a topic-dependent language model, we use deep neural network (DNN) to directly map the word distribution into topic posterior probabilities. DNN is known to be able to learn meaningful continuous features for words and hence has better discriminative and generalization capability than n-gram models. Specifically, we investigate three neural network structures: a feed-forward neural network, a recurrent neural network with long short-term memory cells (LSTM-RNN) and a modified LSTM-RNN with multi-task learning ability. Experimental results on the TDT2 corpus show that the proposed NN-HMM approach outperforms the traditional HMM approach significantly and achieves state-of-the-art performance in story segmentation.
引用
收藏
页码:925 / 936
页数:11
相关论文
共 50 条
[21]   A hybrid approach of traffic volume forecasting based on wavelet transform, neural network and markov model [J].
Chen, SY ;
Wang, W ;
Ren, G .
INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS, 2005, :393-398
[22]   A topical VAEGAN-IHMM approach for automatic story segmentation [J].
Yu, Jia ;
Peng, Huiling ;
Wang, Guoqiang ;
Shi, Nianfeng .
Mathematical Biosciences and Engineering, 2024, 21 (07) :6608-6630
[23]   Multiscale Hidden Markov Model applied to ECG segmentation [J].
Graja, S ;
Boucher, JM .
2003 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING, PROCEEDINGS: FROM CLASSICAL MEASUREMENT TO COMPUTING WITH PERCEPTIONS, 2003, :105-109
[24]   Semantic image segmentation with a multidimensional hidden Markov model [J].
Jiten, Joakim ;
Merialdo, Bernard .
ADVANCES IN MULTIMEDIA MODELING, PT 1, 2007, 4351 :616-624
[25]   Hidden Markov Model for Event Photo Stream Segmentation [J].
Gozali, Jesse Prabawa ;
Kan, Min-Yen ;
Sundaram, Hari .
2012 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2012, :25-30
[26]   A neural network model of hidden markov model applied to the auditory periphery for speech processing and recognition [J].
Ye, DT ;
Songhua ;
Ying, LX ;
Krishnan, SM .
PROCEEDINGS OF THE 19TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOL 19, PTS 1-6: MAGNIFICENT MILESTONES AND EMERGING OPPORTUNITIES IN MEDICAL ENGINEERING, 1997, 19 :1371-1376
[27]   Probabilistic model for fatigue damage estimation of wind turbines with hidden markov model and neural network [J].
Zhu, Dongping ;
Ding, Zhixia ;
Huang, Xiaogang .
OCEAN ENGINEERING, 2024, 310
[28]   Joint Action Segmentation and Classification by an Extended Hidden Markov Model [J].
Borzeshi, Ehsan Zare ;
Concha, Oscar Perez ;
Xu, Richard Yi Da ;
Piccardi, Massimo .
IEEE SIGNAL PROCESSING LETTERS, 2013, 20 (12) :1207-1210
[29]   Thai Word Segmentation with Hidden Markov Model and Decision Tree [J].
Bheganan, Poramin ;
Nayak, Richi ;
Xu, Yue .
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 :74-85
[30]   Joint scene classification and segmentation based on hidden Markov model [J].
Huang, JC ;
Liu, Z ;
Wang, Y .
IEEE TRANSACTIONS ON MULTIMEDIA, 2005, 7 (03) :538-550