A topical VAEGAN-IHMM approach for automatic story segmentation

被引:0
作者
Yu, Jia [1 ,2 ]
Peng, Huiling [1 ]
Wang, Guoqiang [1 ]
Shi, Nianfeng [1 ]
机构
[1] School of Computer and Information Engineering, Luoyang Institute of Science and Technology
[2] Software Research Institute, Technological University of Shannon
关键词
generative adversarial network; HDP; hidden Markov model; story segmentation; variational autoencoder;
D O I
10.3934/mbe.2024289
中图分类号
学科分类号
摘要
Feature representations with rich topic information can greatly improve the performance of story segmentation tasks. VAEGAN offers distinct advantages in feature learning by combining variational autoencoder (VAE) and generative adversarial network (GAN), which not only captures intricate data representations through VAE's probabilistic encoding and decoding mechanism but also enhances feature diversity and quality via GAN's adversarial training. To better learn topical domain representation, we used a topical classifier to supervise the training process of VAEGAN. Based on the learned feature, a segmentor splits the document into shorter ones with different topics. Hidden Markov model (HMM) is a popular approach for story segmentation, in which stories are viewed as instances of topics (hidden states). The number of states has to be set manually but it is often unknown in real scenarios. To solve this problem, we proposed an infinite HMM (IHMM) approach which utilized an HDP prior on transition matrices over countably infinite state spaces to automatically infer the state's number from the data. Given a running text, a Blocked Gibbis sampler labeled the states with topic classes. The position where the topic changes was a story boundary. Experimental results on the TDT2 corpus demonstrated that the proposed topical VAEGAN-IHMM approach was significantly better than the traditional HMM method in story segmentation tasks and achieved state-of-the-art performance. © 2024 the Author(s).
引用
收藏
页码:6608 / 6630
页数:22
相关论文
共 51 条
[1]  
Gondhi U. R., Intra-Topic Clustering for Social Media, (2020)
[2]  
Adedoyin-Olowe M., Gaber M. M., Stahl F., A survey of data mining techniques for social media analysis, J. Data Mining Digital Humanit, 2014, (2014)
[3]  
Rau L. F., Jacobs P. S., Zernik U., Information extraction and text summarization using linguistic knowledge acquisition, Inf. Process. Manage, 25, pp. 419-428, (1989)
[4]  
Lee L., Chen B., Spoken document understanding and organization, IEEE Signal Process. Mag, 22, pp. 42-60, (2005)
[5]  
Dan W., Liu C., Eye tracking analysis in interactive information retrieval research, J. Libr. Sci. China, 2, pp. 109-128, (2019)
[6]  
Zhang B., Chen Z., Peng D., Benediktsson J. A., Liu B., Zhou L., Et al., Remotely sensed big data: Evolution in model development for information extraction [point of view], Proc. IEEE, 107, pp. 2294-2301, (2019)
[7]  
Soderland S., Learning information extraction rules for semi-structured and free text, Mach. Learn, 34, pp. 233-272, (1999)
[8]  
Chen W., Liu B., Guan W., ERNIE and multi-feature fusion for news topic classification, Artif. Intell. Appl, 2, pp. 149-154, (2024)
[9]  
Hsu W., Kennedy L., Huang C. W., Chang S. F., Lin C. Y., Iyengar G., News video story segmentation using fusion of multi-level multi-modal features in trecvid 2003, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, (2004)
[10]  
Wan J., Peng T., Li B., News video story segmentation based on naive bayes model, 2009 Fifth International Conference on Natural Computation, pp. 77-81, (2009)