News Hotspots Detection and Tracking Based on LDA Topic Model

被引:0
|
作者
Hu, Xiao [1 ]
机构
[1] Capital Normal Univ, Beijing 100048, Peoples R China
来源
PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), VOL 1 | 2016年
基金
美国国家科学基金会; 北京市自然科学基金;
关键词
LDA topic model; news reports; hotspots; detection; traking;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid spread of Internet and the mobile web, the number of news pages is increasing quickly as well as the content of news becomes highly dynamic. It's difficult for normal users to obtain specific information contained in a mass of news streams. So it's of great research significance to study how to analyze massive news, detect and track news hotspots automatically. This research proposes to apply LDA (Latent Dirichlet Allocation) model to the application of topic detection and tracking. The news articles collected by crawlers are modeled by the LDA model in a form of document-topic-word distribution. We propose a method to compute the heat of topics based on the distribution and to detect the news hotspots. In addition, we track the evolution of the topic trends in different time-slices. Jenson-Shannon distance is used to measure the similarity between topics to identify topic inheritance and topic mutation. We conducted experiments on a dataset consisting of 3462 news texts from news portals. The result revealed that the proposed model has a good effect both in detecting hotspots and discovering meaningful topical evolution trends.
引用
收藏
页码:248 / 252
页数:5
相关论文
共 50 条
  • [1] Research on Hotspots of Educational Application of Natural Language Processing Based on LDA Topic Model
    Wang, Meng
    Xie, Yuyang
    Tian, Yu
    CHINESE LEXICAL SEMANTICS, CLSW 2022, PT II, 2023, 13496 : 315 - 325
  • [2] Design and Implementation of Parallelized LDA Topic Model Based on MapReduce
    Yan, Duan-wu
    Li, Tie-jun
    Yang, Xiong-fei
    Chen, Kun
    2018 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND NETWORK TECHNOLOGY (CCNT 2018), 2018, 291 : 274 - 278
  • [3] A Doctor Recommendation Based on Graph Computing and LDA Topic Model
    Meng, Qiuqing
    Xiong, Huixiang
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) : 808 - 817
  • [4] Sentiment Classification of Crowdsourcing Participants' Reviews Text Based on LDA Topic Model
    Huang, Yanrong
    Wang, Rui
    Huang, Bin
    Wei, Bo
    Zheng, Shu Li
    Chen, Min
    IEEE ACCESS, 2021, 9 : 108131 - 108143
  • [5] Multi-aspect Blog Sentiment Analysis Based on LDA Topic Model and Hownet Lexicon
    Fu, Xianghua
    Liu, Guo
    Guo, Yanyan
    Guo, Wubiao
    WEB INFORMATION SYSTEMS AND MINING, PT II, 2011, 6988 : 131 - 138
  • [6] Data Mining of e-commerce agricultural product reviews Based on LDA topic Model
    Fu, Xiuqing
    Chen, Meixia
    Zhuo, Shulong
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION TECHNOLOGY AND COMPUTER ENGINEERING, EITCE 2023, 2023, : 878 - 882
  • [7] Identifying Urban Functional Regions by LDA Topic Model with POI Data
    Huang, Yuhao
    Zhang, Lijun
    Wang, Haijun
    Wang, Siqi
    BIG DATA, BIGDATA 2022, 2022, 1709 : 72 - 91
  • [8] Short text data model of secondary equipment faults in power grids based on LDA topic model and convolutional neural network
    Wei, Wei
    Nan, Dongliang
    Zhang, Lu
    Zhou, Jie
    Wang, Lichao
    Tang, Xiaobing
    2020 35TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2020, : 156 - 160
  • [9] Evolution of CCUS Technologies Using LDA Topic Model and Derwent Patent Data
    Huang, Liangchao
    Hou, Zhengmeng
    Fang, Yanli
    Liu, Jianhua
    Shi, Tianle
    ENERGIES, 2023, 16 (06)
  • [10] Finding Collaboration Opportunities from Emerging Issues with LDA Topic Model and Link Prediction
    Liu J.
    Long Z.
    Wang F.
    Data Analysis and Knowledge Discovery, 2019, 3 (01) : 104 - 117