Latent semantics for hotspot information clustering

被引:0
|
作者
He, Ping [1 ]
Wang, Xi [1 ]
Xu, Xiaofei [1 ]
Li, Li [1 ]
机构
[1] Faculty of Computer and Information Science, Southwest University, Chongqing , China
来源
Journal of Computational Information Systems | 2014年 / 10卷 / 15期
关键词
Calculations; -; Statistics;
D O I
10.12733/jcis11039
中图分类号
学科分类号
摘要
The growing interest about research on hotspot information clustering and discovery can be attributed to our need to harness information on the internet. Currently, there are many similarity calculation methods with their own strength and weakness. In this paper, HowNet, LSA (Latent Semantic Analysis) and LDA (Latent Dirichlet Allocation) are introduced, their strengths and weaknesses evaluated and a revised similarity calculation method is presented. The experimental results show that LSA is advantageous in handling shorter texts when compared with HowNet, which consumes significantly more time to perform the same task. The efficiency of the LSA is over 100 times that of HowNet. It is also found that LDA is more suited to processing longer texts than shorter texts. A prototype is developed to demonstrate the plausibility of our revised similarity method. It presents the evolving of the hot topics within a short period, which will be a great help for hotspot information prediction. © 2014 Binary Information Press
引用
收藏
页码:6517 / 6525
相关论文
共 50 条
  • [1] The Semantics Latent in Shannon Information
    Isaac, Alistair M. C.
    BRITISH JOURNAL FOR THE PHILOSOPHY OF SCIENCE, 2019, 70 (01): : 103 - 125
  • [2] Information filtering using latent semantics
    Yokoi, Takeru
    Yanagimoto, Hidekazu
    Omatu, Sigeru
    ELECTRICAL ENGINEERING IN JAPAN, 2008, 165 (02) : 53 - 59
  • [3] Discovering Latent Semantics in Web Documents Using Fuzzy Clustering
    Chiang, I-Jen
    Liu, Charles Chih-Ho
    Tsai, Yi-Hsin
    Kumar, Ajit
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2015, 23 (06) : 2122 - 2134
  • [4] Modelling the Latent Semantics of Diffusion Sources in Information Cascade Prediction
    Huang, Ningbo
    Zhou, Gang
    Zhang, Mengli
    Zhang, Meng
    Yu, Ze
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [5] Fuzzy clustering for multiview data by combining latent information
    Wei, Huiqin
    Chen, Long
    Chen, C. L. Philip
    Duan, Junwei
    Han, Ruizhi
    Guo, Li
    APPLIED SOFT COMPUTING, 2022, 126
  • [6] Latent concept extraction and text clustering based on information theory
    Department of Information, Liaoning University, Shenyang 110036, China
    不详
    Ruan Jian Xue Bao, 2008, 9 (2276-2284):
  • [7] Semantics Based Clustering through Cover-Kmeans with OntoVsm for Information Retrieval
    Kumar, R. Lakshmana
    Kannammal, N.
    Krishnamoorthy, Sujatha
    Kadry, Seifedine
    Nam, Yunyoung
    INFORMATION TECHNOLOGY AND CONTROL, 2020, 49 (03): : 370 - 380
  • [8] Latent semantics in language models
    Brychcin, Tomas
    Konopik, Miloslav
    COMPUTER SPEECH AND LANGUAGE, 2015, 33 (01): : 88 - 108
  • [9] Agglomerative Hierarchical Clustering for Information Retrieval Using Latent Semantic Index
    Park, Hansaem
    Kwon, Kyunglag
    Khiati, Abdel-ilah Zakaria
    Lee, Jeungmin
    Chung, In-Jeong
    2015 IEEE INTERNATIONAL CONFERENCE ON SMART CITY/SOCIALCOM/SUSTAINCOM (SMARTCITY), 2015, : 426 - 431
  • [10] Hotspot detection and clustering: ways and means
    Andrew B. Lawson
    Environmental and Ecological Statistics, 2010, 17 : 231 - 245