Latent semantics for hotspot information clustering

被引:0
|
作者
He, Ping [1 ]
Wang, Xi [1 ]
Xu, Xiaofei [1 ]
Li, Li [1 ]
机构
[1] Faculty of Computer and Information Science, Southwest University, Chongqing , China
来源
Journal of Computational Information Systems | 2014年 / 10卷 / 15期
关键词
Calculations; -; Statistics;
D O I
10.12733/jcis11039
中图分类号
学科分类号
摘要
The growing interest about research on hotspot information clustering and discovery can be attributed to our need to harness information on the internet. Currently, there are many similarity calculation methods with their own strength and weakness. In this paper, HowNet, LSA (Latent Semantic Analysis) and LDA (Latent Dirichlet Allocation) are introduced, their strengths and weaknesses evaluated and a revised similarity calculation method is presented. The experimental results show that LSA is advantageous in handling shorter texts when compared with HowNet, which consumes significantly more time to perform the same task. The efficiency of the LSA is over 100 times that of HowNet. It is also found that LDA is more suited to processing longer texts than shorter texts. A prototype is developed to demonstrate the plausibility of our revised similarity method. It presents the evolving of the hot topics within a short period, which will be a great help for hotspot information prediction. © 2014 Binary Information Press
引用
收藏
页码:6517 / 6525
相关论文
共 50 条
  • [21] Clip Clustering for Early Lithographic Hotspot Classification
    Oliveira, Andre
    Puget, Julia Casarin
    Metzler, Carolina
    Reis, Ricardo
    2019 IEEE 10TH LATIN AMERICAN SYMPOSIUM ON CIRCUITS & SYSTEMS (LASCAS), 2019, : 149 - 152
  • [22] CLUSTERING OF MUTATIONS INACTIVATING A CHI RECOMBINATIONAL HOTSPOT
    SCHULTZ, DW
    SWINDLE, J
    SMITH, GR
    JOURNAL OF MOLECULAR BIOLOGY, 1981, 146 (03) : 275 - 286
  • [23] Remarks on the semantics of "information"
    Braitenberg, V
    Imagination and Rigor: ESSAYS ON EDUARDO R. CAIANIELLO'S SCIENTIFIC HERITAGE, 2006, : 31 - 38
  • [24] THOUGHTS ON SEMANTICS OF INFORMATION
    BERGER, LS
    ETC-REVIEW OF GENERAL SEMANTICS, 1971, 28 (04): : 421 - 425
  • [25] Statistical semantics for enhancing document clustering
    Farahat, Ahmed K.
    Kamel, Mohamed S.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 28 (02) : 365 - 393
  • [26] Statistical semantics for enhancing document clustering
    Ahmed K. Farahat
    Mohamed S. Kamel
    Knowledge and Information Systems, 2011, 28 : 365 - 393
  • [27] Semantics-driven ontology clustering
    Zhang, Lei
    Xie, Qiang
    Li, Ling-Zhi
    Ding, Qiu-Lin
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2008, 40 (07): : 1159 - 1164
  • [28] Latent Semantics Encoding for Label Distribution Learning
    Xu, Suping
    Shang, Lin
    Shen, Furao
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3982 - 3988
  • [29] Householder Projector for Unsupervised Latent Semantics Discovery
    Song, Yue
    Zhang, Jichao
    Sebe, Nicu
    Wang, Wei
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7678 - 7688
  • [30] Probabilistic Distributional Semantics with Latent Variable Models
    Seaghdha, Diarmuid O.
    Korhonen, Anna
    COMPUTATIONAL LINGUISTICS, 2014, 40 (03) : 587 - 631