A Fuzzy, Incremental and Semantic Trending Topic Detection in Social Feeds

被引:1
作者
Abou-Of, Mona A. [1 ]
机构
[1] Pharos Univ Alexandria, Dept Comp Engn, Alexandria, Egypt
来源
2020 11TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS) | 2020年
关键词
News Aggregator; Trending Topics Detection; Semantic Similarity; Text and Web Mining; NLP; Incremental FCM Clustering;
D O I
10.1109/ICICS49469.2020.239492
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Nowadays, a huge number of people participating in social networks is triggering a fast and wide spectrum of topics. Such trending topics are usually derived from the most frequent searches, the published posts and the daily news. The automated analysis for such data requires topics detection and tracking methods. Many challenges are being faced. It is difficult to discover the semantic relatedness when the same event is presented by different titles and to handle merging semantically identical topics from different channels (aggregation). Other hardships are the vagueness regarding the vast web collection, the scalability to analyze them, and the fact that it is a time consuming task. The framework introduced in this paper aims to solve these issues. Because a web document often consists of several topics, the suggested model employs a fuzzy C-Means (FCM) clustering based trending topics detection. It applies a semantic document similarity algorithm to resolve such ambiguity issues caused by the usage of synonyms, homonyms or different abstraction levels. This algorithm is also used to summarize the long documents. Furthermore, an incremental clustering technique is utilized to preserve high cohesiveness up-to-date top trending topics. The experimental results finally illustrate the effectiveness and the superiority of this model, compared with other trending topics detection algorithms, in terms of entropy and F-score measures.
引用
收藏
页码:118 / 124
页数:7
相关论文
共 25 条
  • [11] The Stanford CoreNLP Natural Language Processing Toolkit
    Manning, Christopher D.
    Surdeanu, Mihai
    Bauer, John
    Finkel, Jenny
    Bethard, Steven J.
    McClosky, David
    [J]. PROCEEDINGS OF 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: SYSTEM DEMONSTRATIONS, 2014, : 55 - 60
  • [12] Merchant K, 2018, 2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), P1803, DOI 10.1109/ICACCI.2018.8554831
  • [13] WORDNET - A LEXICAL DATABASE FOR ENGLISH
    MILLER, GA
    [J]. COMMUNICATIONS OF THE ACM, 1995, 38 (11) : 39 - 41
  • [14] Mirhosseini M., 2016, INTERTURKISH J ELECT
  • [15] Morchid M., 2015, INT ARCH PHOTOGRAMME
  • [16] New automatic fuzzy relational clustering algorithms using multi-objective NSGA-II
    Paul, Animesh Kumar
    Shill, Pintu Chandra
    [J]. INFORMATION SCIENCES, 2018, 448 : 112 - 133
  • [17] Detection of Trending Topic Communities: Bridging Content Creators and Distributors
    Recalde, Lorena
    Nettleton, David F.
    Baeza-Yates, Ricardo
    Boratto, Ludovico
    [J]. PROCEEDINGS OF THE 28TH ACM CONFERENCE ON HYPERTEXT AND SOCIAL MEDIA (HT'17), 2017, : 205 - 213
  • [18] Axiomatic generalization of the membership degree weighting function for fuzzy C means clustering: Theoretical development and convergence analysis
    Saha, Arkajyoti
    Das, Swagatam
    [J]. INFORMATION SCIENCES, 2017, 408 : 129 - 145
  • [19] Sapul MSC, 2017, INT JOINT CONF COMP
  • [20] Shamir R., 2018, P RAD ONC, V127