Event Graph-Based News Clustering: The Role of Named Entity-Centered Subgraphs

被引:0
|
作者
Komecoglu, Basak Buluz [1 ]
Yilmaz, Burcu [1 ]
机构
[1] Gebze Tech Univ, Inst Informat Technol, TR-41400 Gebze, Kocaeli, Turkiye
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Task analysis; Clustering algorithms; Vectors; Context modeling; Computational modeling; Analytical models; Semantics; Natural language processing; Text processing; Frequent subgraph mining; low-resource language; natural language processing; text clustering; TOPIC DETECTION; SIMILARITY;
D O I
10.1109/ACCESS.2024.3435343
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In an era of exponential growth in online news sources, the need for intelligent digital solutions capable of efficiently analyzing and organizing large amounts of news content has become crucial. This paper presents a graph-based methodology designed to enhance Topic Detection and Tracking (TDT) tasks in natural language processing by efficiently clustering news events into coherent stories. The proposed approach leverages a novel event graph model that captures not only the characteristics of individual news events but also their collective narrative context. Using Named Entity Centred Frequent Subgraphs, the model excels in identifying recurring patterns of events and thus provides a framework for learning a robust, language-independent, and structured representation for structuring news stories, which represents a significant advance in the refinement of traditional clustering algorithms. Empirical experiments using a multilingual benchmark dataset, the News Clustering Dataset, highlight the superior clustering performance of our approach compared to state-of-the-art monolingual document clustering techniques, particularly in English and the competitive results in Spanish. To underline the adaptability of the methodology to low-resource languages, the Turkish 'Story-Based News Dataset' developed specifically for this study also promises to serve as an important resource for a wide range of natural language processing tasks.
引用
收藏
页码:105613 / 105632
页数:20
相关论文
共 27 条
  • [1] Named entity linking in microblog posts using graph-based centrality scoring
    Kalloubi, Fahd
    Nfaoui, El Habib
    El Beqqali, Omar
    2014 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS: THEORIES AND APPLICATIONS (SITA'14), 2014,
  • [2] An Balanced, and Scalable Graph-Based Multiview Clustering Method
    Zhao, Zihua
    Nie, Feiping
    Wang, Rong
    Wang, Zheng
    Li, Xuelong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (12) : 7643 - 7656
  • [3] Accurate Complementarity Learning for Graph-Based Multiview Clustering
    Xiao, Xiaolin
    Gong, Yue-Jiao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 16106 - 16118
  • [4] Graph-Based Interactive Matching for Pairs of News Articles
    Pan, Kunhao
    Zhang, Guowei
    Liao, Meng
    Xu, Jin
    COGNITIVE COMPUTATION, 2024, 16 (02) : 507 - 516
  • [5] Graph-Based Interactive Matching for Pairs of News Articles
    Kunhao Pan
    Guowei Zhang
    Meng Liao
    Jin Xu
    Cognitive Computation, 2024, 16 : 507 - 516
  • [6] Structured Optimal Graph-Based Clustering With Flexible Embedding
    Ren, Pengzhen
    Xiao, Yun
    Chang, Xiaojun
    Prakash, Mahesh
    Nie, Feiping
    Wang, Xin
    Chen, Xiaojiang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (10) : 3801 - 3813
  • [7] GMC: Graph-Based Multi-View Clustering
    Wang, Hao
    Yang, Yan
    Liu, Bing
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (06) : 1116 - 1129
  • [8] Robust Chinese Named Entity Recognition Based on Fusion Graph Embedding
    Song, Xuhui
    Yu, Hongtao
    Li, Shaomei
    Wang, Huansha
    ELECTRONICS, 2023, 12 (03)
  • [9] A graph-based clustering algorithm for software systems modularization
    Pourasghar, Babak
    Izadkhah, Habib
    Isazadeh, Ayaz
    Lotfi, Shahriar
    INFORMATION AND SOFTWARE TECHNOLOGY, 2021, 133
  • [10] Graph-Based Soft-Balanced Fuzzy Clustering
    Liu, Chaodie
    Nie, Feiping
    Wang, Rong
    Li, Xuelong
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2023, 31 (06) : 2044 - 2055