Event Graph-Based News Clustering: The Role of Named Entity-Centered Subgraphs

被引:0
|
作者
Komecoglu, Basak Buluz [1 ]
Yilmaz, Burcu [1 ]
机构
[1] Gebze Tech Univ, Inst Informat Technol, TR-41400 Gebze, Kocaeli, Turkiye
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Task analysis; Clustering algorithms; Vectors; Context modeling; Computational modeling; Analytical models; Semantics; Natural language processing; Text processing; Frequent subgraph mining; low-resource language; natural language processing; text clustering; TOPIC DETECTION; SIMILARITY;
D O I
10.1109/ACCESS.2024.3435343
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In an era of exponential growth in online news sources, the need for intelligent digital solutions capable of efficiently analyzing and organizing large amounts of news content has become crucial. This paper presents a graph-based methodology designed to enhance Topic Detection and Tracking (TDT) tasks in natural language processing by efficiently clustering news events into coherent stories. The proposed approach leverages a novel event graph model that captures not only the characteristics of individual news events but also their collective narrative context. Using Named Entity Centred Frequent Subgraphs, the model excels in identifying recurring patterns of events and thus provides a framework for learning a robust, language-independent, and structured representation for structuring news stories, which represents a significant advance in the refinement of traditional clustering algorithms. Empirical experiments using a multilingual benchmark dataset, the News Clustering Dataset, highlight the superior clustering performance of our approach compared to state-of-the-art monolingual document clustering techniques, particularly in English and the competitive results in Spanish. To underline the adaptability of the methodology to low-resource languages, the Turkish 'Story-Based News Dataset' developed specifically for this study also promises to serve as an important resource for a wide range of natural language processing tasks.
引用
收藏
页码:105613 / 105632
页数:20
相关论文
共 27 条
  • [11] conteNXt: A Graph-Based Approach to Assimilate Content and Context for Event Detection in OSN
    Sharma, Sielvie
    Abulaish, Muhammad
    Ahmad, Tanvir
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04): : 5483 - 5495
  • [12] Graph-Based Short Text Clustering via Contrastive Learning with Graph Embedding
    Wei, Yujie
    Zhou, Weidong
    Zhou, Jin
    Wang, Yingxu
    Han, Shiyuan
    Du, Tao
    Yang, Cheng
    Liu, Bowen
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT I, 2023, 14086 : 727 - 738
  • [13] An Entity Ontology-Based Knowledge Graph Embedding Approach to News Credibility Assessment
    Liu, Qi
    Jin, Yuanyuan
    Cao, Xuefei
    Liu, Xiaodong
    Zhou, Xiaokang
    Zhang, Yonghong
    Xu, Xiaolong
    Qi, Lianyong
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04): : 5308 - 5318
  • [14] An End-to-End Approach for Graph-Based Multi-View Data Clustering
    Dornaika, Fadi
    El Hajjar, Sally
    IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (05) : 644 - 654
  • [15] New Graph based Sequence Clustering Approach for News Article Retrieval System
    Nagalavi, Deepa
    Hanumanthappa, M.
    2017 IEEE INTERNATIONAL CONFERENCE ON POWER, CONTROL, SIGNALS AND INSTRUMENTATION ENGINEERING (ICPCSI), 2017, : 1479 - 1482
  • [16] An Explainable Fake News Detector Based on Named Entity Recognition and Stance Classification Applied to COVID-19
    De Magistris, Giorgio
    Russo, Samuele
    Roma, Paolo
    Starczewski, Janusz T.
    Napoli, Christian
    INFORMATION, 2022, 13 (03)
  • [17] ModER: Graph-based Unsupervised Entity Resolution using Composite Modularity Optimization and Locality Sensitive Hashing
    Ebeid, Islam Akef
    Talburt, John R.
    Hagan, Nicholas Kofi Akortia
    Siddique, Md Abdus Salam
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (09) : 1 - 18
  • [18] Superpixel-Level Global and Local Similarity Graph-Based Clustering for Large Hyperspectral Images
    Zhao, Haishi
    Zhou, Fengfeng
    Bruzzone, Lorenzo
    Guan, Renchu
    Yang, Chen
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [19] Graph-Based Structural Deep Spectral-Spatial Clustering for Hyperspectral Image
    Peng, Bo
    Yao, Yuxuan
    Lei, Jianjun
    Fang, Leyuan
    Huang, Qingming
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [20] Self-Weighted Graph-Based Framework for Multi-View Clustering
    He, Yanfang
    Yusof, Umi Kalsom
    IEEE ACCESS, 2023, 11 : 30197 - 30207