Cluster-Based News Representative Generation with Automatic Incremental Clustering

被引:0
|
作者
Shabirin, Irsal [1 ]
Barakbah, Ali Ridho [1 ]
Syarif, Iwan [1 ]
机构
[1] Politekn Elekt Negeri Surabaya, Grad Sch Informat & Comp Engn, Jl Raya Its Sukolilo Sur 60111, Indonesia
关键词
Clustering; Metadata Aggregation; Automatic Incremental Clustering; Representative News;
D O I
10.24003/emitter.v7i2.378
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Nowadays we are facing much abundant information, especially news, and makes us confused in sorting out the information, so that it wastes our time in filtering that information. Though the news often contains similar contents that should save our time for reading. In this paper, we propose a new approach to provide aggregation mechanisms from cluster-based news and produce representative news, using our proposed Automatic Incremental Clustering. This approach presents a mechanism for clustering incremental news data and dynamically providing an automatic creation of new clusters. This approach consists of six main functions, which are (1) Data acquisition with incremental news sources from several news service providers, (2) Keyword extraction for term representation of news data, (3) Metadata aggregation for creating vector space of terms, (4) Automatic clustering for initiating news cluster generation, (5) Automatic incremental clustering for clustering incoming news data to pre-determined clusters or creating a new cluster of news data, and (6) News representation for selecting the most representative news of data clusters. For experimental study, we involved 95 news data service providers with 751 news data for for creating initial clusters with automatic clustering and 110 news data for incremental automatic clustering. Our approach performed 85.14% accuracy for incremental automatic clustering, and is able to dynamically create new clusters for incremental news data.
引用
收藏
页码:467 / 479
页数:13
相关论文
共 50 条
  • [41] DICLENS: Divisive Clustering Ensemble with Automatic Cluster Number
    Mimaroglu, Selim
    Aksehirli, Emin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (02) : 408 - 420
  • [42] CVIK: A MATLAB-based cluster validity index toolbox for automatic data clustering
    Jose-Garcia, Adan
    Gomez-Flores, Wilfrido
    SOFTWAREX, 2023, 22
  • [43] Cluster-Based Secure Aggregation for Federated Learning
    Kim, Jien
    Park, Gunryeong
    Kim, Miseung
    Park, Soyoung
    ELECTRONICS, 2023, 12 (04)
  • [44] A New Cluster-based Instance Selection Algorithm
    Czarnowski, Ireneusz
    Jedrzejowicz, Piotr
    AGENT AND MULTI-AGENT SYSTEMS: TECHNOLOGIES AND APPLICATIONS, 2011, 6682 : 436 - 445
  • [45] Cluster-based industrialization in China: Financing and performance
    Long, Cheryl
    Zhang, Xiaobo
    JOURNAL OF INTERNATIONAL ECONOMICS, 2011, 84 (01) : 112 - 123
  • [46] Cluster-based trajectory segmentation with local noise
    Maria Luisa Damiani
    Fatima Hachem
    Hamza Issa
    Nathan Ranc
    Paul Moorcroft
    Francesca Cagnacci
    Data Mining and Knowledge Discovery, 2018, 32 : 1017 - 1055
  • [47] A novel framework for cluster-based sensor fusion
    Wang, Xue
    Wang, Sheng
    Jiang, Aiguo
    2006 IMACS: MULTICONFERENCE ON COMPUTATIONAL ENGINEERING IN SYSTEMS APPLICATIONS, VOLS 1 AND 2, 2006, : 2033 - +
  • [48] An efficient incremental clustering based improved K-Medoids for IoT multivariate data cluster analysis
    Balakrishna, Sivadi
    Thirumaran, M.
    Padmanaban, R.
    Solanki, Vijender Kumar
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2020, 13 (04) : 1152 - 1175
  • [49] An efficient incremental clustering based improved K-Medoids for IoT multivariate data cluster analysis
    Sivadi Balakrishna
    M. Thirumaran
    R. Padmanaban
    Vijender Kumar Solanki
    Peer-to-Peer Networking and Applications, 2020, 13 : 1152 - 1175
  • [50] Clustering-Based Incremental Web Crawling
    Tan, Qingzhao
    Mitra, Prasenjit
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2010, 28 (04)