Cluster-Based News Representative Generation with Automatic Incremental Clustering

被引:0
|
作者
Shabirin, Irsal [1 ]
Barakbah, Ali Ridho [1 ]
Syarif, Iwan [1 ]
机构
[1] Politekn Elekt Negeri Surabaya, Grad Sch Informat & Comp Engn, Jl Raya Its Sukolilo Sur 60111, Indonesia
关键词
Clustering; Metadata Aggregation; Automatic Incremental Clustering; Representative News;
D O I
10.24003/emitter.v7i2.378
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Nowadays we are facing much abundant information, especially news, and makes us confused in sorting out the information, so that it wastes our time in filtering that information. Though the news often contains similar contents that should save our time for reading. In this paper, we propose a new approach to provide aggregation mechanisms from cluster-based news and produce representative news, using our proposed Automatic Incremental Clustering. This approach presents a mechanism for clustering incremental news data and dynamically providing an automatic creation of new clusters. This approach consists of six main functions, which are (1) Data acquisition with incremental news sources from several news service providers, (2) Keyword extraction for term representation of news data, (3) Metadata aggregation for creating vector space of terms, (4) Automatic clustering for initiating news cluster generation, (5) Automatic incremental clustering for clustering incoming news data to pre-determined clusters or creating a new cluster of news data, and (6) News representation for selecting the most representative news of data clusters. For experimental study, we involved 95 news data service providers with 751 news data for for creating initial clusters with automatic clustering and 110 news data for incremental automatic clustering. Our approach performed 85.14% accuracy for incremental automatic clustering, and is able to dynamically create new clusters for incremental news data.
引用
收藏
页码:467 / 479
页数:13
相关论文
共 50 条
  • [11] Cluster Feature-Based Incremental Clustering Approach (CFICA) For Numerical Data
    Sowjanya, A. M.
    Shashi, M.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2010, 10 (09): : 73 - 79
  • [12] Low Latency Clustering & Mapping Algorithm with Task Duplication Technique on Cluster-Based NoC
    Fu, Fangfa
    Bai, Yuxin
    Hu, Xin'an
    Wang, Jinxiang
    Yu, Mingyan
    INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III, 2010, : 1374 - 1379
  • [13] Cluster-Based Distributed Consensus
    Li, Wenjun
    Dai, Huaiyu
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2009, 8 (01) : 28 - 31
  • [14] Clustering Fusion with Automatic Cluster Number
    Muneeswaran, P.
    Velvizhy, P.
    Kannan, A.
    2014 INTERNATIONAL CONFERENCE ON RECENT TRENDS IN INFORMATION TECHNOLOGY (ICRTIT), 2014,
  • [15] A Limited Incremental Clustering Algorithm with Respect to Cluster Stability
    Zhu, Wenhao
    Yao, Wenxin
    Dai, Song
    Lu, Zhiguo
    HIGH PERFORMANCE COMPUTING AND APPLICATIONS, HPCA 2015, 2016, 9576 : 170 - 177
  • [16] Cluster-Based Graph Collaborative Filtering
    Liu, Fan
    Zhao, Shuai
    Cheng, Zhiyong
    Nie, Liqiang
    Kankanhalli, Mohan
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (06)
  • [17] Cluster-Based Improved Isolation Forest
    Shao, Chen
    Du, Xusheng
    Yu, Jiong
    Chen, Jiaying
    ENTROPY, 2022, 24 (05)
  • [18] Cluster-based dynamic scoring model
    Lim, Michael K.
    Sohn, So Young
    EXPERT SYSTEMS WITH APPLICATIONS, 2007, 32 (02) : 427 - 431
  • [19] Cluster-based Sliced Inverse Regression
    Kuentz, Vanessa
    Saracco, Jerome
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2010, 39 (02) : 251 - 267
  • [20] Cluster-based Sliced Inverse Regression
    Vanessa Kuentz
    Jérôme Saracco
    Journal of the Korean Statistical Society, 2010, 39 : 251 - 267