Cluster-Based News Representative Generation with Automatic Incremental Clustering

被引:0
|
作者
Shabirin, Irsal [1 ]
Barakbah, Ali Ridho [1 ]
Syarif, Iwan [1 ]
机构
[1] Politekn Elekt Negeri Surabaya, Grad Sch Informat & Comp Engn, Jl Raya Its Sukolilo Sur 60111, Indonesia
关键词
Clustering; Metadata Aggregation; Automatic Incremental Clustering; Representative News;
D O I
10.24003/emitter.v7i2.378
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Nowadays we are facing much abundant information, especially news, and makes us confused in sorting out the information, so that it wastes our time in filtering that information. Though the news often contains similar contents that should save our time for reading. In this paper, we propose a new approach to provide aggregation mechanisms from cluster-based news and produce representative news, using our proposed Automatic Incremental Clustering. This approach presents a mechanism for clustering incremental news data and dynamically providing an automatic creation of new clusters. This approach consists of six main functions, which are (1) Data acquisition with incremental news sources from several news service providers, (2) Keyword extraction for term representation of news data, (3) Metadata aggregation for creating vector space of terms, (4) Automatic clustering for initiating news cluster generation, (5) Automatic incremental clustering for clustering incoming news data to pre-determined clusters or creating a new cluster of news data, and (6) News representation for selecting the most representative news of data clusters. For experimental study, we involved 95 news data service providers with 751 news data for for creating initial clusters with automatic clustering and 110 news data for incremental automatic clustering. Our approach performed 85.14% accuracy for incremental automatic clustering, and is able to dynamically create new clusters for incremental news data.
引用
收藏
页码:467 / 479
页数:13
相关论文
共 50 条
  • [31] Automatic Depth Extraction from 2D Images Using a Cluster-Based Learning Framework
    Herrera, Jose L.
    del-Blanco, Carlos R.
    Garcia, Narciso
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (07) : 3288 - 3299
  • [32] A general stochastic clustering method for automatic cluster discovery
    Tan, Swee Chuan
    Ting, Kai Ming
    Teng, Shyh Wei
    PATTERN RECOGNITION, 2011, 44 (10-11) : 2786 - 2799
  • [33] Cluster-based trajectory segmentation with local noise
    Damiani, Maria Luisa
    Hachem, Fatima
    Issa, Hamza
    Ranc, Nathan
    Moorcroft, Paul
    Cagnacci, Francesca
    DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (04) : 1017 - 1055
  • [34] GUIDED INPAINTING WITH CLUSTER-BASED AUXILIARY INFORMATION
    Maugey, Thomas
    Frossard, Pascal
    Guillemot, Christine
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 1702 - 1706
  • [35] SPOKEN LANGUAGE RECOGNITION WITH CLUSTER-BASED MODELING
    Kacprzak, Stanislaw
    Rybicka, Magdalena
    Kowalczyk, Konrad
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6867 - 6871
  • [36] Cluster-based color matching for image retrieval
    Kankanhalli, MS
    Mehtre, BM
    Wu, JK
    PATTERN RECOGNITION, 1996, 29 (04) : 701 - 708
  • [37] Output Regularization With Cluster-Based Soft Targets
    Mei, Jian-Ping
    Qiu, Wenhao
    Chen, Defang
    Yan, Rui
    Fan, Jing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 11463 - 11474
  • [38] Categorical Data Clustering with Automatic Selection of Cluster Number
    Liao, Hai-Yong
    Ng, Michael K.
    FUZZY INFORMATION AND ENGINEERING, 2009, 1 (01) : 5 - 25
  • [39] Cluster-Based Prediction for Batteries in Data Centers
    Haider, Syed Naeem
    Zhao, Qianchuan
    Li, Xueliang
    ENERGIES, 2020, 13 (05)
  • [40] Cluster-Based Similarity Search in Time Series
    Karamitopoulos, Leonidas
    Evangelidis, Georgios
    PROCEEDINGS OF THE 2009 FOURTH BALKAN CONFERENCE IN INFORMATICS, 2009, : 113 - 118