Cluster-Based News Representative Generation with Automatic Incremental Clustering

被引：0

作者：

Shabirin, Irsal ^{[1
]}

Barakbah, Ali Ridho ^{[1
]}

Syarif, Iwan ^{[1
]}

机构：

[1] Politekn Elekt Negeri Surabaya, Grad Sch Informat & Comp Engn, Jl Raya Its Sukolilo Sur 60111, Indonesia

来源：

EMITTER-INTERNATIONAL JOURNAL OF ENGINEERING TECHNOLOGY | 2019年 / 7卷 / 02期

关键词：

Clustering; Metadata Aggregation; Automatic Incremental Clustering; Representative News;

D O I：

10.24003/emitter.v7i2.378

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Nowadays we are facing much abundant information, especially news, and makes us confused in sorting out the information, so that it wastes our time in filtering that information. Though the news often contains similar contents that should save our time for reading. In this paper, we propose a new approach to provide aggregation mechanisms from cluster-based news and produce representative news, using our proposed Automatic Incremental Clustering. This approach presents a mechanism for clustering incremental news data and dynamically providing an automatic creation of new clusters. This approach consists of six main functions, which are (1) Data acquisition with incremental news sources from several news service providers, (2) Keyword extraction for term representation of news data, (3) Metadata aggregation for creating vector space of terms, (4) Automatic clustering for initiating news cluster generation, (5) Automatic incremental clustering for clustering incoming news data to pre-determined clusters or creating a new cluster of news data, and (6) News representation for selecting the most representative news of data clusters. For experimental study, we involved 95 news data service providers with 751 news data for for creating initial clusters with automatic clustering and 110 news data for incremental automatic clustering. Our approach performed 85.14% accuracy for incremental automatic clustering, and is able to dynamically create new clusters for incremental news data.

引用

页码：467 / 479

页数：13

共 50 条

[31] Automatic Depth Extraction from 2D Images Using a Cluster-Based Learning Framework
Herrera, Jose L.
del-Blanco, Carlos R.
Garcia, Narciso
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (07) : 3288 - 3299
[32] A general stochastic clustering method for automatic cluster discovery
Tan, Swee Chuan
Ting, Kai Ming
Teng, Shyh Wei
PATTERN RECOGNITION, 2011, 44 (10-11) : 2786 - 2799
[33] Cluster-based trajectory segmentation with local noise
Damiani, Maria Luisa
Hachem, Fatima
Issa, Hamza
Ranc, Nathan
Moorcroft, Paul
Cagnacci, Francesca
DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (04) : 1017 - 1055
[34] GUIDED INPAINTING WITH CLUSTER-BASED AUXILIARY INFORMATION
Maugey, Thomas
Frossard, Pascal
Guillemot, Christine
2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 1702 - 1706
[35] SPOKEN LANGUAGE RECOGNITION WITH CLUSTER-BASED MODELING
Kacprzak, Stanislaw
Rybicka, Magdalena
Kowalczyk, Konrad
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6867 - 6871
[36] Cluster-based color matching for image retrieval
Kankanhalli, MS
Mehtre, BM
Wu, JK
PATTERN RECOGNITION, 1996, 29 (04) : 701 - 708
[37] Output Regularization With Cluster-Based Soft Targets
Mei, Jian-Ping
Qiu, Wenhao
Chen, Defang
Yan, Rui
Fan, Jing
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 11463 - 11474
[38] Categorical Data Clustering with Automatic Selection of Cluster Number
Liao, Hai-Yong
Ng, Michael K.
FUZZY INFORMATION AND ENGINEERING, 2009, 1 (01) : 5 - 25
[39] Cluster-Based Prediction for Batteries in Data Centers
Haider, Syed Naeem
Zhao, Qianchuan
Li, Xueliang
ENERGIES, 2020, 13 (05)
[40] Cluster-Based Similarity Search in Time Series
Karamitopoulos, Leonidas
Evangelidis, Georgios
PROCEEDINGS OF THE 2009 FOURTH BALKAN CONFERENCE IN INFORMATICS, 2009, : 113 - 118

← 1 2 3 4 5 →