High Performance Clustering Techniques: A Survey

Cited by: 1
Authors
Savvas, Ilias K. [1]
Michos, Christos [1]
Chernov, Andrey [2]
Butakova, Maria [2]
Affiliations
[1] Univ Thessaly, Larisa, Greece
[2] Rostov State Transport Univ, Rostov Na Donu, Russia
Source
PROCEEDINGS OF THE FOURTH INTERNATIONAL SCIENTIFIC CONFERENCE INTELLIGENT INFORMATION TECHNOLOGIES FOR INDUSTRY (IITI'19) | 2020 / Vol. 1156
Keywords
Clustering; High performance computing; DBSCAN; K-means; DBSCAN ALGORITHM;
DOI
10.1007/978-3-030-50097-9_26
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We live in a world flooded with data, and Big Data has become a key issue. Huge amounts of data, measured in petabytes and more, are produced by a wide variety of applications in areas such as biology, medicine, astronomy, geology, and geography, to name just a few, and this trend is steadily increasing. Data mining is the process of extracting useful information from large datasets. There are different approaches to discovering properties of datasets, and machine learning is one of them. In machine learning, unsupervised learning deals with unlabeled datasets. One of its primary approaches is clustering, the process of grouping similar entities together. Improving the performance of such techniques is a challenge, especially when dealing with huge amounts of data. In this work, we present a survey of techniques that increase the efficiency of two well-known clustering algorithms, k-means and DBSCAN.
Pages: 252-259
Page count: 8