An Enhanced Version of MDDB-GC Algorithm: Multi-Density DBSCAN Based on Grid and Contribution for Data Stream

被引:2
作者
Hu, Shuo [1 ,2 ]
Pang, Yonglin [1 ]
He, Yong [3 ]
Yang, Yuan [1 ]
Zhang, Henian [2 ]
Zhang, Linmeng [1 ]
Zheng, Beiyi [4 ]
Hu, Caiyun [5 ]
Wang, Qing [1 ]
机构
[1] Southeast Univ, Sch Instrument Sci & Engn, Nanjing 210096, Peoples R China
[2] Nanjing South New Town Dev & Construct Management, Nanjing 210022, Peoples R China
[3] Nanjing South New Town Dev & Construct Grp Co Ltd, Nanjing 210007, Peoples R China
[4] Nanjing Lib, Nanjing 210002, Peoples R China
[5] Anhui Jianzhu Univ, Sch Civil Engn, Hefei 230601, Peoples R China
基金
中国国家自然科学基金;
关键词
data stream; clustering analysis; DBSCAN; MDDSDB-GC; contribution; grid density;
D O I
10.3390/pr11041240
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
With the continuous enrichment of big data technology application scenarios, the clustering analysis of a data stream has become a research hotspot. However, the existing data stream clustering algorithms usually have some defects, such as inability to cluster arbitrary shapes, difficulty determining some important parameters, and "static" clustering. In this study, a novel algorithm MDDSDB-GC is innovated. It selected MDDB-GC as the original algorithm that cannot deal with a data stream. In MDDSDB-GC, the calculation methods of contribution, grid density, and migration factor are effectively improved, and other parts are adjusted accordingly. The experiments show that MDDSDB-GC retains the advantage of MDDB-GC and obtains the ability to cluster an analysis for a data stream. At the same time, it effectively overcomes the above conventional defects, and its overall performance is better.
引用
收藏
页数:18
相关论文
共 20 条
[1]  
Ashour W, 2011, LECT NOTES COMPUT SC, V6936, P446, DOI 10.1007/978-3-642-23878-9_53
[2]  
Chen X., 2017, J INTEGR TECHNOL, V6, P41
[3]  
Esfandani G, 2010, LECT NOTES ARTIF INT, V6440, P202, DOI 10.1007/978-3-642-17316-5_19
[4]   A varied density-based clustering algorithm [J].
Fahim, Ahmed .
JOURNAL OF COMPUTATIONAL SCIENCE, 2023, 66
[5]   A fast DBSCAN algorithm for big data based on efficient density calculation [J].
Hanafi, Nooshin ;
Saadatfar, Hamid .
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 203
[6]   Big data clustering with varied density based on MapReduce [J].
Heidari, Safanaz ;
Alborzi, Mahmood ;
Radfar, Reza ;
Afsharkazemi, Mohammad Ali ;
Ghatari, Ali Rajabzadeh .
JOURNAL OF BIG DATA, 2019, 6 (01)
[7]  
[侯思祖 Hou Sizu], 2018, [传感器与微系统, Transducer and Microsystem Technology], V37, P137
[8]  
Jia D., 2020, MULTIMED TOOLS APPL, V29, P145
[9]  
Jungan C, 2020, INT J COMPUT SCI ENG, V21, P173
[10]  
Kellner D, 2012, 2012 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), P365, DOI 10.1109/IVS.2012.6232167