Clustering the Imbalanced Datasets using Modified Kohonen Self-Organizing Map (KSOM)

被引:0
|
作者
Ahmad, Azlin [1 ]
Yusoff, Rubiyah [2 ]
Ismail, Mohd Najib [3 ]
Rosli, Nenny Ruthfalydia [2 ]
机构
[1] Univ Teknol MARA UiTM, Fac Comp & Math Sci, Shah Alam, Malaysia
[2] Univ Teknol Malaysia, Malaysia Japan Int Inst Technol, Kuala Lumpur, Malaysia
[3] Asia Pacific Univ Technol & Innovat, Kuala Lumpur, Malaysia
来源
关键词
Neural Network; Kohonen Self Organizing map (KSOM); clustering; imbalanced data set;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The distribution of data plays an important role in determining the successfulness of learning process in machine learning. Data sets with imbalanced distribution may lead to biased results, especially in clustering. If the data is insufficient, the clustering will not be able to cluster and this will add randomness to the grouping. Therefore, the KSOM algorithm is modified to improve the clustering process. This modification is done based on the exploration and exploitation procedures in Ant Clustering Algorithm (ACA). To investigate the effectiveness of the modified algorithm, three imbalanced data sets are chosen; glass, Wisconsin diagnostic breast cancer and tropical wood data set. From the result, the modified KSOM has able to produce accurate number of clusters, reduce the number of overlapped cluster and slightly improve the percentage of accuracy.
引用
收藏
页码:751 / 755
页数:5
相关论文
共 50 条