Integration of Association Rules and Clustering Models Obtained from Multiple Data Sources

被引:0
作者
Morales Vega, Daymi [1 ]
Martin Rodriguez, Diana [1 ]
Wilford Rivera, Ingrid [1 ]
Rosete Suarez, Alejandro [1 ]
机构
[1] Inst Super Politecn Jose Antonio Echeverria, Havana, Cuba
来源
COMPUTACION Y SISTEMAS | 2012年 / 16卷 / 02期
关键词
Integration; data mining models; association rules; clustering; patterns;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One possible way to discover knowledge over distributed data sources, using Data Mining techniques, is to reuse the models of local mining found in each data source and look for patterns globally valid. This process can be done without accessing the data directly. This paper focuses on the proposal of two methods for integrating data mining models: Association Rules and Clustering Models, specifically rules were obtained using support and confidence as measures of quality and clustering based on centroids. It was necessary to use metaheuristics algorithms to find a global model that is as close as possible to the local models. These models were obtained using homogeneous data sources. The experimental study showed that the proposed methods obtain global models of quality in a reasonable time when increasing the amount of local patterns to integrate.
引用
收藏
页码:175 / 189
页数:15
相关论文
共 17 条
  • [1] Agrawal R., 1994, 20 VLDB C, P487, DOI DOI 10.1007/BF02948845
  • [2] Fajardo J., 2011, REV INT INVESTIGACIO, V1, P57
  • [3] Fern X. Z., 2004, P 21 INT C MACHINE L, P36, DOI [DOI 10.1145/1015330.1015414, 10.1145/1015330.1015414]
  • [4] Gionis A, 2005, PROC INT CONF DATA, P341
  • [5] A scalable framework for cluster ensembles
    Hore, Prodip
    Hall, Lawrence O.
    Goldgof, Dmitry B.
    [J]. PATTERN RECOGNITION, 2009, 42 (05) : 676 - 688
  • [6] Horn J., 1994, FDN GENETIC ALGORITH, P243
  • [7] Jensen VC, 2000, LECT NOTES ARTIF INT, V1805, P49
  • [8] Lange T., 2005, P 11 ACM SIGKDD INT, P147
  • [9] Long B, 2005, FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, P282
  • [10] Paul S, 2008, INT J COMPUT SCI NET, V8, P334