Data mining by decomposition: Adaptive search for hypothesis generation

被引:7
作者
Bhargava, HK
机构
关键词
D O I
10.1287/ijoc.11.3.239
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Data mining methods search large databases for interesting patterns that may lead to useful decisions in organisations. When the database is defined over scores of attributes, the complexity of the search increases due to the combinatorial explosion at the attribute-space level, because billions of attribute subsets are candidates for forming interesting patterns in the database. A useful way to address this complexity is to partition the search problem and apply separate, but Intertwined, algorithms for attribute search and pattern search. A genetic algorithm Is applied on the attribute search problem to identify subsets that lead to more interesting patterns. This method is applied on a real world database arising from the investigations into the "Persian Gulf Illness." Computational experiments resulted in significant success compared to random or manual attribute selection.
引用
收藏
页码:239 / 247
页数:9
相关论文
共 15 条
[1]  
ABRAMS KR, 1996, J HLTH SERVICES POLI, V1, P253
[2]  
[Anonymous], 1991, KNOWLEDGE DISCOVERY
[3]  
[Anonymous], INTRO EPIDEMIOLOGY
[4]  
Bhargava HK, 1997, P ANN HICSS, P539, DOI 10.1109/HICSS.1997.663214
[5]  
*COMPR CLIN EV PRO, 1996, UNPUB CCEP REP 18 59
[6]  
ELDER J, 1996, ADV KNOWLEDGE DISCOV, pCH4
[7]  
FAYYAD UM, 1996, ADV KNOWLEDGE DISCOV, pCH1
[8]  
Goldberg D., 1989, GENETIC ALGORITHMS S
[9]  
HOLLAND JH, 1975, ADAPTATION NATURAL A
[10]  
KINGDON J, 1995, P 1 INT C GEN ALG EN, P543