A scalable association rule learning and recommendation algorithm for large-scale microarray datasets

Cited by: 5
Authors
Li, Haosong [1]
Sheu, Phillip C-Y [1]
Affiliations
[1] Univ Calif Irvine, Dept Elect Engn & Comp Sci, Irvine, CA 92697 USA
Keywords
Association rule learning; Microarray dataset; Frequent itemset mining; Scalability; Graph partitioning; Apriori algorithm
DOI
10.1186/s40537-022-00577-4
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Association rule learning algorithms have been applied to microarray datasets to find association rules among genes. With the development of microarray technology, larger datasets have been generated recently that challenge the current association rule learning algorithms. Specifically, the large number of items per transaction significantly increases the running time and memory consumption of such tasks. In this paper, we propose the Scalable Association Rule Learning (SARL) heuristic that efficiently learns gene-disease association rules and gene-gene association rules from large-scale microarray datasets. The rules are ranked based on their importance. Our experiments show the SARL algorithm outperforms the Apriori algorithm by one to three orders of magnitude.
Pages: 25