A scalable association rule learning and recommendation algorithm for large-scale microarray datasets

被引:0
|
作者
Haosong Li
Phillip C.-Y. Sheu
机构
[1] University of California,Department of Electrical Engineering and Computer Science
来源
Journal of Big Data | / 9卷
关键词
Association rule learning; Microarray dataset; Frequent itemset mining; Scalability; Graph partitioning; Apriori algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
Association rule learning algorithms have been applied to microarray datasets to find association rules among genes. With the development of microarray technology, larger datasets have been generated recently that challenge the current association rule learning algorithms. Specifically, the large number of items per transaction significantly increases the running time and memory consumption of such tasks. In this paper, we propose the Scalable Association Rule Learning (SARL) heuristic that efficiently learns gene-disease association rules and gene–gene association rules from large-scale microarray datasets. The rules are ranked based on their importance. Our experiments show the SARL algorithm outperforms the Apriori algorithm by one to three orders of magnitude.
引用
收藏
相关论文
共 50 条
  • [21] SCORE:: A scalable communication protocol for large-scale virtual environments
    Léty, E
    Turletti, T
    Baccelli, F
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2004, 12 (02) : 247 - 260
  • [22] Scalable Parallel Distance Field Construction for Large-Scale Applications
    Yu, Hongfeng
    Xie, Jinrong
    Ma, Kwan-Liu
    Kolla, Hemanth
    Chen, Jacqueline H.
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2015, 21 (10) : 1187 - 1200
  • [23] Association Rule Mining in Big Datasets Using Improved Cuckoo Search Algorithm
    Yadav, Poonam
    CYBERNETICS AND SYSTEMS, 2023, 54 (06) : 787 - 808
  • [24] Association Rule Mining using Apriori for Large and Growing Datasets under Hadoop
    Govada, Aruna
    Patluri, Abhinav
    Honnalgere, Atmika
    PROCEEDINGS OF 2017 VI INTERNATIONAL CONFERENCE ON NETWORK, COMMUNICATION AND COMPUTING (ICNCC 2017), 2017, : 14 - 17
  • [25] Scalable Deep Hashing for Large-Scale Social Image Retrieval
    Cui, Hui
    Zhu, Lei
    Li, Jingjing
    Yang, Yang
    Nie, Liqiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 1271 - 1284
  • [26] A Distributed Algorithm for Large-Scale Graph Partitioning
    Rahimian, Fatemeh
    Payberah, Amir H.
    Girdzijauskas, Sarunas
    Jelasity, Mark
    Haridi, Seif
    ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2015, 10 (02)
  • [27] An improved association rule mining algorithm for large data
    Zhao, Zhenyi
    Jian, Zhou
    Gaba, Gurjot Singh
    Alroobaea, Roobaea
    Masud, Mehedi
    Rubaiee, Saeed
    JOURNAL OF INTELLIGENT SYSTEMS, 2021, 30 (01) : 750 - 762
  • [28] PPSI: Practical Private Set Intersection over Large-Scale Datasets
    Qiu, Shuo
    Dai, Zekun
    Zha Daren
    Zhang, Zheng
    Liu, Yanan
    2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 1249 - 1254
  • [29] Reproducible learning in large-scale graphical models
    Zhou, Jia
    Li, Yang
    Zheng, Zemin
    Li, Daoji
    JOURNAL OF MULTIVARIATE ANALYSIS, 2022, 189
  • [30] Magiclock: Scalable Detection of Potential Deadlocks in Large-Scale Multithreaded Programs
    Cai, Yan
    Chan, W. K.
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2014, 40 (03) : 266 - 281