DMET-Miner: Efficient discovery of association rules from pharmacogenomic data

Cited by: 34
Authors
Agapito, Giuseppe [1]
Guzzi, Pietro H. [1]
Cannataro, Mario [1,2]
Affiliations
[1] Magna Graecia Univ Catanzaro, Dept Med & Surg Sci, Catanzaro, Italy
[2] CNR, ICAR, I-00185 Rome, Italy
Keywords
Personalized medicine; Single nucleotide polymorphism; Frequent itemset mining; Association rules; COLORECTAL-CANCER PATIENTS; POLYMORPHISM;
DOI
10.1016/j.jbi.2015.06.005
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Microarray platforms enable the investigation of allelic variants that may be correlated with phenotypes. Among them, the Affymetrix DMET (Drug Metabolism Enzymes and Transporters) platform enables the simultaneous investigation of all the genes related to drug absorption, distribution, metabolism and excretion (ADME). Although recent studies have demonstrated the effectiveness of DMET data for studying drug response or toxicity in clinical studies, there is a lack of tools for the automatic analysis of DMET data. In previous work we developed DMET-Analyzer, a methodology and supporting platform that automates the statistical study of allelic variants and has been validated in several clinical studies. Although DMET-Analyzer can correlate a single variant for each probe (related to a portion of a gene) using the Fisher test, it cannot discover multiple associations among allelic variants, because its underlying statistical analysis strategy considers one variant at a time. To overcome these limitations, we propose a new analysis methodology for DMET data based on association rule mining, and an efficient implementation of this methodology, named DMET-Miner. DMET-Miner extends the DMET-Analyzer tool with data mining capabilities and correlates the presence of a set of allelic variants with the conditions of patients' samples by exploiting association rules. To cope with the high number of frequent itemsets generated in large clinical studies based on DMET data, DMET-Miner uses an efficient data structure and implements an optimized search strategy that reduces the search space and the execution time. Preliminary experiments on synthetic DMET datasets show that DMET-Miner outperforms off-the-shelf data mining suites such as the FP-Growth algorithms available in Weka and RapidMiner. To demonstrate the biological relevance of the extracted association rules and the effectiveness of the proposed approach from a medical point of view, some preliminary studies on a real clinical dataset are currently under medical investigation. (C) 2015 Elsevier Inc. All rights reserved.
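The idea of relating a set of allelic variants to a sample condition can be illustrated with a minimal sketch. It is not DMET-Miner's algorithm: it enumerates candidate variant sets by brute force, with none of the optimized data structure or search-space pruning the abstract mentions. The probe names, genotypes, class labels, thresholds, and the function `mine_rules` are all hypothetical and chosen only for illustration.

```python
from itertools import combinations

# Hypothetical toy data: each sample is a set of "probe=genotype" items
# plus a condition label. These values are invented, not from the paper.
samples = [
    ({"P1=A/A", "P2=C/T"}, "case"),
    ({"P1=A/A", "P2=C/T"}, "case"),
    ({"P1=A/G", "P2=C/T"}, "control"),
    ({"P1=A/A", "P2=C/C"}, "control"),
]

def mine_rules(samples, min_support=0.5, min_confidence=0.8, max_size=2):
    """Brute-force search for rules of the form {variants} -> condition.

    support    = fraction of all samples containing the variants AND the condition
    confidence = fraction of samples containing the variants that have the condition
    """
    n = len(samples)
    items = sorted(set().union(*(variants for variants, _ in samples)))
    rules = []
    for k in range(1, max_size + 1):
        for antecedent in combinations(items, k):
            wanted = set(antecedent)
            # Labels of the samples that carry every variant in the antecedent.
            covered = [label for variants, label in samples if wanted <= variants]
            if not covered:
                continue
            for label in sorted(set(covered)):
                both = covered.count(label)
                support = both / n
                confidence = both / len(covered)
                if support >= min_support and confidence >= min_confidence:
                    rules.append((antecedent, label, support, confidence))
    return rules

for antecedent, label, support, confidence in mine_rules(samples):
    print(f"{set(antecedent)} -> {label} "
          f"(support={support:.2f}, confidence={confidence:.2f})")
```

On this toy data, neither variant alone reaches the confidence threshold, but the pair `P1=A/A` and `P2=C/T` together does. This is the kind of multi-variant association that a one-variant-at-a-time test cannot express.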
Pages: 273-283
Page count: 11