Integrated Analysis of Gene Expression and Genome-wide DNA Methylation for Tumor Prediction: An Association Rule Mining-based Approach

被引:0
作者
Mallik, Saurav [1 ]
Mukhopadhyay, Anirban [2 ]
Maulik, Ujjwal [3 ]
Bandyopadhyay, Sanghamitra [1 ]
机构
[1] Indian Stat Inst, Machine Intelligence Unit, Kolkata 700108, India
[2] Univ Kalyani, Dept Syst & Comp Engn, Kalyani 741235, W Bengal, India
[3] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata 700032, India
来源
PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB) | 2013年
关键词
Gene expression; DNA methylation; statistical tests; association rule mining; Genetic Algorithm (GA) based rank aggregation; rule-in terestingness measures; STATISTICAL-METHODS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Statistical analysis and association rule mining are two most efficient techniques, where the first one is used to identify differentially expressed/methylated genes across different types of samples or experimental conditions and the second one is used to determine expression/methylation relationships among them. In this article, we have performed an integrated analysis of statistical methods and association rule mining on mRNA expression and DNA methylation datasets for the prediction of Uterine Leiomyoma. Moreover, we have proposed a novel rule-base classifier. Depending on 16 different rule-interestingness measures, we have applied a Genetic Algorithm based rank aggregation technique on the association rules which are generated from the training data by Apriori association rule mining algorithm. After determining the ranks of the rules, we have conducted a majority voting technique on each test point to determine its class-label (i.e. tumor or normal class-label) through weighted-sum method. We have run this classifier on the combined dataset using k-fold cross-validation and also performed a comparative performance analysis with other popular rule-base classifiers. Finally, we have predicted the status of some important genes (through frequency analysis in association rules for tumor and normal class-labels individually) that have a major role for tumor formation in Uterine Leiomyoma.
引用
收藏
页码:120 / 127
页数:8
相关论文
共 25 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
Anandhavalli N I., 2010, J COMPUTING, V2, P110
[3]  
[Anonymous], 2003, P 9 ACM SIGKDD INT C
[4]  
[Anonymous], BMC BIOINFORMATICS
[5]   Strategy for elucidating differentially expressed genes in leiomyomata identified by microarray technology [J].
Catherino, WH ;
Prupas, C ;
Tsibris, JCM ;
Leppert, PC ;
Payson, M ;
Nieman, LK ;
Segars, JH .
FERTILITY AND STERILITY, 2003, 80 (02) :282-290
[6]   Mining gene expression databases for association rules [J].
Creighton, C ;
Hanash, S .
BIOINFORMATICS, 2003, 19 (01) :79-86
[7]  
Dudoit S, 2002, STAT SINICA, V12, P111
[8]   A two-sample Bayesian t-test for microarray data [J].
Fox, RJ ;
Dimmic, MW .
BMC BIOINFORMATICS, 2006, 7 (1)
[9]   A TEST FOR NORMALITY OF OBSERVATIONS AND REGRESSION RESIDUALS [J].
JARQUE, CM ;
BERA, AK .
INTERNATIONAL STATISTICAL REVIEW, 1987, 55 (02) :163-172
[10]   Comparison of various statistical methods for identifying differential gene expression in replicated microarray data [J].
Kim, SY ;
Lee, JW ;
Sohn, IS .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2006, 15 (01) :3-20