Efficient mining of multilevel gene association rules from microarray and gene ontology

被引:0
作者
Vincent S. Tseng
Hsieh-Hui Yu
Shih-Chiang Yang
机构
[1] National Cheng Kung University,Department Computer Science and Information Engineering
[2] National Cheng Kung University,Institute of Medical Informatics
来源
Information Systems Frontiers | 2009年 / 11卷
关键词
Data mining; Microarray; Gene expression analysis; Association rules mining; Multi-level association rules; Gene ontology;
D O I
暂无
中图分类号
学科分类号
摘要
Some recent studies have shown that association rules can reveal the interactions between genes that might not have been revealed using traditional analysis methods like clustering. However, the existing studies consider only the association rules among individual genes. In this paper, we propose a new data mining method named MAGO for discovering the multilevel gene association rules from the gene microarray data and the concept hierarchy of Gene Ontology (GO). The proposed method can efficiently find out the relations between GO terms by analyzing the gene expressions with the hierarchy of GO. For example, with the biological process in GO, some rules like Process A (up) → Process B (up) cab be discovered, which indicates that the genes involved in Process B of GO are likely to be up-regulated whenever those involved in Process A are up-regulated. Moreover, we also propose a constrained mining method named CMAGO for discovering the multilevel gene expression rules with user-specified constraints. Through empirical evaluation, the proposed methods are shown to have excellent performance in discovering the hidden multilevel gene association rules.
引用
收藏
页码:433 / 447
页数:14
相关论文
共 69 条
  • [1] Brown M. P. S.(2000)Know-ledge-based analysis of microarray gene expression data by using support vector machines Proceedings of the National Academy of Sciences, USA 97 262-267
  • [2] Grundy W. N.(2006)Integrated analysis of gene expression by association rules discovery BMC Bioinformatics 7 1-16
  • [3] Lin D.(2006)Liver hepcidin and stainable iron expression in biliary atresia Pediatric Research 59 662-666
  • [4] Cristianini N.(2003)mining gene expression databases for association rules Bioinformatics 19 79-86
  • [5] Sug-net C. W.(1999)Molecular classification of cancer: class discovery and class prediction by gene expression monitoring Science 286 531-537
  • [6] Furey T. S.(2006)Interactive gene clustering—a case study of breast cancer microarray data Information Systems Frontiers 8 21-27
  • [7] Carmona-Saez P.(2007)Large-scale regulatory network analysis from microarray data: modified Bayesian network learning and association rule mining Decision Support Systems 43 1207-1225
  • [8] Chagoyen M.(2000)Functional Discovery via a compendium of expression profiles Cell 102 109-126
  • [9] Rodriguez A.(2003)Learning rule-based models of biological process from gene expression time profiles using Gene Ontology Bioinformatics 19 1116-1123
  • [10] Trelles O.(1967)Hierarchical Clustering Schemes Psychometrika 2 241-254