Using rule induction methods to analyze gene expression data

被引:0
|
作者
Livingston, G [1 ]
Li, X [1 ]
Li, GY [1 ]
Hao, LW [1 ]
Zhou, RP [1 ]
机构
[1] Univ Massachusetts, Lowell, MA 01854 USA
来源
PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE | 2003年
关键词
D O I
10.1109/CSB.2003.1227363
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We have applied rule induction to a publicly available adenocarcinoma gene expression data set. The typical approach to the analysis of gene expression data is to cluster the genes. However, interpreting the resulting clusters may be difficult. With rules, the interpretation is more obvious (e.g., (CDKN3 > 253) --> (tumor-stage = 3)). Our rule induction tool is a new semi-autonomous discovery system we are developing called HAMB, and we used it to learn rules for survival status, survival time, and tumor stage. When we searched the world-wide web for publications relating our top 53 genes from our discovered rules to lung cancer, we found that 9 of them are known to be associated with lung cancer, 19 of them are known to be associated with other types of cancer, and the remaining 25 were not known to be associated with cancer. Our results suggest that the latter two groups of genes should be examined more closely for their association with lung cancer.
引用
收藏
页码:439 / 440
页数:2
相关论文
共 50 条
  • [21] A rule induction algorithm for continuous data using analysis of variance
    Konda, R
    Rajurkar, KP
    Proceedings of the IEEE SoutheastCon 2004: EXCELLENCE IN ENGINEERING, SCIENCE, AND TECHNOLOGY, 2005, : 489 - 494
  • [22] NormExpression: An R Package to Normalize Gene Expression Data Using Evaluated Methods
    Wu, Zhenfeng
    Liu, Weixiang
    Jin, Xiufeng
    Ji, Haishuo
    Wang, Hua
    Glusman, Gustavo
    Robinson, Max
    Liu, Lin
    Ruan, Jishou
    Gao, Shan
    FRONTIERS IN GENETICS, 2019, 10
  • [23] Methods and approaches in the analysis of gene expression data
    Dopazo, J
    Zanders, E
    Dragoni, I
    Amphlett, C
    Falciani, F
    JOURNAL OF IMMUNOLOGICAL METHODS, 2001, 250 (1-2) : 93 - 112
  • [24] A framework for significance analysis of gene expression data using dimension reduction methods
    Gidskehaug, Lars
    Anderssen, Endre
    Flatberg, Arnar
    Alsberg, Bjorn K.
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [25] Clustering methods for microarray gene expression data
    Belacel, Nabil
    Wang, Qian
    Cuperlovic-Culf, Miroslava
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2006, 10 (04) : 507 - 531
  • [26] Machine Learning Methods for Cancer Classification Using Gene Expression Data: A Review
    Alharbi, Fadi
    Vakanski, Aleksandar
    BIOENGINEERING-BASEL, 2023, 10 (02):
  • [27] A framework for significance analysis of gene expression data using dimension reduction methods
    Lars Gidskehaug
    Endre Anderssen
    Arnar Flatberg
    Bjørn K Alsberg
    BMC Bioinformatics, 8
  • [28] Effective dimension reduction methods for tumor classification using gene expression data
    Antoniadis, A
    Lambert-Lacroix, S
    Leblanc, F
    BIOINFORMATICS, 2003, 19 (05) : 563 - 570
  • [29] PCA-FA:Applying Supervised Learning to Analyze Gene Expression Data
    翁时锋
    张长水
    张学工
    TsinghuaScienceandTechnology, 2004, (04) : 428 - 434
  • [30] Prediction of transcription factor binding to DNA using rule induction methods
    Huss, Mikael
    Nordstrom, Karin
    JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2006, 3 (02) : 247 - 263