Predicting the prognosis of breast cancer by integrating clinical and microarray data with Bayesian networks

被引:231
作者
Gevaert, Olivier
De Smet, Frank
Timmerman, Dirk
Moreau, Yves
De Moor, Bart
机构
[1] Katholieke Univ Leuven, Dept Elect Engn ESAT SCD, B-3001 Heverlee, Belgium
[2] Natl Alliance Christian Mutualities, Med Direct, B-1031 Brussels, Belgium
[3] Katholieke Univ Leuven, Univ Hosp Gasthuisberg, Dept Obstet & Gynecol, B-3000 Louvain, Belgium
关键词
D O I
10.1093/bioinformatics/btl230
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Clinical data, such as patient history, laboratory analysis, ultrasound parameters-which are the basis of day-to-day clinical decision support-are often underused to guide the clinical management of cancer in the presence of microarray data. We propose a strategy based on Bayesian networks to treat clinical and microarray data on an equal footing. The main advantage of this probabilistic model is that it allows to integrate these data sources in several ways and that it allows to investigate and understand the model structure and parameters. Furthermore using the concept of a Markov Blanket we can identify all the variables that shield off the class variable from the influence of the remaining network. Therefore Bayesian networks automatically perform feature selection by identifying the ( in) dependency relationships with the class variable. Results: We evaluated three methods for integrating clinical and microarray data: decision integration, partial integration and full integration and used them to classify publicly available data on breast cancer patients into a poor and a good prognosis group. The partial integration method is most promising and has an independent test set area under the ROC curve of 0.845. After choosing an operating point the classification performance is better than frequently used indices.
引用
收藏
页码:E184 / E190
页数:7
相关论文
共 34 条
[1]   Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses [J].
Bhattacharjee, A ;
Richards, WG ;
Staunton, J ;
Li, C ;
Monti, S ;
Vasa, P ;
Ladd, C ;
Beheshti, J ;
Bueno, R ;
Gillette, M ;
Loda, M ;
Weber, G ;
Mark, EJ ;
Lander, ES ;
Wong, W ;
Johnson, BE ;
Golub, TR ;
Sugarbaker, DJ ;
Meyerson, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (24) :13790-13795
[2]  
BLAMEY RW, 1979, CLIN ONCOL-US, V5, P227
[3]   Use of the St Gallen classification for patients with node-negative breast cancer may lead to overuse of adjuvant chemotherapy [J].
Boyages, J ;
Chua, B ;
Taylor, R ;
Bilous, M ;
Salisbury, E ;
Wilcken, N ;
Ung, O .
BRITISH JOURNAL OF SURGERY, 2002, 89 (06) :789-796
[4]   Exploring the new world of the genome with DNA microarrays [J].
Brown, PO ;
Botstein, D .
NATURE GENETICS, 1999, 21 (Suppl 1) :33-37
[5]  
COOPER GF, 1992, MACH LEARN, V9, P309, DOI 10.1007/BF00994110
[6]   Good Old clinical markers have similar power in breast cancer prognosis as microarray gene expression profilers [J].
Edén, P ;
Ritz, C ;
Rose, C ;
Fernö, M ;
Peterson, C .
EUROPEAN JOURNAL OF CANCER, 2004, 40 (12) :1837-1841
[7]  
Eifel P, 2001, JNCI-J NATL CANCER I, V93, P979
[8]  
GLENISSON P, 2004, GENOME BIOL, P5
[9]   Meeting highlights: International Consensus Panel on the Treatment of Primary Breast Cancer [J].
Goldhirsch, A ;
Glick, JH ;
Gelber, RD ;
Senn, HJ .
JOURNAL OF THE NATIONAL CANCER INSTITUTE, 1998, 90 (21) :1601-1608
[10]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537