Integrative analysis of survival-associated gene sets in breast cancer

Cited by: 16
Authors
Varn, Frederick S. [1]
Ung, Matthew H. [1]
Lou, Shao Ke [1]
Cheng, Chao [1, 2, 3]
Affiliations
[1] Geisel Sch Med Dartmouth, Dept Genet, Hanover, NH 03755 USA
[2] Geisel Sch Med Dartmouth, Inst Quantitat Biomed Sci, Lebanon, NH 03766 USA
[3] Geisel Sch Med Dartmouth, Norris Cotton Canc Ctr, Lebanon, NH 03766 USA
Keywords
Breast cancer; Gene sets; Prognosis; Survival prediction; Expression signature; Transcription factors; Grade
DOI
10.1186/s12920-015-0086-0
Chinese Library Classification (CLC) number
Q3 [Genetics]
Subject classification codes
071007; 090102
Abstract
Background: Patient gene expression information has recently become a clinical feature used to evaluate breast cancer prognosis. The emergence of prognostic gene sets that take advantage of these data has led to a rich library of information that can be used to characterize the molecular nature of a patient's cancer. Identifying robust gene sets that are consistently predictive of a patient's clinical outcome has become one of the main challenges in the field.
Methods: We supplied our previously established BASE algorithm with patient gene expression data and gene sets from MSigDB to develop the gene set activity score (GSAS), a metric that quantitatively assesses a gene set's activity level in a given patient. We used this metric, along with patient time-to-event data, to perform survival analyses and identify the gene sets that were significantly correlated with patient survival. We then performed cross-dataset analyses to identify robust prognostic gene sets and to classify patients by metastasis status. Additionally, we created a gene set network based on component gene overlap to explore the relationships between gene sets derived from MSigDB. We developed a novel gene set based on this network's topology and applied the GSAS metric to characterize its role in patient survival.
Results: Using the GSAS metric, we identified 120 gene sets that were significantly associated with patient survival in all datasets tested. The gene overlap network analysis yielded a novel gene set enriched in genes shared by the robustly predictive gene sets. This gene set was highly correlated with patient survival when used alone. Most interestingly, removing the genes in this gene set from the MSigDB gene pool resulted in a large reduction in the number of predictive gene sets, suggesting a prominent role for these genes in breast cancer progression.
Conclusions: The GSAS metric provided a useful medium by which we systematically investigated how gene sets from MSigDB relate to breast cancer patient survival. We used this metric to identify predictive gene sets and to construct a novel gene set containing genes heavily involved in cancer progression.
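The gene set network "based on component gene overlap" mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state how overlap was measured or thresholded, so the Jaccard index, the min_overlap and min_sets parameters, the function names, and the toy gene sets below are all assumptions made for illustration; this is not the authors' implementation.

```python
from collections import Counter
from itertools import combinations


def jaccard(a, b):
    """Jaccard index of two sets: size of intersection over size of union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def overlap_network(gene_sets, min_overlap=0.2):
    """Return edges (set_a, set_b, weight) for every pair of gene sets whose
    Jaccard index is at least min_overlap (a hypothetical threshold)."""
    edges = []
    for (name_a, genes_a), (name_b, genes_b) in combinations(sorted(gene_sets.items()), 2):
        weight = jaccard(genes_a, genes_b)
        if weight >= min_overlap:
            edges.append((name_a, name_b, weight))
    return edges


def shared_genes(gene_sets, min_sets=2):
    """Return the genes that occur in at least min_sets of the gene sets."""
    counts = Counter(g for genes in gene_sets.values() for g in genes)
    return {g for g, n in counts.items() if n >= min_sets}


# Toy data: set names and gene symbols are placeholders, not MSigDB content.
toy_sets = {
    "SET_A": {"G1", "G2", "G3", "G4"},
    "SET_B": {"G2", "G3", "G4", "G5"},
    "SET_C": {"G3", "G6", "G7"},
    "SET_D": {"G8", "G9"},
}

edges = overlap_network(toy_sets, min_overlap=0.2)
core = shared_genes(toy_sets, min_sets=3)
```

On the toy data, only SET_A and SET_B share enough genes to be linked, and G3 is the only gene present in three sets. A gene set built from such widely shared genes is loosely analogous to the topology-derived gene set described in the abstract, although the authors' actual construction may differ.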
Pages: 16