Feature extraction via composite scoring and voting in breast cancer

被引:2
作者
Koch, Martin [1 ]
Hanl, Markus [1 ]
Wiese, Michael [1 ]
机构
[1] Univ Bonn, Inst Pharmaceut, D-53121 Bonn, Germany
关键词
Classification; Machine learning; Triple negative breast cancer; Tumor subtypes; E2F4; GENE-EXPRESSION; CLASS DISCOVERY; QUALITY-CONTROL; CLASSIFICATION; SIGNATURE; SELECTION; CHEMOTHERAPY; VALIDATION; PREDICTION; DIAGNOSIS;
D O I
10.1007/s10549-012-2177-3
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Identification and characterization of tumor subtypes using gene expression profiles of triple negative breast cancer patients. Microarray data of four breast cancer studies were pooled and evaluated. Molecular subtype classification was performed using random forest and a novel algorithm for feature extraction via composite scoring and voting. Biological and clinical properties were evaluated via GSEA, functional annotation clustering and clinical endpoint analysis. The subtype signatures are highly predictive for distant metastasis free survival of tamoxifen-treated patients. Consensus clustering and the novel algorithm proposed three triple negative subtypes. One subtype shows low E2F4 gene expression and is predictive for survival of ER negative breast cancer patients. The other two subtypes share commonalities with luminal B tumors. Classification of breast cancer expression profiles may reveal novel tumor subtypes, possessing clinical impact. Furthermore, subtype characterizing gene signatures might hold potential for novel strategies in cancer therapy.
引用
收藏
页码:307 / 318
页数:12
相关论文
共 53 条
  • [1] Barrett T, 2005, NUCLEIC ACIDS RES, V33, pD562
  • [2] Repression of RAD51 gene expression by E2F4/p130 complexes in hypoxia
    Bindra, R. S.
    Glazer, P. M.
    [J]. ONCOGENE, 2007, 26 (14) : 2048 - 2057
  • [3] ArrayExpress - a public repository for microarray gene expression data at the EBI
    Brazma, A
    Parkinson, H
    Sarkans, U
    Shojatalab, M
    Vilo, J
    Abeygunawardena, N
    Holloway, E
    Kapushesky, M
    Kemmeren, P
    Lara, GG
    Oezcimen, A
    Rocca-Serra, P
    Sansone, SA
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 68 - 71
  • [4] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [5] Brier G. W., 1950, Monthly weather review, V78, P1, DOI [DOI 10.1175/1520-0493(1950)078, DOI 10.1175/1520-0493(1950)078ANDLT
  • [6] 0001:VOFEITANDGT
  • [7] 2.0.CO
  • [8] 2, 10.1175/1520-0493(1950)078()0001:VOFEIT()2.0.CO
  • [9] 2, DOI 10.1175/1520-0493(1950)0782.0.CO
  • [10] 2]