Sparse feature selection for classification and prediction of metastasis in endometrial cancer

被引:18
作者
Ahsen, Mehmet Eren [1 ]
Boren, Todd P. [2 ]
Singh, Nitin K. [3 ]
Misganaw, Burook [4 ]
Mutch, David G. [5 ]
Moore, Kathleen N. [6 ]
Backes, Floor J. [7 ]
McCourt, Carolyn K. [8 ]
Lea, Jayanthi S. [9 ]
Miller, David S. [9 ]
White, Michael A. [9 ]
Vidyasagar, Mathukumalli [10 ]
机构
[1] IBM Res, Yorktown Hts, NY USA
[2] Univ Tennessee, Coll Med, Knoxville, TN USA
[3] Apple R&D, Austin, TX USA
[4] Harvard Univ, Cambridge, MA 02138 USA
[5] Washington Univ, Sch Med, St Louis, MO USA
[6] Univ Oklohoma, Norman, OK USA
[7] Ohio State Univ, Columbus, OH 43210 USA
[8] Brown Univ, Women & Infants Hosp, Providence, RI USA
[9] Univ Texas Southwestern Med Ctr Dallas, Dallas, TX 75390 USA
[10] Univ Texas Dallas, Richardson, TX 75083 USA
来源
BMC GENOMICS | 2017年 / 18卷
关键词
Endometrial cancer; Lymph node metastasis; Sparse classification; Machine learning; LYMPH-NODE METASTASIS; PROSPECTIVE MULTICENTER; GENE SELECTION; MICRORNAS; CARCINOMA; SIRNAS; TRIAL;
D O I
10.1186/s12864-017-3604-y
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Metastasis via pelvic and/or para-aortic lymph nodes is a major risk factor for endometrial cancer. Lymph-node resection ameliorates risk but is associated with significant co-morbidities. Incidence in patients with stage I disease is 4-22% but no mechanism exists to accurately predict it. Therefore, national guidelines for primary staging surgery include pelvic and para-aortic lymph node dissection for all patients whose tumor exceeds 2cm in diameter. We sought to identify a robust molecular signature that can accurately classify risk of lymph node metastasis in endometrial cancer patients. 86 tumors matched for age and race, and evenly distributed between lymph node-positive and lymph node-negative cases, were selected as a training cohort. Genomic micro-RNA expression was profiled for each sample to serve as the predictive feature matrix. An independent set of 28 tumor samples was collected and similarly characterized to serve as a test cohort. Results: A feature selection algorithm was designed for applications where the number of samples is far smaller than the number of measured features per sample. A predictive miRNA expression signature was developed using this algorithm, which was then used to predict the metastatic status of the independent test cohort. A weighted classifier, using 18 micro-RNAs, achieved 100% accuracy on the training cohort. When applied to the testing cohort, the classifier correctly predicted 90% of node-positive cases, and 80% of node-negative cases (FDR = 6.25%). Conclusion: Results indicate that the evaluation of the quantitative sparse-feature classifier proposed here in clinical trials may lead to significant improvement in the prediction of lymphatic metastases in endometrial cancer patients.
引用
收藏
页数:12
相关论文
共 31 条
[1]  
[Anonymous], ARXIV201414108229
[2]   HE4 and CA125 levels in the preoperative assessment of endometrial cancer patients: a prospective multicenter study (ENDOMET) [J].
Antonsen, Sofie L. ;
Hogdall, Estrid ;
Christensen, Ib J. ;
Lydolph, Magnus ;
Tabor, Ann ;
Jakobsen, Annika Loft ;
Fago-Olsen, Carsten L. ;
Andersen, Erik S. ;
Jochumsen, Kirsten ;
Hogdall, Claus .
ACTA OBSTETRICIA ET GYNECOLOGICA SCANDINAVICA, 2013, 92 (11) :1313-1322
[3]   A NEW TEST FOR 2X2 TABLES [J].
BARNARD, GA .
NATURE, 1945, 156 (3954) :177-177
[4]   MicroRNAs: Genomics, biogenesis, mechanism, and function (Reprinted from Cell, vol 116, pg 281-297, 2004) [J].
Bartel, David P. .
CELL, 2007, 131 (04) :11-29
[5]   MicroRNAs and their target messenger RNAs associated with endometrial carcinogenesis [J].
Boren, Todd ;
Xiong, Yin ;
Hakam, Ardeshir ;
Wenham, Robert ;
Apte, Sachin ;
Wei, ZhengZheng ;
Kamath, Siddharth ;
Chen, Dung-Tsa ;
Dressman, Holly ;
Lancaster, Johnathan M. .
GYNECOLOGIC ONCOLOGY, 2008, 110 (02) :206-215
[6]  
Bradley P. S., 1998, Machine Learning. Proceedings of the Fifteenth International Conference (ICML'98), P82
[7]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[8]  
CREASMAN WT, 1987, CANCER, V60, P2035, DOI 10.1002/1097-0142(19901015)60:8+<2035::AID-CNCR2820601515>3.0.CO
[9]  
2-8
[10]   Post-transcriptional gene silencing by siRNAs and miRNAs [J].
Filipowicz, W ;
Jaskiewicz, L ;
Kolb, FA ;
Pillai, RS .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2005, 15 (03) :331-341