Classification of individual lung cancer cell lines based on DNA methylation markers - Use of linear discriminant analysis and artificial neural networks

被引:42
作者
Marchevsky, AM
Tsou, JA
Laird-Offringa, IA
机构
[1] Univ So Calif, Norris Canc Ctr, Keck Sch Med, Dept Surg, Los Angeles, CA 90089 USA
[2] Univ So Calif, Norris Canc Ctr, Keck Sch Med, Dept Biochem & Mol Biol, Los Angeles, CA 90089 USA
[3] Cedars Sinai Med Ctr, Dept Pathol & Lab Med, Los Angeles, CA 90048 USA
关键词
D O I
10.1016/S1525-1578(10)60488-6
中图分类号
R36 [病理学];
学科分类号
100104 ;
摘要
The classification of small cell lung cancer (SCLC) and non-small cell lung cancer (NSCLC) can pose diagnostic problems due to inter-observer variability and other limitations of histopathology. There is an interest in developing classificatory models of lung neoplasms based on the analysis of multivariate molecular data with statistical methods and/or neural networks. DNA methylation levels at 20 loci were measured in 41 SCLC and 46 NSCLC cell lines with the quantitative real-time PCR method MethyLight. The data were analyzed with artificial neural networks (ANN) and linear discriminant analysis (LIDA) to classify the cell lines into SCLC or into NSCLC. Models used either data from all 20 loci, or from five significant DNA methylation loci that were selected by a step-wise back-propagation procedure (PTGS2, CALCA, MTHFR, ESR1, and CDKN2A). The data were sorted randomly by cell line into 10 different data sets, each with training and testing subsets composed of 71 and 16 of the cases, respectively. Ten ANN models were trained using the 10 data sets: five using 20 variables, and five using the five variables selected by step-wise back-propagation. The ANN models with 20 input variables correctly classified 100% of the cell lines, while the models with only five variables correctly classified 87 to 100% of cases. For comparison, 10 different LDA models were trained and tested using the same data sets with either the original data or with logarithmically transformed data. Again, half of the models used all 20 variables while the others used only the five significant variables. LDA models provided correct classifications in 62.5% to, 87.5% of cases. The classifications provided by all of the different models were compared with kappa. statistics, yielding kappa values ranging from 0.25 to 1.0. We conclude that ANN models based on DNA methylation profiles can objectively classify SCLC and NSCLC cells lines with substantial to perfect concordance, while IDA models based on DNA methylation profiles provide poor to substantial concordance. Our work supports the promise of ANN analysis of DNA methylation data as a powerful approach for the development of automated methods for lung cancer classification.
引用
收藏
页码:28 / 36
页数:9
相关论文
共 50 条
[1]  
AFIFI AA, 1999, COMPUTER AIDED MULTI, P243
[2]  
An CST, 1997, HEPATOLOGY, V26, P1224
[3]  
BARTELS PH, 1994, IMAGE ANAL PRIMER PA, P2
[4]   Aberrant patterns of DNA methylation, chromatin formation and gene expression in cancer [J].
Baylin, SB ;
Esteller, M ;
Rountree, MR ;
Bachman, KE ;
Schuebel, K ;
Herman, JG .
HUMAN MOLECULAR GENETICS, 2001, 10 (07) :687-692
[5]  
Bellotti M, 1997, MODERN PATHOL, V10, P1221
[6]   DNA methylation patterns and epigenetic memory [J].
Bird, A .
GENES & DEVELOPMENT, 2002, 16 (01) :6-21
[7]  
Cenci M, 2000, ANTICANCER RES, V20, P3887
[8]   Aberrant CpG-island methylation has non-random and tumour-type-specific patterns [J].
Costello, JF ;
Frühwald, MC ;
Smiraglia, DJ ;
Rush, LJ ;
Robertson, GP ;
Gao, X ;
Wright, FA ;
Feramisco, JD ;
Peltomäki, P ;
Lang, JC ;
Schuller, DE ;
Yu, L ;
Bloomfield, CD ;
Caligiuri, MA ;
Yates, A ;
Nishikawa, R ;
Huang, HJS ;
Petrelli, NJ ;
Zhang, XL ;
O'Dorisio, MS ;
Held, WA ;
Cavenee, WK ;
Plass, C .
NATURE GENETICS, 2000, 24 (02) :132-138
[9]   Methylation matters [J].
Costello, JF ;
Plass, C .
JOURNAL OF MEDICAL GENETICS, 2001, 38 (05) :285-303
[10]   MethyLight: a high-throughput assay to measure DNA methylation [J].
Eads, Cindy A. ;
Danenberg, Kathleen D. ;
Kawakami, Kazuyuki ;
Saltz, Leonard B. ;
Blake, Corey ;
Shibata, Darryl ;
Danenberg, Peter V. ;
Laird, Peter W. .
NUCLEIC ACIDS RESEARCH, 2000, 28 (08) :32