Microarray-based prediction of Parkinson's disease using clinical data as additional response variables

被引:2
作者
Karlsson, Magdalena Kauczynska [1 ,2 ]
Lonneborg, Anders [1 ]
Saebo, Solve [2 ]
机构
[1] DiaGenic ASA, N-0663 Oslo, Norway
[2] Norwegian Univ Life Sci, Dept Chem Biotechnol & Food Sci, N-1432 As, Norway
关键词
microarray data; PLS regression; additional responses; Parkinson's disease; prediction; PARTIAL LEAST-SQUARES; GENE-EXPRESSION; CANCER; PLS; REPRODUCIBILITY; REGRESSION; DIAGNOSIS;
D O I
10.1002/sim.5588
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recent discoveries and developments in the field of genomics have led to the commercialization of novel diagnostic devices for studying disease or estimating therapeutic outcomes in individual patients. With this emerging field, the emphasis is shifting to integration of clinical research into product development. Data acquisition is primary in the initial exploratory phase of product development, and during the process of sample collection and data generation in clinical microarray studies, great amounts of additional information, such as demographic, clinical, and study design variables associated with the data, are often accumulated and made available. Including additional information in classification has been addressed in many different ways. However, in previous studies, the additional information have consistently been treated as extra predictors, which can be a problem for future prediction if such information are not available or collectable for the new samples. We instead propose to adopt a method called canonical partial least squares, which for our purpose, only uses the additional information at the model building stage to stabilize the construction of a classifier for disease status from microarray data. The canonical partial least squares method is compared with regular partial least squares for the classification of Parkinson's disease from gene expression in peripheral blood samples and also through computer simulations. The present study showed that including clinical data in the model building produces simpler and more stable models for prediction of Parkinson's disease from gene expression data. Copyright (c) 2012 John Wiley & Sons, Ltd.
引用
收藏
页码:4369 / 4381
页数:13
相关论文
共 31 条
[21]  
2-Z
[22]   Canonical partial least squares-a unified PLS approach to classification and regression problems [J].
Indahl, Ulf G. ;
Liland, Kristian Hovde ;
Naes, Tormod .
JOURNAL OF CHEMOMETRICS, 2009, 23 (9-10) :495-504
[23]   A comparison of methods for analysing regression models with both spectral and designed variables [J].
Jorgensen, K ;
Segtnan, V ;
Thyholt, K ;
Næs, T .
JOURNAL OF CHEMOMETRICS, 2004, 18 (10) :451-464
[24]   How high is the level of technical noise in microarray data? [J].
Klebanov, Lev ;
Yakovlev, Andrei .
BIOLOGY DIRECT, 2007, 2 (1)
[25]   ST-PLS:: a multi-directional nearest shrunken centroid type classi er via PLS [J].
Saebo, S. ;
Almoy, T. ;
Aaroe, J. ;
Aastveit, A. H. .
JOURNAL OF CHEMOMETRICS, 2008, 22 (1-2) :54-62
[26]   The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements [J].
Shi, Leming ;
Reid, Laura H. ;
Jones, Wendell D. ;
Shippy, Richard ;
Warrington, Janet A. ;
Baker, Shawn C. ;
Collins, Patrick J. ;
de Longueville, Francoise ;
Kawasaki, Ernest S. ;
Lee, Kathleen Y. ;
Luo, Yuling ;
Sun, Yongming Andrew ;
Willey, James C. ;
Setterquist, Robert A. ;
Fischer, Gavin M. ;
Tong, Weida ;
Dragan, Yvonne P. ;
Dix, David J. ;
Frueh, Felix W. ;
Goodsaid, Federico M. ;
Herman, Damir ;
Jensen, Roderick V. ;
Johnson, Charles D. ;
Lobenhofer, Edward K. ;
Puri, Raj K. ;
Scherf, Uwe ;
Thierry-Mieg, Jean ;
Wang, Charles ;
Wilson, Mike ;
Wolber, Paul K. ;
Zhang, Lu ;
Amur, Shashi ;
Bao, Wenjun ;
Barbacioru, Catalin C. ;
Lucas, Anne Bergstrom ;
Bertholet, Vincent ;
Boysen, Cecilie ;
Bromley, Bud ;
Brown, Donna ;
Brunner, Alan ;
Canales, Roger ;
Cao, Xiaoxi Megan ;
Cebula, Thomas A. ;
Chen, James J. ;
Cheng, Jing ;
Chu, Tzu-Ming ;
Chudin, Eugene ;
Corson, John ;
Corton, J. Christopher ;
Croner, Lisa J. .
NATURE BIOTECHNOLOGY, 2006, 24 (09) :1151-1161
[27]   Diagnosis of multiple cancer types by shrunken centroids of gene expression [J].
Tibshirani, R ;
Hastie, T ;
Narasimhan, B ;
Chu, G .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) :6567-6572
[28]   Incidence of Parkinson's disease: Variation by age, gender, and Race/Ethnicity [J].
Van Den Eeden, SK ;
Tanner, CM ;
Bernstein, AL ;
Fross, RD ;
Leimpeter, A ;
Bloch, DA ;
Nelson, LM .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2003, 157 (11) :1015-1022
[29]  
Wen Z, 2011, MICROARRAY QUALITY C
[30]   A randomization test for PLS component selection [J].
Wiklund, Susanne ;
Nilsson, David ;
Eriksson, Lennart ;
Sjostrom, Michael ;
Wold, Svante ;
Faber, Klaas .
JOURNAL OF CHEMOMETRICS, 2007, 21 (10-11) :427-439