Integrative Analysis of High-throughput Cancer Studies With Contrasted Penalization

被引:23
作者
Shi, Xingjie [1 ,2 ]
Liu, Jin [3 ]
Huang, Jian [4 ,5 ]
Zhou, Yong [2 ]
Shia, BenChang [6 ]
Ma, Shuangge [1 ,7 ]
机构
[1] Yale Univ, Sch Publ Hlth, Dept Biostat, New Haven, CT USA
[2] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai, Peoples R China
[3] UIC Sch Publ Hlth, Div Epidemiol & Biostat, Chicago, IL USA
[4] Univ Iowa, Dept Stat & Actuarial Sci, Iowa City, IA 52242 USA
[5] Univ Iowa, Dept Biostat, Iowa City, IA USA
[6] FuJen Catholic Univ, Dept Stat & Informat Sci, New Taipei City, Taiwan
[7] VA Cooperat Studies Program Coordinating Ctr, West Haven, CT USA
关键词
integrative analysis; contrasted penalization; marker selection; high-throughput cancer studies; PROGNOSIS; IDENTIFICATION; CONVERGENCE; MARKERS;
D O I
10.1002/gepi.21781
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
In cancer studies with high-throughput genetic and genomic measurements, integrative analysis provides a way to effectively pool and analyze heterogeneous raw data from multiple independent studies and outperforms classic meta-analysis and single-dataset analysis. When marker selection is of interest, the genetic basis of multiple datasets can be described using the homogeneity model or the heterogeneity model. In this study, we consider marker selection under the heterogeneity model, which includes the homogeneity model as a special case and can be more flexible. Penalization methods have been developed in the literature for marker selection. This study advances from the published ones by introducing the contrast penalties, which can accommodate the within- and across-dataset structures of covariates/regression coefficients and, by doing so, further improve marker selection performance. Specifically, we develop a penalization method that accommodates the across-dataset structures by smoothing over regression coefficients. An effective iterative algorithm, which calls an inner coordinate descent iteration, is developed. Simulation shows that the proposed method outperforms the benchmark with more accurate marker identification. The analysis of breast cancer and lung cancer prognosis studies with gene expression measurements shows that the proposed method identifies genes different from those using the benchmark and has better prediction performance.
引用
收藏
页码:144 / 151
页数:8
相关论文
共 50 条
[41]   High-throughput assessment of the antibody profile in ovarian cancer ascitic fluids [J].
Antony, Frank ;
Deantonio, Cecilia ;
Cotella, Diego ;
Soluri, Maria Felicia ;
Tarasiuk, Olga ;
Raspagliesi, Francesco ;
Adorni, Fulvio ;
Piazza, Silvano ;
Ciani, Yari ;
Santoro, Claudio ;
Macor, Paolo ;
Mezzanzanica, Delia ;
Sblattero, Daniele .
ONCOIMMUNOLOGY, 2019, 8 (09)
[42]   The Pursuit of Therapeutic Biomarkers with High-Throughput Cancer Cell Drug Screens [J].
Williams, Steven P. ;
McDermott, Ultan .
CELL CHEMICAL BIOLOGY, 2017, 24 (09) :1066-1074
[43]   CRISPR/Cas9 high-throughput screening in cancer research [J].
Liu, Zhuoxin .
2020 INTERNATIONAL CONFERENCE ON ENERGY, ENVIRONMENT AND BIOENGINEERING (ICEEB 2020), 2020, 185
[44]   Combined flow cytometry and high-throughput image analysis for the study of essential genes in Caenorhabditis elegans [J].
Hernando-Rodriguez, Blanca ;
Paul Erinjeri, Annmary ;
Jesus Rodriguez-Palero, Maria ;
Millar, Val ;
Gonzalez-Hernandez, Sara ;
Olmedo, Maria ;
Schulze, Bettina ;
Baumeister, Ralf ;
Munoz, Manuel J. ;
Askjaer, Peter ;
Artal-Sanz, Marta .
BMC BIOLOGY, 2018, 16
[45]   Differential gene expression in disease: a comparison between high-throughput studies and the literature [J].
Rodriguez-Esteban, Raul ;
Jiang, Xiaoyu .
BMC MEDICAL GENOMICS, 2017, 10
[46]   High-throughput flow cytometry for drug discovery: principles, applications, and case studies [J].
Ding, Mei ;
Kaspersson, Karin ;
Murray, David ;
Bardelle, Catherine .
DRUG DISCOVERY TODAY, 2017, 22 (12) :1844-1850
[47]   A high-throughput ChIP-Seq for large-scale chromatin studies [J].
Chabbert, Christophe D. ;
Adjalley, Sophie H. ;
Klaus, Bernd ;
Fritsch, Emilie S. ;
Gupta, Ishaan ;
Pelechano, Vicent ;
Steinmetz, Lars M. .
MOLECULAR SYSTEMS BIOLOGY, 2015, 11 (01)
[48]   Integrative Analysis of Prognosis Data on Multiple Cancer Subtypes [J].
Liu, Jin ;
Huang, Jian ;
Zhang, Yawei ;
Lan, Qing ;
Rothman, Nathaniel ;
Zheng, Tongzhang ;
Ma, Shuangge .
BIOMETRICS, 2014, 70 (03) :480-488
[49]   miRanalyzer: an update on the detection and analysis of microRNAs in high-throughput sequencing experiments [J].
Hackenberg, Michael ;
Rodriguez-Ezpeleta, Naiara ;
Aransay, Ana M. .
NUCLEIC ACIDS RESEARCH, 2011, 39 :W132-W138
[50]   TOPPAS: A Graphical Workflow Editor for the Analysis of High-Throughput Proteomics Data [J].
Junker, Johannes ;
Bielow, Chris ;
Bertsch, Andreas ;
Sturm, Marc ;
Reinert, Knut ;
Kohlbachert, Oliver .
JOURNAL OF PROTEOME RESEARCH, 2012, 11 (07) :3914-3920