Integrative Analysis of High-throughput Cancer Studies With Contrasted Penalization

被引:22
|
作者
Shi, Xingjie [1 ,2 ]
Liu, Jin [3 ]
Huang, Jian [4 ,5 ]
Zhou, Yong [2 ]
Shia, BenChang [6 ]
Ma, Shuangge [1 ,7 ]
机构
[1] Yale Univ, Sch Publ Hlth, Dept Biostat, New Haven, CT USA
[2] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai, Peoples R China
[3] UIC Sch Publ Hlth, Div Epidemiol & Biostat, Chicago, IL USA
[4] Univ Iowa, Dept Stat & Actuarial Sci, Iowa City, IA 52242 USA
[5] Univ Iowa, Dept Biostat, Iowa City, IA USA
[6] FuJen Catholic Univ, Dept Stat & Informat Sci, New Taipei City, Taiwan
[7] VA Cooperat Studies Program Coordinating Ctr, West Haven, CT USA
关键词
integrative analysis; contrasted penalization; marker selection; high-throughput cancer studies; PROGNOSIS; IDENTIFICATION; CONVERGENCE; MARKERS;
D O I
10.1002/gepi.21781
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
In cancer studies with high-throughput genetic and genomic measurements, integrative analysis provides a way to effectively pool and analyze heterogeneous raw data from multiple independent studies and outperforms classic meta-analysis and single-dataset analysis. When marker selection is of interest, the genetic basis of multiple datasets can be described using the homogeneity model or the heterogeneity model. In this study, we consider marker selection under the heterogeneity model, which includes the homogeneity model as a special case and can be more flexible. Penalization methods have been developed in the literature for marker selection. This study advances from the published ones by introducing the contrast penalties, which can accommodate the within- and across-dataset structures of covariates/regression coefficients and, by doing so, further improve marker selection performance. Specifically, we develop a penalization method that accommodates the across-dataset structures by smoothing over regression coefficients. An effective iterative algorithm, which calls an inner coordinate descent iteration, is developed. Simulation shows that the proposed method outperforms the benchmark with more accurate marker identification. The analysis of breast cancer and lung cancer prognosis studies with gene expression measurements shows that the proposed method identifies genes different from those using the benchmark and has better prediction performance.
引用
收藏
页码:144 / 151
页数:8
相关论文
共 50 条
  • [1] Integrative Analysis of Cancer Diagnosis Studies with Composite Penalization
    Liu, Jin
    Ma, Shuangge
    Huang, Jian
    SCANDINAVIAN JOURNAL OF STATISTICS, 2014, 41 (01) : 87 - 103
  • [2] Integrative approaches for analysis of mRNA and microRNA high-throughput data
    V. Nazarov, Petr
    Kreis, Stephanie
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 1154 - 1162
  • [3] Promoting Similarity of Sparsity Structures in Integrative Analysis With Penalization
    Huang, Yuan
    Zhang, Qingzhao
    Zhang, Sanguo
    Huang, Jian
    Ma, Shuangge
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (517) : 342 - 350
  • [4] High-throughput tetrad analysis
    Ludlow, Catherine L.
    Scott, Adrian C.
    Cromie, Gareth A.
    Jeffery, Eric W.
    Sirr, Amy
    May, Patrick
    Lin, Jake
    Gilbert, Teresa L.
    Hays, Michelle
    Dudley, Aimee M.
    NATURE METHODS, 2013, 10 (07) : 671 - +
  • [5] High-Throughput Analysis of Ovarian Granulosa Cell Transcriptome
    Chronowska, Ewa
    BIOMED RESEARCH INTERNATIONAL, 2014, 2014
  • [6] High-throughput approaches for plant epigenomic studies
    Schmitz, Robert J.
    Zhang, Xiaoyu
    CURRENT OPINION IN PLANT BIOLOGY, 2011, 14 (02) : 130 - 136
  • [7] Integrative epigenomic and high-throughput functional enhancer profiling reveals determinants of enhancer heterogeneity in gastric cancer
    Sheng, Taotao
    Ho, Shamaine Wei Ting
    Ooi, Wen Fong
    Xu, Chang
    Xing, Manjie
    Padmanabhan, Nisha
    Huang, Kie Kyon
    Ma, Lijia
    Ray, Mohana
    Guo, Yu Amanda
    Sim, Ngak Leng
    Anene-Nzelu, Chukwuemeka George
    Chang, Mei Mei
    Razavi-Mohseni, Milad
    Beer, Michael A.
    Foo, Roger Sik Yin
    Sundar, Raghav
    Chan, Yiong Huak
    Tan, Angie Lay Keng
    Ong, Xuewen
    Skanderup, Anders Jacobsen
    White, Kevin P.
    Jha, Sudhakar
    Tan, Patrick
    GENOME MEDICINE, 2021, 13 (01)
  • [8] A comparative analysis of tissue gene expression data from high-throughput studies
    Ping Jie
    Wang YaJun
    Yu Yao
    Li YiXue
    Li Xuan
    Hao Pei
    CHINESE SCIENCE BULLETIN, 2012, 57 (22): : 2920 - 2927
  • [9] Integrative bioinformatics analysis of high-throughput sequencing and in vitro functional analysis leads to uncovering key hub genes in esophageal squamous cell carcinoma
    Shen, Feng
    Liu, Xing
    Ding, Fengjiao
    Yu, Zhonglin
    Shi, Xinyi
    Cheng, Lushan
    Zhang, Xuewei
    Jing, Chengbao
    Zhao, Zilong
    Cao, Hongyou
    Zhao, Bing
    Liu, Jing
    HEREDITAS, 2025, 162 (01):
  • [10] JUMP: replicability analysis of high-throughput experiments with applications to spatial transcriptomic studies
    Lyu, Pengfei
    Li, Yan
    Wen, Xiaoquan
    Cao, Hongyuan
    BIOINFORMATICS, 2023, 39 (06)