HDMAC: A Web-Based Interactive Program for High-Dimensional Analysis of Molecular Alterations in Cancer

被引:2
作者
Chang, Chung [1 ]
Sung, Chan-Yu [1 ]
Hsiao, Han [1 ]
Chen, Jiabin [2 ]
Chen, I-Hsuan [3 ,4 ,5 ]
Kuo, Wei-Ting [3 ]
Cheng, Lung-Feng [3 ]
Korla, Praveen Kumar [2 ]
Chung, Ming-Jhe [1 ]
Wu, Pei-Jhen [1 ]
Yu, Chia-Cheng [3 ,4 ,5 ,6 ]
Sheu, Jim Jinn-Chyuan [2 ,7 ,8 ,9 ]
机构
[1] Natl Sun Yat Sen Univ, Dept Appl Math, Kaohsiung, Taiwan
[2] Natl Sun Yat Sen Univ, Inst Biomed Sci, Kaohsiung, Taiwan
[3] Kaohsiung Vet Gen Hosp, Dept Surg, Div Transplant Surg Urol, Kaohsiung 81362, Taiwan
[4] Tajen Univ, Coll Pharm & Hlth Care, Dept Pharm, Yanpu Township 90741, Pingtung County, Taiwan
[5] Natl Yang Ming Univ, Sch Med, Taipei 112, Taiwan
[6] Natl Def Med Ctr, Triserv Gen Hosp, Dept Surg, Div Transplant Surg Urol, Taipei 114, Taiwan
[7] Kaohsiung Med Univ, Dept Biotechnol, Kaohsiung 80708, Taiwan
[8] China Med Univ, Sch Chinese Med, Taichung 40402, Taiwan
[9] Asia Univ, Dept Hlth & Nutr Biotechnol, Taichung 41354, Taiwan
关键词
PENALIZED LOGISTIC-REGRESSION; FALSE DISCOVERY RATE; ADAPTIVE LASSO; OVARIAN-CANCER; GENE-EXPRESSION; BLADDER; BREAST; ASSOCIATION; SELECTION; SLC14A1;
D O I
10.1038/s41598-020-60791-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recent advances in high-throughput genomic technologies have nurtured a growing demand for statistical tools to facilitate identification of molecular changes as potential prognostic biomarkers or drugable targets for personalized precision medicine. In this study, we developed a web-based interactive and user-friendly platform for high-dimensional analysis of molecular alterations in cancer (HDMAC) (https://ripsung26.shinyapps.io/rshiny/). On HDMAC, several penalized regression models that are suitable for high-dimensional data analysis, Ridge, Lasso and adaptive Lasso, are offered, with Cox regression for survival and logistic regression for binary outcomes. Choice of a first-step screening is provided to address the multiple-comparison issue that often arises with large-volume genomic data. Hazard ratio or estimated coefficient is provided with each selected gene so that a multivariate regression model may be built based on the genes selected. Cross validation is provided as the method to estimate the prediction power of each regression model. In addition, R codes are also provided to facilitate download of whole sets of molecular variables from TCGA. In this study, illustration of the use of HDMAC was made through a set of data on gene mutations and a set on mRNA expression from ovarian cancer patients and a set on mRNA expression from bladder cancer patient. From the analysis of each set of data, a list of candidate genes was obtained that might be associated with mutations or abnormal expression of genes in ovarian and bladder cancers. HDMAC offers a solution for rigorous and validation analysis of high-dimensional genomic data.
引用
收藏
页数:10
相关论文
共 49 条
  • [1] Penalized logistic regression with the adaptive LASSO for gene selection in high-dimensional cancer classification
    Algamal, Zakariya Yahya
    Lee, Muhammad Hisyam
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (23) : 9326 - 9332
  • [2] [Anonymous], 2015, RStudio: Integrated development for R
  • [3] [Anonymous], 2010, R LANG ENV STAT COMP
  • [4] [Anonymous], 2019, ANN TRANSL MED
  • [5] [Anonymous], 2018, SHINY V1 2 0
  • [6] Bladder Cancer Incidence and Mortality: A Global Overview and Recent Trends
    Antoni, Sebastien
    Ferlay, Jacques
    Soerjomataram, Isabelle
    Znaor, Ariana
    Jemal, Ahmedin
    Bray, Freddie
    [J]. EUROPEAN UROLOGY, 2017, 71 (01) : 96 - 108
  • [7] The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans
    Ardlie, Kristin G.
    DeLuca, David S.
    Segre, Ayellet V.
    Sullivan, Timothy J.
    Young, Taylor R.
    Gelfand, Ellen T.
    Trowbridge, Casandra A.
    Maller, Julian B.
    Tukiainen, Taru
    Lek, Monkol
    Ward, Lucas D.
    Kheradpour, Pouya
    Iriarte, Benjamin
    Meng, Yan
    Palmer, Cameron D.
    Esko, Tonu
    Winckler, Wendy
    Hirschhorn, Joel N.
    Kellis, Manolis
    MacArthur, Daniel G.
    Getz, Gad
    Shabalin, Andrey A.
    Li, Gen
    Zhou, Yi-Hui
    Nobel, Andrew B.
    Rusyn, Ivan
    Wright, Fred A.
    Lappalainen, Tuuli
    Ferreira, Pedro G.
    Ongen, Halit
    Rivas, Manuel A.
    Battle, Alexis
    Mostafavi, Sara
    Monlong, Jean
    Sammeth, Michael
    Mele, Marta
    Reverter, Ferran
    Goldmann, Jakob M.
    Koller, Daphne
    Guigo, Roderic
    McCarthy, Mark I.
    Dermitzakis, Emmanouil T.
    Gamazon, Eric R.
    Im, Hae Kyung
    Konkashbaev, Anuar
    Nicolae, Dan L.
    Cox, Nancy J.
    Flutre, Timothee
    Wen, Xiaoquan
    Stephens, Matthew
    [J]. SCIENCE, 2015, 348 (6235) : 648 - 660
  • [8] Bladder cancer: ESMO Practice Guidelines for diagnosis, treatment and follow-up
    Bellmunt, J.
    Orsola, A.
    Leow, J. J.
    Wiegel, T.
    De Santis, M.
    Horwich, A.
    [J]. ANNALS OF ONCOLOGY, 2014, 25 : 40 - 48
  • [9] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
    BENJAMINI, Y
    HOCHBERG, Y
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300
  • [10] Bucholtz M, 2016, SOCIOLINGUISTIC RESEARCH: APPLICATION AND IMPACT, P25