RNA-Seq Accurately Identifies Cancer Biomarker Signatures to Distinguish Tissue of Origin

被引:32
作者
Wei, Iris H. [1 ]
Shi, Yang [2 ]
Jiang, Hui [2 ]
Kumar-Sinha, Chandan [3 ,4 ]
Chinnaiyan, Arul M. [3 ,4 ,5 ,6 ,7 ]
机构
[1] Univ Michigan, Dept Surg, Sch Med, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Biostat, Sch Med, Ann Arbor, MI 48109 USA
[3] Univ Michigan, Michigan Ctr Translat Pathol, Sch Med, Ann Arbor, MI 48109 USA
[4] Univ Michigan, Dept Pathol, Sch Med, Ann Arbor, MI 48109 USA
[5] Univ Michigan, Ctr Comprehens Canc, Sch Med, Ann Arbor, MI 48109 USA
[6] Univ Michigan, Dept Urol, Sch Med, Ann Arbor, MI 48109 USA
[7] Univ Michigan, Howard Hughes Med Inst, Sch Med, Ann Arbor, MI 48109 USA
来源
NEOPLASIA | 2014年 / 16卷 / 11期
关键词
UNKNOWN PRIMARY SITE; HUMAN PROSTATE; PANCREATIC-CANCER; MOLECULAR CLASSIFICATION; EXPRESSION PROFILES; ASPARTIC PROTEINASE; METASTATIC CANCER; TUMOR-SUPPRESSOR; GENE; CARCINOMA;
D O I
10.1016/j.neo.2014.09.007
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Metastatic cancer of unknown primary (CUP) accounts for up to 5% of all new cancer cases, with a 5-year survival rate of only 10%. Accurate identification of tissue of origin would allow for directed, personalized therapies to improve clinical outcomes. Our objective was to use transcriptome sequencing (RNA-Seq) to identify lineage-specific biomarker signatures for the cancer types that most commonly metastasize as CUP (colorectum, kidney, liver, lung, ovary, pancreas, prostate, and stomach). RNA-Seq data of 17,471 transcripts from a total of 3,244 cancer samples across 26 different tissue types were compiled from in-house sequencing data and publically available International Cancer Genome Consortium and The Cancer Genome Atlas datasets. Robust cancer biomarker signatures were extracted using a 10-fold cross-validation method of log transformation, quantile normalization, transcript ranking by area under the receiver operating characteristic curve, and stepwise logistic regression. The entire algorithm was then repeated with a new set of randomly generated training and test sets, yielding highly concordant biomarker signatures. External validation of the cancer-specific signatures yielded high sensitivity (92.0% +/- 3.15%; mean +/- standard deviation) and specificity (97.7% +/- 2.99%) for each cancer biomarker signature. The overall performance of this RNA-Seq biomarker-generating algorithm yielded an accuracy of 90.5%. In conclusion, we demonstrate a computational model for producing highly sensitive and specific cancer biomarker signatures from RNA-Seq data, generating signatures for the top eight cancer types responsible for CUP to accurately identify tumor origin.
引用
收藏
页码:918 / 927
页数:10
相关论文
共 96 条
  • [1] ANALYSIS OF A DIAGNOSTIC STRATEGY FOR PATIENTS WITH SUSPECTED TUMORS OF UNKNOWN ORIGIN
    ABBRUZZESE, JL
    ABBRUZZESE, MC
    LENZI, R
    HESS, KR
    RABER, MN
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 1995, 13 (08) : 2094 - 2103
  • [2] NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION
    AKAIKE, H
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) : 716 - 723
  • [3] MOST HUMAN CARCINOMAS OF THE EXOCRINE PANCREAS CONTAIN MUTANT C-K-RAS GENES
    ALMOGUERA, C
    SHIBATA, D
    FORRESTER, K
    MARTIN, J
    ARNHEIM, N
    PERUCHO, M
    [J]. CELL, 1988, 53 (04) : 549 - 554
  • [4] [Anonymous], 1989, GEN LINEAR MODEL
  • [5] [Anonymous], 2020, NAT COMPR CAR NETW C
  • [6] ATKIN NB, 1982, LANCET, V2, P1349
  • [7] Roles for Nkx3.1 in prostate development and cancer
    Bhatia-Gaur, R
    Donjacour, AA
    Sciavolino, PJ
    Kim, M
    Desai, N
    Young, P
    Norton, CR
    Gridley, T
    Cardiff, RD
    Cunha, GR
    Abate-Shen, C
    Shen, MM
    [J]. GENES & DEVELOPMENT, 1999, 13 (08) : 966 - 977
  • [8] Cancer of unknown primary:: clinicopathologic correlations
    Blaszyk, H
    Hartmann, A
    Björnsson, J
    [J]. APMIS, 2003, 111 (12) : 1089 - 1094
  • [9] Multi-platform, multi-site, microarray-based human tumor classification
    Bloom, G
    Yang, IV
    Boulware, D
    Kwong, KY
    Coppola, D
    Eschrich, S
    Quackenbush, J
    Yeatman, TJ
    [J]. AMERICAN JOURNAL OF PATHOLOGY, 2004, 164 (01) : 9 - 16
  • [10] Epidermal growth factor-induced hepatocellular carcinoma: gene expression profiles in precursor lesions, early stage and solitary tumours
    Borlak, J
    Meier, T
    Halter, R
    Spanel, R
    Spanel-Borowski, K
    [J]. ONCOGENE, 2005, 24 (11) : 1809 - 1819