Transcript-targeted analysis reveals isoform alterations and double-hop fusions in breast cancer

被引:15
|
作者
Namba, Shinichi [1 ,8 ]
Ueno, Toshihide [1 ]
Kojima, Shinya [1 ]
Kobayashi, Kenya [2 ]
Kawase, Katsushige [9 ]
Tanaka, Yosuke [1 ]
Inoue, Satoshi [1 ]
Kishigami, Fumishi [1 ]
Kawashima, Shusuke [9 ]
Maeda, Noriko [3 ]
Ogawa, Tomoko [4 ]
Hazama, Shoichi [5 ]
Togashi, Yosuke [9 ]
Ando, Mizuo [6 ]
Shiraishi, Yuichi [7 ]
Mano, Hiroyuki [1 ]
Kawazu, Masahito [1 ,9 ]
机构
[1] Natl Canc Ctr, Div Cellular Signaling, Tokyo 1040045, Japan
[2] Natl Canc Ctr, Dept Head & Neck Oncol, Tokyo 1040045, Japan
[3] Yamaguchi Univ, Dept Gastroenterol Breast & Endocrine Surg, Grad Sch Med, Yamaguchi 7558505, Japan
[4] Mie Univ Hosp, Dept Breast Surg, Tsu, Mie 5148507, Japan
[5] Yamaguchi Univ, Dept Translat Res & Dev Therapeut Canc, Grad Sch Med, Yamaguchi 7558505, Japan
[6] Univ Tokyo Hosp, Dept Otolaryngol Head & Neck Surg, Tokyo 1138654, Japan
[7] Natl Canc Ctr, Div Genome Anal Platform Dev, Tokyo 1040045, Japan
[8] Osaka Univ, Dept Stat Genet, Grad Sch Med, Suita, Osaka 5650871, Japan
[9] Chiba Canc Ctr, Res Inst, Div Cell Therapy, Chiba 2608717, Japan
基金
日本学术振兴会;
关键词
EML4-ALK FUSION; GENE; RNA; IDENTIFICATION; QUANTIFICATION; PRLZ;
D O I
10.1038/s42003-021-02833-4
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Namba et al develop a new pipeline called MuSTA to enable the efficient assembly of transcriptome from long-read sequencing data of breast cancer samples. This method enables the authors to discover subtype-specific isoforms, find that fusion transcript structures depend on their genomic context and identify a double-hop fusion that results in aberrant expression of an endogenous retroviral gene. Although transcriptome alteration is an essential driver of carcinogenesis, the effects of chromosomal structural alterations on the cancer transcriptome are not yet fully understood. Short-read transcript sequencing has prevented researchers from directly exploring full-length transcripts, forcing them to focus on individual splice sites. Here, we develop a pipeline for Multi-Sample long-read Transcriptome Assembly (MuSTA), which enables construction of a transcriptome from long-read sequence data. Using the constructed transcriptome as a reference, we analyze RNA extracted from 22 clinical breast cancer specimens. We identify a comprehensive set of subtype-specific and differentially used isoforms, which extended our knowledge of isoform regulation to unannotated isoforms including a short form TNS3. We also find that the exon-intron structure of fusion transcripts depends on their genomic context, and we identify double-hop fusion transcripts that are transcribed from complex structural rearrangements. For example, a double-hop fusion results in aberrant expression of an endogenous retroviral gene, ERVFRD-1, which is normally expressed exclusively in placenta and is thought to protect fetus from maternal rejection; expression is elevated in several TCGA samples with ERVFRD-1 fusions. Our analyses provide direct evidence that full-length transcript sequencing of clinical samples can add to our understanding of cancer biology and genomics in general.
引用
收藏
页数:16
相关论文
共 1 条
  • [1] Single-Cell NGS-Based Analysis of Copy Number Alterations Reveals New Insights in Circulating Tumor Cells Persistence in Early-Stage Breast Cancer
    Rossi, Tania
    Gallerani, Giulia
    Angeli, Davide
    Cocchi, Claudia
    Bandini, Erika
    Fici, Pietro
    Gaudio, Michele
    Martinelli, Giovanni
    Rocca, Andrea
    Maltoni, Roberta
    Fabbri, Francesco
    CANCERS, 2020, 12 (09) : 1 - 14