SOAPfuse: an algorithm for identifying fusion transcripts from paired-end RNA-Seq data

被引:136
作者
Jia, Wenlong [1 ,2 ]
Qiu, Kunlong [1 ,2 ]
He, Minghui [1 ,2 ]
Song, Pengfei [2 ]
Zhou, Quan [1 ,2 ,3 ]
Zhou, Feng [2 ,4 ]
Yu, Yuan [2 ]
Zhu, Dandan [2 ]
Nickerson, Michael L. [5 ]
Wan, Shengqing [1 ,2 ]
Liao, Xiangke [6 ]
Zhu, Xiaoqian [6 ,7 ]
Peng, Shaoliang [6 ,7 ]
Li, Yingrui [1 ,2 ]
Wang, Jun [1 ,2 ,8 ,9 ]
Guo, Guangwu [1 ,2 ]
机构
[1] BGI Tech Solut Co Ltd, Shenzhen 518083, Peoples R China
[2] BGI Shenzhen, Shenzhen 518083, Peoples R China
[3] Univ Elect Sci & Technol China, Sch Life Sci & Technol, Chengdu 610054, Peoples R China
[4] S China Univ Technol, Guangzhou Higher Educ Mega Ctr, Sch Biosci & Bioengn, Guangzhou 510006, Guangdong, Peoples R China
[5] NCI, Canc & Inflammat Program, NIH, Frederick, MD 21702 USA
[6] Natl Univ Def Technol, Sch Comp Sci, Changsha 410073, Hunan, Peoples R China
[7] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha 410073, Hunan, Peoples R China
[8] Univ Copenhagen, Novo Nordisk Fdn Ctr Basic Metab Res, DK-1165 Copenhagen, Denmark
[9] Univ Copenhagen, Dept Biol, DK-1165 Copenhagen, Denmark
基金
国家高技术研究发展计划(863计划);
关键词
GENE FUSIONS; BREAST-CANCER; IDENTIFICATION; ULTRAFAST; DISCOVERY; ALIGNMENT; TOOL;
D O I
10.1186/gb-2013-14-2-r12
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
We have developed a new method, SOAPfuse, to identify fusion transcripts from paired-end RNA-Seq data. SOAPfuse applies an improved partial exhaustion algorithm to construct a library of fusion junction sequences, which can be used to efficiently identify fusion events, and employs a series of filters to nominate high-confidence fusion transcripts. Compared with other released tools, SOAPfuse achieves higher detection efficiency and consumed less computing resources. We applied SOAPfuse to RNA-Seq data from two bladder cancer cell lines, and confirmed 15 fusion transcripts, including several novel events common to both cell lines. SOAPfuse is available at http://soap.genomics.org.cn/soapfuse.html.
引用
收藏
页数:15
相关论文
共 37 条
[1]   A novel bioinformatics pipeline for identification and characterization of fusion transcripts in breast cancer and normal cell lines [J].
Asmann, Yan W. ;
Hossain, Asif ;
Necela, Brian M. ;
Middha, Sumit ;
Kalari, Krishna R. ;
Sun, Zhifu ;
Chai, High-Seng ;
Williamson, David W. ;
Radisky, Derek ;
Schroth, Gary P. ;
Kocher, Jean-Pierre A. ;
Perez, Edith A. ;
Thompson, E. Aubrey .
NUCLEIC ACIDS RESEARCH, 2011, 39 (15) :e100
[2]   Genomic sequencing of colorectal adenocarcinomas identifies a recurrent VTI1A-TCF7L2 fusion [J].
Bass, Adam J. ;
Lawrence, Michael S. ;
Brace, Lear E. ;
Ramos, Alex H. ;
Drier, Yotam ;
Cibulskis, Kristian ;
Sougnez, Carrie ;
Voet, Douglas ;
Saksena, Gordon ;
Sivachenko, Andrey ;
Jing, Rui ;
Parkin, Melissa ;
Pugh, Trevor ;
Verhaak, Roel G. ;
Stransky, Nicolas ;
Boutin, Adam T. ;
Barretina, Jordi ;
Solit, David B. ;
Vakiani, Evi ;
Shao, Wenlin ;
Mishina, Yuji ;
Warmuth, Markus ;
Jimenez, Jose ;
Chiang, Derek Y. ;
Signoretti, Sabina ;
Kaelin, William G., Jr. ;
Spardy, Nicole ;
Hahn, William C. ;
Hoshida, Yujin ;
Ogino, Shuji ;
DePinho, Ronald A. ;
Chin, Lynda ;
Garraway, Levi A. ;
Fuchs, Charles S. ;
Baselga, Jose ;
Tabernero, Josep ;
Gabriel, Stacey ;
Lander, Eric S. ;
Getz, Gad ;
Meyerson, Matthew .
NATURE GENETICS, 2011, 43 (10) :964-U67
[3]   Integrative analysis of the melanoma transcriptome [J].
Berger, Michael F. ;
Levin, Joshua Z. ;
Vijayendran, Krishna ;
Sivachenko, Andrey ;
Adiconis, Xian ;
Maguire, Jared ;
Johnson, Laura A. ;
Robinson, James ;
Verhaak, Roel G. ;
Sougnez, Carrie ;
Onofrio, Robert C. ;
Ziaugra, Liuda ;
Cibulskis, Kristian ;
Laine, Elisabeth ;
Barretina, Jordi ;
Winckler, Wendy ;
Fisher, David E. ;
Getz, Gad ;
Meyerson, Matthew ;
Jaffe, David B. ;
Gabriel, Stacey B. ;
Lander, Eric S. ;
Dummer, Reinhard ;
Gnirke, Andreas ;
Nusbaum, Chad ;
Garraway, Levi A. .
GENOME RESEARCH, 2010, 20 (04) :413-427
[4]   Identification of fusion genes in breast cancer by paired-end RNA-sequencing [J].
Edgren, Henrik ;
Murumagi, Astrid ;
Kangaspeska, Sara ;
Nicorici, Daniel ;
Hongisto, Vesa ;
Kleivi, Kristine ;
Rye, Inga H. ;
Nyberg, Sandra ;
Wolf, Maija ;
Borresen-Dale, Anne-Lise ;
Kallioniemi, Olli .
GENOME BIOLOGY, 2011, 12 (01)
[5]   Genome-wide mapping of alternative splicing in Arabidopsis thaliana [J].
Filichkin, Sergei A. ;
Priest, Henry D. ;
Givan, Scott A. ;
Shen, Rongkun ;
Bryant, Douglas W. ;
Fox, Samuel E. ;
Wong, Weng-Keen ;
Mockler, Todd C. .
GENOME RESEARCH, 2010, 20 (01) :45-58
[6]   Ensembl 2011 [J].
Flicek, Paul ;
Amode, M. Ridwan ;
Barrell, Daniel ;
Beal, Kathryn ;
Brent, Simon ;
Chen, Yuan ;
Clapham, Peter ;
Coates, Guy ;
Fairley, Susan ;
Fitzgerald, Stephen ;
Gordon, Leo ;
Hendrix, Maurice ;
Hourlier, Thibaut ;
Johnson, Nathan ;
Kaehaeri, Andreas ;
Keefe, Damian ;
Keenan, Stephen ;
Kinsella, Rhoda ;
Kokocinski, Felix ;
Kulesha, Eugene ;
Larsson, Pontus ;
Longden, Ian ;
McLaren, William ;
Overduin, Bert ;
Pritchard, Bethan ;
Riat, Harpreet Singh ;
Rios, Daniel ;
Ritchie, Graham R. S. ;
Ruffier, Magali ;
Schuster, Michael ;
Sobral, Daniel ;
Spudich, Giulietta ;
Tang, Y. Amy ;
Trevanion, Stephen ;
Vandrovcova, Jana ;
Vilella, Albert J. ;
White, Simon ;
Wilder, Steven P. ;
Zadissa, Amonida ;
Zamora, Jorge ;
Aken, Bronwen L. ;
Birney, Ewan ;
Cunningham, Fiona ;
Dunham, Ian ;
Durbin, Richard ;
Fernandez-Suarez, Xose M. ;
Herrero, Javier ;
Hubbard, Tim J. P. ;
Parker, Anne ;
Proctor, Glenn .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D800-D806
[7]   Chromosomal abnormalities in cancer [J].
Froehling, Stefan ;
Doehner, Hartmut .
NEW ENGLAND JOURNAL OF MEDICINE, 2008, 359 (07) :722-734
[8]   Differential DNA methylation in discrete developmental stages of the parasitic nematode Trichinella spiralis [J].
Gao, Fei ;
Liu, Xiaolei ;
Wu, Xiu-Ping ;
Wang, Xue-Lin ;
Gong, Desheng ;
Lu, Hanlin ;
Xia, Yudong ;
Song, Yanxia ;
Wang, Junwen ;
Du, Jing ;
Liu, Siyang ;
Han, Xu ;
Tang, Yizhi ;
Yang, Huanming ;
Jin, Qi ;
Zhang, Xiuqing ;
Liu, Mingyuan .
GENOME BIOLOGY, 2012, 13 (10) :R100
[9]   FusionMap: detecting fusion genes from next-generation sequencing data at base-pair resolution [J].
Ge, Huanying ;
Liu, Kejun ;
Juan, Todd ;
Fang, Fang ;
Newman, Matthew ;
Hoeck, Wolfgang .
BIOINFORMATICS, 2011, 27 (14) :1922-1928
[10]   Massively parallel sequencing of the polyadenylated transcriptome of C. elegans [J].
Hillier, LaDeana W. ;
Reinke, Valerie ;
Green, Philip ;
Hirst, Martin ;
Marra, Marco A. ;
Waterston, Robert H. .
GENOME RESEARCH, 2009, 19 (04) :657-666