GRIDSS2: comprehensive characterisation of somatic structural variation using single breakend variants and structural variant phasing

Cited by: 99
Authors
Cameron, Daniel L. [1, 2, 3]
Baber, Jonathan [3, 4]
Shale, Charles [3, 4]
Valle-Inclan, Jose Espejo [5, 6]
Besselink, Nicolle [5, 6]
van Hoeck, Arne [5, 6]
Janssen, Roel [5, 6]
Cuppen, Edwin [4, 5, 6]
Priestley, Peter [3, 4]
Papenfuss, Anthony T. [1, 2, 7, 8]
Affiliations
[1] Walter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic, Australia
[2] Univ Melbourne, Dept Med Biol, Melbourne, Vic, Australia
[3] Hartwig Med Fdn Australia, Sydney, NSW, Australia
[4] Hartwig Med Fdn, Sci Pk 408, Amsterdam, Netherlands
[5] Univ Med Ctr Utrecht, Ctr Mol Med, Heidelberglaan 100, Utrecht, Netherlands
[6] Univ Med Ctr Utrecht, Oncode Inst, Heidelberglaan 100, Utrecht, Netherlands
[7] Peter MacCallum Canc Ctr, Melbourne, Vic, Australia
[8] Univ Melbourne, Sir Peter MacCallum Dept Oncol, Melbourne, Vic, Australia
Funding
UK Medical Research Council; National Health and Medical Research Council of Australia;
Keywords
PAIRED-END; CANCER; ALIGNMENT; GENOMES; FORMAT;
DOI
10.1186/s13059-021-02423-x
Chinese Library Classification number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
GRIDSS2 is the first structural variant caller to explicitly report single breakends: breakpoints in which only one side can be unambiguously determined. By treating single breakends as a fundamental genomic rearrangement signal on par with breakpoints, GRIDSS2 can explain 47% of somatic centromere copy number changes using single breakends to non-centromere sequence. On a cohort of 3782 deeply sequenced metastatic cancers, GRIDSS2 achieves an unprecedented 3.1% false negative rate and 3.3% false discovery rate and identifies a novel 32-100 bp duplication signature. GRIDSS2 simplifies complex rearrangement interpretation through phasing of structural variants, with 16% of somatic calls phasable using paired-end sequencing.
Pages: 25