Enabling population assignment from cancer genomes with SNP2pop

被引:3
作者
Huang, Qingyao [1 ,2 ]
Baudis, Michael [1 ,2 ]
机构
[1] Univ Zurich, Inst Mol Life Sci, Winterthurerstr 190, CH-8057 Zurich, Switzerland
[2] Swiss Inst Bioinformat, Winterthurerstr 190, CH-8057 Zurich, Switzerland
关键词
BREAST-CANCER; SUSCEPTIBILITY; ASSOCIATION; RISK; PREDISPOSITION; VARIANTS; GENES; LOCI;
D O I
10.1038/s41598-020-61854-x
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In many cancers, incidence, treatment efficacy and overall prognosis vary between geographic populations. Studies disentangling the contributing factors may help in both understanding cancer biology and tailoring therapeutic interventions. Ancestry estimation in such studies should preferably be driven by genomic data, due to frequently missing or erroneous self-reported or inferred metadata. While respective algorithms have been demonstrated for baseline genomes, such a strategy has not been shown for cancer genomes carrying a substantial somatic mutation load. We have developed a bioinformatics tool for the assignment of population groups from genome profiling data for both unaltered and cancer genomes. Despite extensive somatic mutations in the cancer genomes, consistency between germline and cancer data reached of 97% and 92% for assignment into 5 and 26 ancestral groups, respectively. Comparison with self-reported meta-data estimated a matching rate between 88-92%, mostly limited by interpretation of self-reported ethnicity labels compared to the standardized mapping output. Our SNP2pop application allows to assess population information from SNP arrays as well as sequencing platforms and to estimate the population structure in cancer genomics projects, to facilitate research into the interplay between ethnicity-related genetic background, environmental factors and somatic mutation patterns in cancer biology.
引用
收藏
页数:9
相关论文
共 35 条
  • [1] Relatedness Mapping and Tracts of Relatedness for Genome-Wide Data in the Presence of Linkage Disequilibrium
    Albrechtsen, Anders
    Korneliussen, Thorfinn Sand
    Moltke, Ida
    Hansen, Thomas van Overseem
    Nielsen, Finn Cilius
    Nielsen, Rasmus
    [J]. GENETIC EPIDEMIOLOGY, 2009, 33 (03) : 266 - 274
  • [2] Fast model-based estimation of ancestry in unrelated individuals
    Alexander, David H.
    Novembre, John
    Lange, Kenneth
    [J]. GENOME RESEARCH, 2009, 19 (09) : 1655 - 1664
  • [3] Signatures of mutational processes in human cancer
    Alexandrov, Ludmil B.
    Nik-Zainal, Serena
    Wedge, David C.
    Aparicio, Samuel A. J. R.
    Behjati, Sam
    Biankin, Andrew V.
    Bignell, Graham R.
    Bolli, Niccolo
    Borg, Ake
    Borresen-Dale, Anne-Lise
    Boyault, Sandrine
    Burkhardt, Birgit
    Butler, Adam P.
    Caldas, Carlos
    Davies, Helen R.
    Desmedt, Christine
    Eils, Roland
    Eyfjord, Jorunn Erla
    Foekens, John A.
    Greaves, Mel
    Hosoda, Fumie
    Hutter, Barbara
    Ilicic, Tomislav
    Imbeaud, Sandrine
    Imielinsk, Marcin
    Jaeger, Natalie
    Jones, David T. W.
    Jones, David
    Knappskog, Stian
    Kool, Marcel
    Lakhani, Sunil R.
    Lopez-Otin, Carlos
    Martin, Sancha
    Munshi, Nikhil C.
    Nakamura, Hiromi
    Northcott, Paul A.
    Pajic, Marina
    Papaemmanuil, Elli
    Paradiso, Angelo
    Pearson, John V.
    Puente, Xose S.
    Raine, Keiran
    Ramakrishna, Manasa
    Richardson, Andrea L.
    Richter, Julia
    Rosenstiel, Philip
    Schlesner, Matthias
    Schumacher, Ton N.
    Span, Paul N.
    Teague, Jon W.
    [J]. NATURE, 2013, 500 (7463) : 415 - +
  • [4] A global reference for human genetic variation
    Altshuler, David M.
    Durbin, Richard M.
    Abecasis, Goncalo R.
    Bentley, David R.
    Chakravarti, Aravinda
    Clark, Andrew G.
    Donnelly, Peter
    Eichler, Evan E.
    Flicek, Paul
    Gabriel, Stacey B.
    Gibbs, Richard A.
    Green, Eric D.
    Hurles, Matthew E.
    Knoppers, Bartha M.
    Korbel, Jan O.
    Lander, Eric S.
    Lee, Charles
    Lehrach, Hans
    Mardis, Elaine R.
    Marth, Gabor T.
    McVean, Gil A.
    Nickerson, Deborah A.
    Wang, Jun
    Wilson, Richard K.
    Boerwinkle, Eric
    Doddapaneni, Harsha
    Han, Yi
    Korchina, Viktoriya
    Kovar, Christie
    Lee, Sandra
    Muzny, Donna
    Reid, Jeffrey G.
    Zhu, Yiming
    Chang, Yuqi
    Feng, Qiang
    Fang, Xiaodong
    Guo, Xiaosen
    Jian, Min
    Jiang, Hui
    Jin, Xin
    Lan, Tianming
    Li, Guoqing
    Li, Jingxiang
    Li, Yingrui
    Liu, Shengmao
    Liu, Xiao
    Lu, Yao
    Ma, Xuedi
    Tang, Meifang
    Wang, Bo
    [J]. NATURE, 2015, 526 (7571) : 68 - +
  • [5] A common variant associated with prostate cancer in European and African populations
    Amundadottir, Laufey T.
    Sulem, Patrick
    Gudmundsson, Julius
    Helgason, Agnar
    Baker, Adam
    Agnarsson, Bjarni A.
    Sigurdsson, Asgeir
    Benediktsdottir, Kristrun R.
    Cazier, Jean-Baptiste
    Sainz, Jesus
    Jakobsdottir, Margret
    Kostic, Jelena
    Magnusdottir, Droplaug N.
    Ghosh, Shyamali
    Agnarsson, Kari
    Birgisdottir, Birgitta
    Le Roux, Louise
    Olafsdottir, Adalheidur
    Blondal, Thorarinn
    Andresdottir, Margret
    Gretarsdottir, Olafia Svandis
    Bergthorsson, Jon T.
    Gudbjartsson, Daniel
    Gylfason, Arnaldur
    Thorleifsson, Gudmar
    Manolescu, Andrei
    Kristjansson, Kristleifur
    Geirsson, Gudmundur
    Isaksson, Helgi
    Douglas, Julie
    Johansson, Jan-Erik
    Balter, Katarina
    Wiklund, Fredrik
    Montie, James E.
    Yu, Xiaoying
    Suarez, Brian K.
    Ober, Carole
    Cooney, Kathleen A.
    Gronberg, Henrik
    Catalona, William J.
    Einarsson, Gudmundur V.
    Barkardottir, Rosa B.
    Gulcher, Jeffrey R.
    Kong, Augustine
    Thorsteinsdottir, Unnur
    Stefansson, Kari
    [J]. NATURE GENETICS, 2006, 38 (06) : 652 - 658
  • [6] [Anonymous], 2013, Income, poverty, and health insurance coverage in the United States: 2012
  • [7] [Anonymous], 2009, GENETICS, DOI DOI 10.1111/J.1469-1809.2009.00517.X
  • [8] [Anonymous], 2013, Disparities in STEM Employment by Sex, Race, Hispanic Origin
  • [9] The Alcohol Flushing Response: An Unrecognized Risk Factor for Esophageal Cancer from Alcohol Consumption
    Brooks, Philip J.
    Enoch, Mary-Anne
    Goldman, David
    Li, Ting-Kai
    Yokoyama, Akira
    [J]. PLOS MEDICINE, 2009, 6 (03) : 0258 - 0263
  • [10] Progenetix: 12 years of oncogenomic data curation
    Cai, Haoyang
    Kumar, Nitin
    Ai, Ni
    Gupta, Saumya
    Rath, Prisni
    Baudis, Michael
    [J]. NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) : D1055 - D1062