共 33 条
Muscle5: High-accuracy alignment ensembles enable unbiased assessments of sequence homology and phylogeny
被引:366
作者:

Edgar, Robert C.
论文数: 0 引用数: 0
h-index: 0
机构: Independent Researcher,
机构:
[1] Independent Researcher,
关键词:
UNCERTAINTY;
BENCHMARK;
ALGORITHM;
DATABASE;
TREE;
D O I:
10.1038/s41467-022-34630-w
中图分类号:
O [数理科学和化学];
P [天文学、地球科学];
Q [生物科学];
N [自然科学总论];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
Multiple sequence alignments are widely used to predict protein structure, function, and phylogeny, but are uncertain with more diverged sequences. Muscle5 generates ensembles of alternative high-accurate alignments, enabling novel confidence estimates in alignments, trees, and other inferences. Multiple sequence alignments are widely used to infer evolutionary relationships, enabling inferences of structure, function, and phylogeny. Standard practice is to construct one alignment by some preferred method and use it in further analysis; however, undetected alignment bias can be problematic. I describe Muscle5, a novel algorithm which constructs an ensemble of high-accuracy alignment with diverse biases by perturbing a hidden Markov model and permuting its guide tree. Confidence in an inference is assessed as the fraction of the ensemble which supports it. Applied to phylogenetic tree estimation, I show that ensembles can confidently resolve topologies with low bootstrap according to standard methods, and conversely that some topologies with high bootstraps are incorrect. Applied to the phylogeny of RNA viruses, ensemble analysis shows that recently adopted taxonomic phyla are probably polyphyletic. Ensemble analysis can improve confidence assessment in any inference from an alignment.
引用
收藏
页数:9
相关论文
共 33 条
[1]
Ribovirus classification by a polymerase barcode sequence
[J].
Babaian, Artem
;
Edgar, Robert
.
PEERJ,
2022, 10

Babaian, Artem
论文数: 0 引用数: 0
h-index: 0
机构:
St Edmunds Coll, Cambridge, England
Univ Cambridge, Dept Haematol, Cambridge, England St Edmunds Coll, Cambridge, England

Edgar, Robert
论文数: 0 引用数: 0
h-index: 0
机构: St Edmunds Coll, Cambridge, England
[2]
Incorporating alignment uncertainty into Felsenstein's phylogenetic bootstrap to improve its reliability
[J].
Chang, Jia-Ming
;
Floden, Evan W.
;
Herrero, Javier
;
Gascuel, Olivier
;
Di Tommaso, Paolo
;
Notredame, Cedric
.
BIOINFORMATICS,
2021, 37 (11)
:1506-1514

Chang, Jia-Ming
论文数: 0 引用数: 0
h-index: 0
机构:
European Bioinformat Inst, European Mol Biol Lab, Wellcome Genome Campus, Cambridge CB10 1SD, England
Natl Chengchi Univ, Dept Comp Sci, Taipei 11605, Taiwan European Bioinformat Inst, European Mol Biol Lab, Wellcome Genome Campus, Cambridge CB10 1SD, England

Floden, Evan W.
论文数: 0 引用数: 0
h-index: 0
机构:
Barcelona Inst Sci & Technol, Ctr Genom Regulat CRG, Barcelona 08003, Spain European Bioinformat Inst, European Mol Biol Lab, Wellcome Genome Campus, Cambridge CB10 1SD, England

论文数: 引用数:
h-index:
机构:

Gascuel, Olivier
论文数: 0 引用数: 0
h-index: 0
机构:
Inst Pasteur, Unite Bioinformat Evolut, C3BI USR 3756 IP CNRS, F-75015 Paris, France European Bioinformat Inst, European Mol Biol Lab, Wellcome Genome Campus, Cambridge CB10 1SD, England

Di Tommaso, Paolo
论文数: 0 引用数: 0
h-index: 0
机构:
Barcelona Inst Sci & Technol, Ctr Genom Regulat CRG, Barcelona 08003, Spain European Bioinformat Inst, European Mol Biol Lab, Wellcome Genome Campus, Cambridge CB10 1SD, England

Notredame, Cedric
论文数: 0 引用数: 0
h-index: 0
机构:
Barcelona Inst Sci & Technol, Ctr Genom Regulat CRG, Barcelona 08003, Spain
Univ Pompeu Fabra UPF, Barcelona 08003, Spain European Bioinformat Inst, European Mol Biol Lab, Wellcome Genome Campus, Cambridge CB10 1SD, England
[3]
Generalized Bootstrap Supports for Phylogenetic Analyses of Protein Sequences Incorporating Alignment Uncertainty
[J].
Chatzou, Maria
;
Floden, Evan W.
;
Di Tommaso, Paolo
;
Gascuel, Olivier
;
Notredame, Cedric
.
SYSTEMATIC BIOLOGY,
2018, 67 (06)
:997-1009

Chatzou, Maria
论文数: 0 引用数: 0
h-index: 0
机构:
Barcelona Inst Sci & Technol, Ctr Genom Regulat CRG, Dr Aiguader 88, Barcelona 08003, Spain
UPF, Dr Aiguader 88, Barcelona 08003, Spain Barcelona Inst Sci & Technol, Ctr Genom Regulat CRG, Dr Aiguader 88, Barcelona 08003, Spain

Floden, Evan W.
论文数: 0 引用数: 0
h-index: 0
机构:
Barcelona Inst Sci & Technol, Ctr Genom Regulat CRG, Dr Aiguader 88, Barcelona 08003, Spain
UPF, Dr Aiguader 88, Barcelona 08003, Spain Barcelona Inst Sci & Technol, Ctr Genom Regulat CRG, Dr Aiguader 88, Barcelona 08003, Spain

Di Tommaso, Paolo
论文数: 0 引用数: 0
h-index: 0
机构:
Barcelona Inst Sci & Technol, Ctr Genom Regulat CRG, Dr Aiguader 88, Barcelona 08003, Spain
UPF, Dr Aiguader 88, Barcelona 08003, Spain Barcelona Inst Sci & Technol, Ctr Genom Regulat CRG, Dr Aiguader 88, Barcelona 08003, Spain

Gascuel, Olivier
论文数: 0 引用数: 0
h-index: 0
机构:
C3BI USR 3756 CNRS, Unite Bioinformat Evolut, 25-28 Rue Docteur Roux, F-75724 Paris 15, France
Inst Pasteur, 25-28 Rue Docteur Roux, F-75724 Paris 15, France
CNRS, IBC LIRMM UMR5506, Methodes & Algorithmes Bioinformat, 161 Rue Ada, F-34095 Montpellier 5, France
Univ Montpellier, CC477, 161 Rue Ada, F-34095 Montpellier 5, France Barcelona Inst Sci & Technol, Ctr Genom Regulat CRG, Dr Aiguader 88, Barcelona 08003, Spain

Notredame, Cedric
论文数: 0 引用数: 0
h-index: 0
机构:
Barcelona Inst Sci & Technol, Ctr Genom Regulat CRG, Dr Aiguader 88, Barcelona 08003, Spain
UPF, Dr Aiguader 88, Barcelona 08003, Spain Barcelona Inst Sci & Technol, Ctr Genom Regulat CRG, Dr Aiguader 88, Barcelona 08003, Spain
[4]
Middle East Respiratory Syndrome Coronavirus (MERS-CoV): Announcement of the Coronavirus Study Group
[J].
de Groot, Raoul J.
;
Baker, Susan C.
;
Baric, Ralph S.
;
Brown, Caroline S.
;
Drosten, Christian
;
Enjuanes, Luis
;
Fouchier, Ron A. M.
;
Galiano, Monica
;
Gorbalenya, Alexander E.
;
Memish, Ziad A.
;
Perlman, Stanley
;
Poon, Leo L. M.
;
Snijder, Eric J.
;
Stephens, Gwen M.
;
Woo, Patrick C. Y.
;
Zaki, Ali M.
;
Zambon, Maria
;
Ziebuhr, John
.
JOURNAL OF VIROLOGY,
2013, 87 (14)
:7790-7792

de Groot, Raoul J.
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands

Baker, Susan C.
论文数: 0 引用数: 0
h-index: 0
机构:
Loyola Univ, Med Ctr, Dept Microbiol & Immunol, Maywood, IL 60153 USA Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands

Baric, Ralph S.
论文数: 0 引用数: 0
h-index: 0
机构:
Univ N Carolina, Dept Epidemiol, Chapel Hill, NC USA Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands

Brown, Caroline S.
论文数: 0 引用数: 0
h-index: 0
机构:
WHO, WHO Reg Off Europe, Copenhagen, Denmark Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands

Drosten, Christian
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Bonn, Med Ctr, Inst Virol, Bonn, Germany Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands

Enjuanes, Luis
论文数: 0 引用数: 0
h-index: 0
机构:
Campus Univ Autonoma Madrid, Natl Ctr Biotechnol CNB CSIC, Dept Mol & Cell Biol, Madrid, Spain Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands

Fouchier, Ron A. M.
论文数: 0 引用数: 0
h-index: 0
机构:
Erasmus MC, Virosci Lab, Rotterdam, Netherlands Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands

Galiano, Monica
论文数: 0 引用数: 0
h-index: 0
机构:
Publ Hlth England, Hlth Protect Agcy, London, England Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands

Gorbalenya, Alexander E.
论文数: 0 引用数: 0
h-index: 0
机构:
Leiden Univ, Med Ctr, Dept Med Microbiol, Leiden, Netherlands Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands

Memish, Ziad A.
论文数: 0 引用数: 0
h-index: 0
机构:
Minist Hlth, Publ Hlth Directorate, Riyadh, Saudi Arabia Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands

Perlman, Stanley
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Iowa, Dept Microbiol, Iowa City, IA 52242 USA Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands

Poon, Leo L. M.
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Hong Kong, Influenza Res Ctr, Hong Kong, Hong Kong, Peoples R China
Univ Hong Kong, Sch Publ Hlth, Hong Kong, Hong Kong, Peoples R China Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands

Snijder, Eric J.
论文数: 0 引用数: 0
h-index: 0
机构:
Leiden Univ, Med Ctr, Dept Med Microbiol, Leiden, Netherlands Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands

Stephens, Gwen M.
论文数: 0 引用数: 0
h-index: 0
机构:
Minist Hlth, Publ Hlth Directorate, Riyadh, Saudi Arabia Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands

Woo, Patrick C. Y.
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Hong Kong, Dept Microbiol, Hong Kong, Hong Kong, Peoples R China Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands

论文数: 引用数:
h-index:
机构:

Zambon, Maria
论文数: 0 引用数: 0
h-index: 0
机构:
Publ Hlth England, Hlth Protect Agcy, London, England Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands

Ziebuhr, John
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Giessen, Inst Med Virol, D-35390 Giessen, Germany Univ Utrecht, Fac Vet Med, Dept Infect Dis & Immunol, Div Virol, Utrecht, Netherlands
[5]
ProbCons: Probabilistic consistency-based multiple sequence alignment
[J].
Do, CB
;
Mahabhashyam, MSP
;
Brudno, M
;
Batzoglou, S
.
GENOME RESEARCH,
2005, 15 (02)
:330-340

Do, CB
论文数: 0 引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

Mahabhashyam, MSP
论文数: 0 引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

Brudno, M
论文数: 0 引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

Batzoglou, S
论文数: 0 引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[6]
MUSCLE: multiple sequence alignment with high accuracy and high throughput
[J].
Edgar, RC
.
NUCLEIC ACIDS RESEARCH,
2004, 32 (05)
:1792-1797

Edgar, RC
论文数: 0 引用数: 0
h-index: 0
机构: Mill Valley, CA 94941
[7]
PROGRESSIVE SEQUENCE ALIGNMENT AS A PREREQUISITE TO CORRECT PHYLOGENETIC TREES
[J].
FENG, DF
;
DOOLITTLE, RF
.
JOURNAL OF MOLECULAR EVOLUTION,
1987, 25 (04)
:351-360

FENG, DF
论文数: 0 引用数: 0
h-index: 0

DOOLITTLE, RF
论文数: 0 引用数: 0
h-index: 0
[8]
Replication crisis or an opportunity to improve scientific production?
[J].
Frias-Navarro, Dolores
;
Pascual-Llobell, Juan
;
Pascual-Soler, Marcos
;
Perezgonzalez, Jose
;
Berrios-Riquelme, Jose
.
EUROPEAN JOURNAL OF EDUCATION,
2020, 55 (04)
:618-631

Frias-Navarro, Dolores
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Valencia, Fac Psychol, Dept Methodol Behav Sci, Valencia, Spain Univ Valencia, Fac Psychol, Dept Methodol Behav Sci, Valencia, Spain

Pascual-Llobell, Juan
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Valencia, Fac Psychol, Dept Methodol Behav Sci, Valencia, Spain Univ Valencia, Fac Psychol, Dept Methodol Behav Sci, Valencia, Spain

Pascual-Soler, Marcos
论文数: 0 引用数: 0
h-index: 0
机构:
ESIC Business & Mkt Sch, Valencia, Spain Univ Valencia, Fac Psychol, Dept Methodol Behav Sci, Valencia, Spain

Perezgonzalez, Jose
论文数: 0 引用数: 0
h-index: 0
机构:
Massey Univ, Business Sch, Palmerston North, New Zealand Univ Valencia, Fac Psychol, Dept Methodol Behav Sci, Valencia, Spain

Berrios-Riquelme, Jose
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Tarapaca, Dept Social Sci, Fac Social Sci, Iquique, Chile Univ Valencia, Fac Psychol, Dept Methodol Behav Sci, Valencia, Spain
[9]
A benchmark of multiple sequence alignment programs upon structural RNAs
[J].
Gardner, PP
;
Wilm, A
;
Washietl, S
.
NUCLEIC ACIDS RESEARCH,
2005, 33 (08)
:2433-2439

Gardner, PP
论文数: 0 引用数: 0
h-index: 0
机构: Univ Copenhagen, Dept Evolutionary Biol, DK-2100 Copenhagen, Denmark

Wilm, A
论文数: 0 引用数: 0
h-index: 0
机构: Univ Copenhagen, Dept Evolutionary Biol, DK-2100 Copenhagen, Denmark

Washietl, S
论文数: 0 引用数: 0
h-index: 0
机构: Univ Copenhagen, Dept Evolutionary Biol, DK-2100 Copenhagen, Denmark
[10]
The new scope of virus taxonomy: partitioning the virosphere into 15 hierarchical ranks
[J].
Gorbalenya, Alexander E.
;
Krupovic, Mart
;
Mushegian, Arcady
;
Kropinski, Andrew M.
;
Siddell, Stuart G.
;
Varsani, Arvind
;
Adams, Michael J.
;
Davison, Andrew J.
;
Dutilh, Bas E.
;
Harrach, Balazs
;
Harrison, Robert L.
;
Junglen, Sandra
;
King, Andrew M. Q.
;
Knowles, Nick J.
;
Lefkowitz, Elliot J.
;
Nibert, Max L.
;
Rubino, Luisa
;
Sabanadzovic, Sead
;
Sanfacon, Helene
;
Simmonds, Peter
;
Walker, Peter J.
;
Zerbini, F. Murilo
;
Kuhn, Jens H.
.
NATURE MICROBIOLOGY,
2020, 5 (05)
:668-674

Gorbalenya, Alexander E.
论文数: 0 引用数: 0
h-index: 0
机构:
Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands
Lomonosov Moscow State Univ, Fac Bioengn & Bioinformat, Moscow, Russia Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Krupovic, Mart
论文数: 0 引用数: 0
h-index: 0
机构:
Inst Pasteur, Archaeal Virol Unit, Paris, France Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Mushegian, Arcady
论文数: 0 引用数: 0
h-index: 0
机构:
Natl Sci Fdn, Div Mol & Cellular Biosci, Alexandria, VA USA Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Kropinski, Andrew M.
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Guelph, Dept Food Sci, Guelph, ON, Canada
Univ Guelph, Dept Pathobiol, Guelph, ON, Canada Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Siddell, Stuart G.
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Bristol, Sch Cellular & Mol Med, Bristol, Avon, England Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Varsani, Arvind
论文数: 0 引用数: 0
h-index: 0
机构:
Arizona State Univ, Biodesign Ctr Fundamental & Appl Microbi, Ctr Evolut & Med, Sch Life Sci, Tempe, AZ USA Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Adams, Michael J.
论文数: 0 引用数: 0
h-index: 0
机构: Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Davison, Andrew J.
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Glasgow, Ctr Virus Res, MRC, Glasgow, Lanark, Scotland Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Dutilh, Bas E.
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Utrecht, Sci Life, Theoret Biol & Bioinformat, Utrecht, Netherlands Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Harrach, Balazs
论文数: 0 引用数: 0
h-index: 0
机构:
Ctr Agr Res, Inst Vet Med Res, Budapest, Hungary Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Harrison, Robert L.
论文数: 0 引用数: 0
h-index: 0
机构:
USDA ARS, Invas Insect Biocontrol & Behav Lab, Beltsville, MD USA Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Junglen, Sandra
论文数: 0 引用数: 0
h-index: 0
机构:
Charite Univ Med Berlin, Inst Virol, Berlin, Germany
Free Univ Berlin, Berlin, Germany
Humboldt Univ, Berlin, Germany
Berlin Inst Hlth, Berlin, Germany Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

King, Andrew M. Q.
论文数: 0 引用数: 0
h-index: 0
机构:
Pirbright Inst, Pirbright, England Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Knowles, Nick J.
论文数: 0 引用数: 0
h-index: 0
机构:
Pirbright Inst, Pirbright, England Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Lefkowitz, Elliot J.
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Alabama Birmingham, Dept Microbiol, Birmingham, AL 35294 USA Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Nibert, Max L.
论文数: 0 引用数: 0
h-index: 0
机构:
Harvard Med Sch, Blavatn Inst, Dept Microbiol, Boston, MA 02115 USA Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Rubino, Luisa
论文数: 0 引用数: 0
h-index: 0
机构:
Natl Res Council Italy, Ist Protez Sostenibile Piante, Bari, Italy Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Sabanadzovic, Sead
论文数: 0 引用数: 0
h-index: 0
机构:
Mississippi State Univ, Dept Biochem Mol Biol Entomol & Plant Pathol, Mississippi State, MS 39762 USA Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Sanfacon, Helene
论文数: 0 引用数: 0
h-index: 0
机构:
Agr & Agri Food Canada, Summerland Res & Dev Ctr, Summerland, BC, Canada Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

论文数: 引用数:
h-index:
机构:

Walker, Peter J.
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Queensland, Sch Chem & Mol Biosci, St Lucia, Qld, Australia Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Zerbini, F. Murilo
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Fed Vicosa, Dept Fitopatol BIOAGRO, Vicosa, MG, Brazil Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands

Kuhn, Jens H.
论文数: 0 引用数: 0
h-index: 0
机构:
NIAID, Integrated Res Facil Ft Detrick IRF Frederick, NIH, Frederick, MD 21704 USA Leiden Univ, Med Ctr, Dept Biomed Data Sci & Microbiol, Leiden, Netherlands