Cross-sectional use of barcode of life data system and GenBank as DNA barcoding databases for the advancement of museomics

被引:3
作者
Nakazato, Takeru [1 ]
Jinbo, Utsugi [2 ]
机构
[1] Res Org Informat & Syst ROIS, Joint Support Ctr Data Sci Res ROIS DS, Database Ctr Life Sci DBCLS, Mishima, Japan
[2] Natl Museum Nat & Sci, Ctr Collect, Tsukuba, Japan
关键词
sequencing data; voucher specimen; DNA barcode; biodiversity information; taxonomy database;
D O I
10.3389/fevo.2022.966605
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Museomics is an approach to the DNA sequencing of museum specimens that can generate both biodiversity and sequence information. In this study, we surveyed both the biodiversity information-based database BOLD (Barcode of Life System) and the sequence information database GenBank, by using DNA barcoding data as an example, with the aim of integrating the data from these two databases. DNA barcoding is a method of identifying species from DNA sequences by using short genetic markers. We surveyed how many entries had biodiversity information (such as links to BOLD and specimen IDs) by downloading all fish, insect, and flowering plant data available from the GenBank Nucleotide, and BOLD ID was assigned to 26.2% of entries for insects. In the same way, we downloaded the respective BOLD data and checked the status of links to sequence information. We also investigated how many species do these databases cover, and 7,693 species were found to exist only in BOLD. In the future, as museomics develops as a field, the targeted sequences will be extended not only to DNA barcodes, but also to mitochondrial genomes, other genes, and genome sequences. Consequently, the value of the sequence data will increase. In addition, various species will be sequenced and, thus, biodiversity information such as the evidence specimen photographs used as a basis for species identification, will become even more indispensable. This study contributes to the acceleration of museomics-associated research by using databases in a cross-sectional manner.
引用
收藏
页数:10
相关论文
共 25 条
[1]   Trends in DNA barcoding and metabarcoding INTRODUCTION [J].
Adamowicz, Sarah J. ;
Boatwright, James S. ;
Chain, Frederic ;
Fisher, Brian L. ;
Hogg, Ian D. ;
Leese, Florian ;
Lijtmaer, Dario A. ;
Mwale, Monica ;
Naaum, Amanda M. ;
Pochon, Xavier ;
Steinke, Dirk ;
Wilson, John-James ;
Wood, Susanna ;
Xu, Jianping ;
Xu, Sen ;
Zhou, Xin ;
van der Bank, Michelle .
GENOME, 2019, 62 (03) :V-VIII
[2]  
Andersson A.F., 2020, PUBLISHING DNA DERIV
[3]   The international nucleotide sequence database collaboration [J].
Arita, Masanori ;
Karsch-Mizrachi, Ilene ;
Cochrane, Guy .
NUCLEIC ACIDS RESEARCH, 2021, 49 (D1) :D121-D124
[4]   Collections-based research in the genomic era [J].
Buerki, Sven ;
Baker, William J. .
BIOLOGICAL JOURNAL OF THE LINNEAN SOCIETY, 2016, 117 (01) :5-10
[5]   Presenting and preserving the change in taxonomic knowledge for linked data [J].
Chawuthai, Rathachai ;
Takeda, Hideaki ;
Wuwongse, Vilas ;
Jinbo, Utsugi .
SEMANTIC WEB, 2016, 7 (06) :589-616
[6]   Review and Interpretation of Trends in DNA Barcoding [J].
DeSalle, Rob ;
Goldstein, Paul .
FRONTIERS IN ECOLOGY AND EVOLUTION, 2019, 7
[7]  
GBIF, 2019, CIRRH ICT HUFN 1766, DOI [10.15468/39omei, DOI 10.15468/39OMEI]
[8]  
Groom Q.J., 2021, PREPRINT, DOI [10.37044/osf.io/93qf4, DOI 10.37044/OSF.IO/93QF4]
[9]   Standards recommendations for the Earth BioGenome Project [J].
Lawniczak, Mara K. N. ;
Durbin, Richard ;
Flicek, Paul ;
Lindblad-Toh, Kerstin ;
Wei, Xiaofeng ;
Archibald, John M. ;
Baker, William J. ;
Belov, Katherine ;
Blaxter, Mark L. ;
Bonet, Tomas Marques ;
Childers, Anna K. ;
Coddington, Jonathan A. ;
Crandall, Keith A. ;
Crawford, Andrew J. ;
Davey, Robert P. ;
Di Palma, Federica ;
Fang, Qi ;
Haerty, Wilfried ;
Hall, Neil ;
Hoff, Katharina J. ;
Howe, Kerstin ;
Jarvis, Erich D. ;
Johnson, Warren E. ;
Johnson, Rebecca N. ;
Kersey, Paul J. ;
Liu, Xin ;
Lopez, Jose Victor ;
Myers, Eugene W. ;
Pettersson, Olga Vinnere ;
Phillippy, Adam M. ;
Poelchau, Monica F. ;
Pruitt, Kim D. ;
Rhie, Arang ;
Castilla-Rubio, Juan Carlos ;
Sahu, Sunil Kumar ;
Salmon, Nicholas A. ;
Soltis, Pamela S. ;
Swarbreck, David ;
Thibaud-Nissen, Francoise ;
Wang, Sibo ;
Wegrzyn, Jill L. ;
Zhang, Guojie ;
Zhang, He ;
Lewin, Harris A. ;
Richards, Stephen .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2022, 119 (04)
[10]   GenBank is a reliable resource for 21st century biodiversity research [J].
Leray, Matthieu ;
Knowlton, Nancy ;
Ho, Shian-Lei ;
Nguyen, Bryan N. ;
Machida, Ryuji J. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (45) :22651-22656