DNA barcoding authentication for the wood of eight endangered Dalbergia timber species using machine learning approaches

Cited by: 25
Authors
He, Tuo [1, 2]
Jiao, Lichao [1, 2]
Yu, Min [1, 2]
Guo, Juan [1, 2]
Jiang, Xiaomei [1, 2]
Yin, Yafang [1, 2]
Affiliations
[1] Chinese Acad Forestry, Chinese Res Inst Wood Ind, Dept Wood Anat & Utilizat, Beijing 100091, Peoples R China
[2] Chinese Acad Forestry, Wood Collect WOODPEDIA, Beijing 100091, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Dalbergia timber species; DNA barcoding; illegal logging; machine learning approaches (MLAs); reference library; SMO classifier; wood identification; IDENTIFICATION; SEQUENCES; CLASSIFICATION; VALIDATION; ASSIGNMENT; MEMBERSHIP; SPECIMENS; SOFTWARE; TAXONOMY; DISTANCE
DOI
10.1515/hf-2018-0076
Chinese Library Classification (CLC) number
S7 [Forestry]
Discipline classification code
0829; 0907
Abstract
Reliable wood identification and proof of the provenance of trees are the first steps in combating illegal logging. DNA barcoding is a promising tool in this regard, but it requires reliable methods and reference libraries. Machine learning approaches (MLAs), which are based on mathematical multivariate analysis, are well suited to the needs of DNA barcoding. In the present study, the DNA sequences of eight Dalbergia timber species were investigated, focusing on four barcodes (ITS2, matK, trnH-psbA and trnL), using the MLAs BLOG and WEKA for wood species identification. Sequences downloaded from NCBI (288 sequences) and taken from a previous study by the authors (153 DNA sequences) served as the calibration dataset. The effectiveness of the MLAs was verified by identifying non-vouchered wood specimens. The results indicate that the SMO classifier, part of the WEKA approach, performed best (98% to 100%) in discriminating the eight Dalbergia timber species. Moreover, the two-locus combination ITS2 + trnH-psbA showed the highest success rate. Furthermore, the non-vouchered wood specimens were successfully identified by means of ITS2 + trnH-psbA with the SMO classifier. In combination with DNA barcode reference libraries, MLAs are successful in identifying endangered Dalbergia timber species.
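To make the idea of a two-locus barcode classifier concrete, the following is a minimal Python sketch, not the authors' pipeline. The study used the SMO classifier in WEKA and the BLOG tool; neither is reproduced here. As a stand-in, the sketch turns each locus into 3-mer frequencies, concatenates the ITS2 and trnH-psbA vectors, and assigns a query to the nearest class centroid. All sequences, the labels `species_A` and `species_B`, the function names, and the choice of k = 3 are hypothetical and chosen only for illustration.

```python
from collections import Counter
from itertools import product
import math

K = 3  # assumed k-mer length, not taken from the study
KMERS = ["".join(p) for p in product("ACGT", repeat=K)]

def kmer_vector(seq, k=K):
    """Return normalized k-mer frequencies for one DNA sequence."""
    seq = seq.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts[m] for m in KMERS) or 1  # avoid division by zero
    return [counts[m] / total for m in KMERS]

def combine_loci(record):
    """Concatenate the feature vectors of the two loci (ITS2 + trnH-psbA)."""
    return kmer_vector(record["ITS2"]) + kmer_vector(record["trnH-psbA"])

def train_centroids(records):
    """Average the feature vectors of each species in the reference set."""
    sums, sizes = {}, Counter()
    for rec in records:
        vec = combine_loci(rec)
        label = rec["species"]
        prev = sums.get(label, [0.0] * len(vec))
        sums[label] = [a + b for a, b in zip(prev, vec)]
        sizes[label] += 1
    return {lab: [v / sizes[lab] for v in s] for lab, s in sums.items()}

def classify(record, centroids):
    """Assign the species whose centroid is closest (Euclidean distance)."""
    vec = combine_loci(record)
    return min(centroids, key=lambda lab: math.dist(vec, centroids[lab]))

# Made-up toy reference library; these are not real Dalbergia sequences.
reference = [
    {"species": "species_A", "ITS2": "ATATATTTAAATATAT", "trnH-psbA": "TTTAAATTTAAATTTA"},
    {"species": "species_A", "ITS2": "ATATATTTAAATATTT", "trnH-psbA": "TTTAAATTTAAATTAA"},
    {"species": "species_B", "ITS2": "GCGCGGCCGCGGGCGC", "trnH-psbA": "CCGGCCGGGCCCGGCC"},
    {"species": "species_B", "ITS2": "GCGCGGCCGCGGGCCC", "trnH-psbA": "CCGGCCGGGCCCGGCG"},
]
centroids = train_centroids(reference)

# Hypothetical query standing in for a non-vouchered specimen.
query = {"ITS2": "ATATTTTTAAATATAT", "trnH-psbA": "TTTAAATTTTAATTTA"}
print(classify(query, centroids))
```

A nearest-centroid rule is far simpler than a support vector machine trained with SMO; it is used here only because it fits in a few lines without external libraries. The point of the sketch is the data flow: a reference library of labelled sequences, a per-locus feature vector, concatenation across loci, and assignment of an unlabelled specimen.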
Pages: 277-285
Page count: 9