A fuzzy-set-theory-based approach to analyse species membership in DNA barcoding

被引:74
作者
Zhang, A. -B. [1 ]
Muster, C. [2 ]
Liang, H. -B. [3 ]
Zhu, C. -D. [3 ]
Crozier, R. [4 ]
Wan, P. [1 ]
Feng, J. [5 ]
Ward, R. D. [6 ]
机构
[1] Capital Normal Univ, Coll Life Sci, Beijing 100048, Peoples R China
[2] Univ Greifswald, Zool Inst & Museum, Greifswald, Germany
[3] Chinese Acad Sci, Key Lab Zool Systemat & Evolut, Inst Zool, Beijing 100101, Peoples R China
[4] James Cook Univ, Dept Evolutionary Genet, Sch Marine & Trop Biol, Townsville, Qld 4811, Australia
[5] Capital Normal Univ, Coll Appl Math, Beijing 100048, Peoples R China
[6] CSIRO Marine & Atmospher Res, Wealth Oceans Flagship, Hobart, Tas 7001, Australia
基金
澳大利亚研究理事会;
关键词
DNA barcoding; fuzzy set theory; species membership; statistical approach; C-OXIDASE-I; MITOCHONDRIAL-DNA; IDENTIFICATION; TAXONOMY; SEQUENCES; LIFE; CONSEQUENCES; LEPIDOPTERA; PERFORMANCE; DIPTERA;
D O I
10.1111/j.1365-294X.2011.05235.x
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Reliable assignment of an unknown query sequence to its correct species remains a methodological problem for the growing field of DNA barcoding. While great advances have been achieved recently, species identification from barcodes can still be unreliable if the relevant biodiversity has been insufficiently sampled. We here propose a new notion of species membership for DNA barcodingfuzzy membership, based on fuzzy set theoryand illustrate its successful application to four real data sets (bats, fishes, butterflies and flies) with more than 5000 random simulations. Two of the data sets comprise especially dense species/population-level samples. In comparison with current DNA barcoding methods, the newly proposed minimum distance (MD) plus fuzzy set approach, and another computationally simple method, best close match, outperform two computationally sophisticated Bayesian and BootstrapNJ methods. The new method proposed here has great power in reducing false-positive species identification compared with other methods when conspecifics of the query are absent from the reference database.
引用
收藏
页码:1848 / 1863
页数:16
相关论文
共 71 条
[1]   A step toward barcoding life: A model-based, decision-theoretic method to assign genes to preexisting species groups [J].
Abdo, Zaid ;
Golding, G. Brian .
SYSTEMATIC BIOLOGY, 2007, 56 (01) :44-56
[2]   DNA barcode analysis: a comparison of phylogenetic and statistical classification methods [J].
Austerlitz, Frederic ;
David, Olivier ;
Schaeffer, Brigitte ;
Bleakley, Kevin ;
Olteanu, Madalina ;
Leblois, Raphael ;
Veuille, Michel ;
Laredo, Catherine .
BMC BIOINFORMATICS, 2009, 10 :S10
[3]   Intraspecific genetic variation in Paramecium revealed by mitochondrial cytochrome c oxidase I sequences [J].
Barth, D ;
Krenek, S ;
Fokin, SI ;
Berendonk, TU .
JOURNAL OF EUKARYOTIC MICROBIOLOGY, 2006, 53 (01) :20-25
[4]   Problems with DNA barcodes for species delimitation:: 'ten species' of Astraptes fulgerator reassessed (Lepidoptera: Hesperiidae) [J].
Brower, Andrew V. Z. .
SYSTEMATICS AND BIODIVERSITY, 2006, 4 (02) :127-132
[5]   Barcoding ciliates:: a comprehensive study of 75 isolates of the genus Tetrahymena [J].
Chantangsi, Chitchai ;
Lynn, Denis H. ;
Brandl, Maria T. ;
Cole, Jeffrey C. ;
Hetrick, Neil ;
Ikonomi, Pranvera .
INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY, 2007, 57 :2412-2425
[6]   Rapid DNA barcoding analysis of large datasets using the composition vector method [J].
Chu, Ka Hou ;
Xu, Minli ;
Li, Chi Pang .
BMC BIOINFORMATICS, 2009, 10 :S8
[7]   DNA barcoding of Neotropical bats: species identification and discovery within Guyana [J].
Clare, Elizabeth L. ;
Lim, Burton K. ;
Engstrom, Mark D. ;
Eger, Judith L. ;
Hebert, Paul D. N. .
MOLECULAR ECOLOGY NOTES, 2007, 7 (02) :184-190
[8]   Complete DNA barcode reference library for a country's butterfly fauna reveals high performance for temperate Europe [J].
Dinca, Vlad ;
Zakharov, Evgeny V. ;
Hebert, Paul D. N. ;
Vila, Roger .
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2011, 278 (1704) :347-355
[9]   DNA barcoding is no substitute for taxonomy [J].
Ebach, MC ;
Holdrege, C .
NATURE, 2005, 434 (7034) :697-697
[10]   MUSCLE: multiple sequence alignment with high accuracy and high throughput [J].
Edgar, RC .
NUCLEIC ACIDS RESEARCH, 2004, 32 (05) :1792-1797