Divide and Conquer: Enriching Environmental Sequencing Data

Cited by: 5
Authors
Bergeron, Anne [1]
Belcaid, Mahdi [2]
Steward, Grieg F. [3]
Poisson, Guylaine [2]
Affiliations
[1] Univ Quebec, Montreal, PQ H3C 3P8, Canada
[2] Univ Hawaii Manoa, Honolulu, HI 96822 USA
[3] Univ Hawaii Manoa, Dept Oceanog, Honolulu, HI 96822 USA
Source
PLOS ONE | 2007 / Volume 2 / Issue 09
Funding
Natural Sciences and Engineering Research Council of Canada; U.S. National Science Foundation
DOI
10.1371/journal.pone.0000830
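Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract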
Background. In environmental sequencing projects, a mix of DNA from a whole microbial community is fragmented and sequenced, with one of the possible goals being to reconstruct partial or complete genomes of members of the community. In communities with high diversity of species, a significant proportion of the sequences do not overlap any other fragment in the sample. This problem will arise not only in situations with a relatively even distribution of many species, but also when the community in a particular environment is routinely dominated by the same few species. In the former case, no genomes may be assembled at all, while in the latter case a few dominant species in an environment will always be sequenced at high coverage to the detriment of coverage of the greater number of sparse species.

Methods and Results. Here we show that, with the same global sequencing effort, separating the species into two or more sub-communities prior to sequencing can yield a much higher proportion of sequences that can be assembled. We first use the Lander-Waterman model to show that, if the expected percentage of singleton sequences is higher than 25%, then, under the uniform distribution hypothesis, splitting the community is always a wise choice. We then construct simulated microbial communities to show that the results hold for highly non-uniform distributions. We also show that, for the distributions considered in the experiments, it is possible to estimate quite accurately the relative diversity of the two sub-communities.

Conclusion. Given the fact that several methods exist to split microbial communities based on physical properties such as size, density, surface biochemistry, or optical properties, we strongly suggest that groups involved in environmental sequencing, and expecting high diversity, consider splitting their communities in order to maximize the information content of their sequencing effort.
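
The role of the Lander-Waterman model in the abstract can be illustrated with a minimal sketch. It relies on the standard Lander-Waterman result that, at coverage c and minimum detectable overlap fraction θ, a read is expected to be a singleton with probability exp(-2c(1-θ)). Everything else is an assumption made for illustration and is not taken from the paper: the function names, the parameters `species_share` and `effort_share`, the idealized uniform community with equal genome sizes, and the way sequencing effort is allocated between the two sub-communities. The sketch does not reproduce the paper's 25% threshold argument.

```python
import math


def singleton_fraction(coverage, min_overlap_fraction=0.0):
    """Expected fraction of reads that overlap no other read.

    Standard Lander-Waterman estimate: exp(-2 * c * (1 - theta)), where c is
    the coverage and theta is the minimum detectable overlap expressed as a
    fraction of the read length.
    """
    return math.exp(-2.0 * coverage * (1.0 - min_overlap_fraction))


def split_singleton_fraction(coverage, species_share, effort_share):
    """Overall singleton fraction after splitting a uniform community in two.

    Illustrative assumptions (not the paper's procedure):
    - all species are equally abundant and have equal genome sizes;
    - sub-community 1 holds `species_share` of the species and receives
      `effort_share` of the reads; sub-community 2 gets the remainder;
    - `coverage` is the coverage the unsplit community would reach with
      the full sequencing effort.
    """
    if not 0.0 < species_share < 1.0:
        raise ValueError("species_share must lie strictly between 0 and 1")
    if not 0.0 <= effort_share <= 1.0:
        raise ValueError("effort_share must lie between 0 and 1")
    # Coverage scales with reads received and inversely with species present.
    c1 = coverage * effort_share / species_share
    c2 = coverage * (1.0 - effort_share) / (1.0 - species_share)
    # Weight each sub-community's singleton fraction by its share of reads.
    return (effort_share * singleton_fraction(c1)
            + (1.0 - effort_share) * singleton_fraction(c2))


if __name__ == "__main__":
    c = 0.5  # hypothetical low coverage, typical of a highly diverse sample
    print("unsplit:", singleton_fraction(c))
    print("split, 90% of reads on half the species:",
          split_singleton_fraction(c, species_share=0.5, effort_share=0.9))
```

Under these assumptions, an even split with evenly divided effort leaves the singleton fraction unchanged, while concentrating the same number of reads on one sub-community lowers it.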
Pages: 7