Rapid analysis of metagenomic data using signature-based clustering

被引:2
|
作者
Chappell, Timothy [1 ]
Geva, Shlomo [1 ]
Hogan, James M. [1 ]
Huygens, Flavia [2 ,3 ]
Rathnayake, Irani U. [2 ,3 ]
Rudd, Stephen [4 ]
Kelly, Wayne [1 ]
Perrin, Dimitri [1 ]
机构
[1] Queensland Univ Technol, Sch Elect Engn & Comp Sci, 2 George St, Brisbane, Qld 4001, Australia
[2] Queensland Univ Technol, Inst Hlth & Biomed Innovat, 60 Musk Ave, Kelvin Grove, Qld 4059, Australia
[3] Queensland Univ Technol, Fac Hlth, Sch Biomed Sci, 2 George St, Brisbane, Qld 4001, Australia
[4] QFAB, Level 6 QBP Bld 80,Chancellors Pl, Brisbane, Qld 4072, Australia
来源
BMC BIOINFORMATICS | 2018年 / 19卷
关键词
Metagenomics; Clustering; Community analysis; Read signatures; Wound healing; SEARCH; MICROBIOLOGY; BACTERIAL;
D O I
10.1186/s12859-018-2540-4
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundSequencing highly-variable 16S regions is a common and often effective approach to the study of microbial communities, and next-generation sequencing (NGS) technologies provide abundant quantities of data for analysis. However, the speed of existing analysis pipelines may limit our ability to work with these quantities of data. Furthermore, the limited coverage of existing 16S databases may hamper our ability to characterise these communities, particularly in the context of complex or poorly studied environments.ResultsIn this article we present the SigClust algorithm, a novel clustering method involving the transformation of sequence reads into binary signatures. When compared to other published methods, SigClust yields superior cluster coherence and separation of metagenomic read data, while operating within substantially reduced timeframes. We demonstrate its utility on published Illumina datasets and on a large collection of labelled wound reads sourced from patients in a wound clinic. The temporal analysis is based on tracking the dominant clusters of wound samples over time. The analysis can identify markers of both healing and non-healing wounds in response to treatment. Prominent clusters are found, corresponding to bacterial species known to be associated with unfavourable healing outcomes, including a number of strains of Staphylococcus aureus.ConclusionsSigClust identifies clusters rapidly and supports an improved understanding of the wound microbiome without reliance on a reference database. The results indicate a promising use for a SigClust-based pipeline in wound analysis and prediction, and a possible novel method for wound management and treatment.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Rapid analysis of metagenomic data using signature-based clustering
    Timothy Chappell
    Shlomo Geva
    James M. Hogan
    Flavia Huygens
    Irani U. Rathnayake
    Stephen Rudd
    Wayne Kelly
    Dimitri Perrin
    BMC Bioinformatics, 19
  • [2] Signature-Based Clustering for Analysis of the Wound Microbiome
    Chappell, Timothy
    Geva, Shlomo
    Hogan, James M.
    Huygens, Flavia
    Kelly, Wayne
    Perrin, Dimitri
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 339 - 346
  • [3] GSTaxClassifier: a genomic signature based taxonomic classifier for metagenomic data analysis
    Yu, Fahong
    Sun, Yijun
    Liu, Li
    Farmerie, William
    BIOINFORMATION, 2009, 4 (01) : 46 - 49
  • [4] Leveraging Clustering Techniques to Facilitate Metagenomic Analysis
    Ennis, Damien
    Dascalu, Sergiu
    Harris, Frederick C., Jr.
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2016, 22 (01) : 153 - 165
  • [5] Ultrafast clustering algorithms for metagenomic sequence analysis
    Li, Weizhong
    Fu, Limin
    Niu, Beifang
    Wu, Sitao
    Wooley, John
    BRIEFINGS IN BIOINFORMATICS, 2012, 13 (06) : 656 - 668
  • [6] Energy signature-based clustering using open data for urban building energy analysis toward carbon neutrality: A case study on electricity change under COVID-19
    Choi, Sebin
    Yoon, Sungmin
    SUSTAINABLE CITIES AND SOCIETY, 2023, 92
  • [7] Clustering-based visualizations for diagnosing diseases on metagenomic data
    Nguyen, Hai Thanh
    Phan, Trang Huyen
    Pham, Linh Thuy Thi
    Pham, Ngoc Huynh
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (8-9) : 5685 - 5699
  • [8] Massive metagenomic data analysis using abundance-based machine learning
    Harris, Zachary N.
    Dhungel, Eliza
    Mosior, Matthew
    Ahn, Tae-Hyuk
    BIOLOGY DIRECT, 2019, 14 (1)
  • [9] Massive metagenomic data analysis using abundance-based machine learning
    Zachary N. Harris
    Eliza Dhungel
    Matthew Mosior
    Tae-Hyuk Ahn
    Biology Direct, 14
  • [10] Binning Metagenomic Contigs Using Unsupervised Clustering and Reference Databases
    Jiang, Zhongjun
    Li, Xiaobo
    Guo, Lijun
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2022, 14 (04) : 795 - 803