Leveraging Clustering Techniques to Facilitate Metagenomic Analysis

被引:1
|
作者
Ennis, Damien [1 ]
Dascalu, Sergiu [1 ]
Harris, Frederick C., Jr. [1 ]
机构
[1] Univ Nevada, Dept Comp Sci & Engn, Reno, NV 89557 USA
基金
美国国家科学基金会;
关键词
Metagenomics; Clustering; K-means; Machine learning; Self-organizing map; SEARCH;
D O I
10.1080/10798587.2015.1073887
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning clustering algorithms provide excellent methods for conducting metagenomic analysis with efficiency. This study uses two machine learning algorithms, the self-organizing map and the K-means algorithms, to cluster data from an environmental sample collected from a hot springs habitat and to provide a visual analysis of that data. A data processing pipeline is described that uses the clustering algorithms to identify which reference genomes should be included for further analysis in determining possible organisms that are present in a metagenomic sample. The clustering revealed probable candidates for additional analysis, including a thermophilic, anaerobic bacterium, which is likely to be found in a hot springs environment and serves to validate the functionality of these tools. The machine learning techniques discussed here can serve as a launching point for elucidating protein sequences that could serve as possible reference comparisons to a specific metagenomic sample and lead to further study.
引用
收藏
页码:153 / 165
页数:13
相关论文
共 50 条
  • [31] MBBC: an efficient approach for metagenomic binning based on clustering
    Wang, Ying
    Hu, Haiyan
    Li, Xiaoman
    BMC BIOINFORMATICS, 2015, 16
  • [32] Generalized Lattice Based Probabilistic Approach for Metagenomic Clustering
    Jha, Manjari
    Malhotra, Raunaq
    Acharya, Raj
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (04) : 749 - 761
  • [33] MBBC: an efficient approach for metagenomic binning based on clustering
    Ying Wang
    Haiyan Hu
    Xiaoman Li
    BMC Bioinformatics, 16
  • [34] An Integrative Approach for the Functional Analysis of Metagenomic Studies
    Wassan, Jyotsna Talreja
    Wang, Haiying
    Browne, Fiona
    Wash, Paul
    Kelly, Brain
    Palu, Cintia
    Konstantinidou, Nina
    Roehe, Rainer
    Dewhurst, Richard
    Zheng, Huiru
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT II, 2017, 10362 : 421 - 427
  • [35] Clustering Web services to facilitate service discovery
    Jian Wu
    Liang Chen
    Zibin Zheng
    Michael R. Lyu
    Zhaohui Wu
    Knowledge and Information Systems, 2014, 38 : 207 - 229
  • [36] Clustering Web services to facilitate service discovery
    Wu, Jian
    Chen, Liang
    Zheng, Zibin
    Lyu, Michael R.
    Wu, Zhaohui
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 38 (01) : 207 - 229
  • [37] COMPARATIVE PERFORMANCE ANALYSIS OF CLUSTERING TECHNIQUES IN EDUCATIONAL DATA MINING
    DeFreitas, Kyle
    Bernard, Margaret
    IADIS-INTERNATIONAL JOURNAL ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2015, 10 (02): : 65 - 78
  • [38] Techniques for analysis of disease clustering in space and in time in veterinary epidemiology
    Ward, MP
    Carpenter, TE
    PREVENTIVE VETERINARY MEDICINE, 2000, 45 (3-4) : 257 - 284
  • [39] Analysis of Valuable Clustering Techniques for Deep Web Access and Navigation
    Qurat-ul-ain
    Sajid, Asma
    Jamil, Uzma
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (01) : 204 - 211
  • [40] Frequent conceptual links and link-based clustering: A comparative analysis of two clustering techniques
    Stattner, Erick
    Collard, Martine
    2013 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2013, : 140 - 147