AutoClassWeb: a simple web interface for Bayesian clustering of omics data

Cited by: 0
Authors
Poulain, Pierre [1 ]
Camadro, Jean-Michel [1 ]
Affiliations
[1] Univ Paris Cite, CNRS, Inst Jacques Monod, F-75013 Paris, France
Keywords
Clustering; Genomics; Proteomics; Bayesian; AutoClass; Machine learning
DOI
10.1186/s13104-022-06129-6
Chinese Library Classification
Q [Biological sciences]
Discipline classification codes
07; 0710; 09
Abstract
Objective: Data clustering is a common exploration step in the omics era, notably in genomics and proteomics, where many genes or proteins can be quantified from one or more experiments. Bayesian clustering is a powerful unsupervised algorithm that can classify several thousand genes or proteins. AutoClass C, its original implementation, handles missing data and automatically determines the best number of clusters, but it is not user-friendly.
Results: We developed an online tool called AutoClassWeb, which provides a simple, easy-to-use web interface for Bayesian clustering with AutoClass. Input data are entered as TSV files and quality controlled. Results are provided in formats that ease further analysis with spreadsheet programs or with programming languages such as Python or R. AutoClassWeb is implemented in Python and is published under the 3-Clause BSD license. The source code is available at https://github.com/pierrepo/autoclassweb along with detailed documentation.
Pages: 4