Stemformatics: visualize and download curated stem cell data

Cited by: 25
Authors
Choi, Jarny [1]
Pacheco, Chris M. [1]
Mosbergen, Rowland [1]
Korn, Othmar [2]
Chen, Tyrone [1]
Nagpal, Isha [1]
Englart, Steve [1]
Angel, Paul W. [1]
Wells, Christine A. [1, 3]
Affiliations
[1] Univ Melbourne, Ctr Stem Cell Syst Anat & Neurosci, Melbourne, Vic 3010, Australia
[2] Univ Queensland, Australian Inst Bioengn & Nanotechnol, Brisbane, Qld, Australia
[3] Walter & Eliza Hall Inst Med Res, Parkville, Vic 3052, Australia
Funding
Australian Research Council
Keywords
GENE-EXPRESSION; ROUTES;
DOI
10.1093/nar/gky1064
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular biology]
Discipline classification codes
071010 ; 081704 ;
Abstract
Stemformatics is an established gene expression data portal containing over 420 public gene expression datasets derived from microarray, RNA sequencing and single cell profiling technologies. Developed for the stem cell community, it has a major focus on pluripotency, tissue stem cells, and staged differentiation. Stemformatics includes curated collections of data relevant to cell reprogramming, as well as hematopoiesis and leukaemia. Rather than simply rehosting datasets as they appear in public repositories, Stemformatics uses a stringent set of quality control metrics and its own pipelines to process handpicked datasets from raw files. As a result, about 30% of datasets processed by Stemformatics fail the quality control metrics and never make it to the portal, ensuring that Stemformatics data are of high quality and have been processed in a consistent manner. Stemformatics provides easy-to-use and intuitive tools for biologists to visually explore the data, including interactive gene expression profiles, principal component analysis plots and hierarchical clusters, among others. The addition of tools that facilitate cross-dataset comparisons provides users with snapshots of gene expression in multiple cell types and tissues, assisting the identification of cell-type restricted genes, or potential housekeeping genes. Stemformatics is freely available at stemformatics.org.
Pages: D841-D846
Page count: 6