Comprehensive comparison of large-scale tissue expression datasets

Cited by: 63
Authors
Santos, Alberto [1 ]
Tsafou, Kalliopi [1 ]
Stolte, Christian [2 ]
Pletscher-Frankild, Sune [1 ]
O'Donoghue, Sean I. [2 ,3 ]
Jensen, Lars Juhl [1 ]
Affiliations
[1] Univ Copenhagen, Fac Hlth & Med Sci, Novo Nordisk Fdn Ctr Prot Res, Copenhagen, Denmark
[2] CSIRO, Sydney, NSW, Australia
[3] Garvan Inst Med Res, Sydney, NSW, Australia
Source
PEERJ | 2015 / Vol. 3
Funding
National Institutes of Health (USA);
Keywords
Immunohistochemistry; RNA sequencing; Tissue expression; Mass spectrometry; Microarrays; Databases; Tissue-specificity; GENE-EXPRESSION; MASS-SPECTROMETRY; HOUSEKEEPING GENES; RNA-SEQ; ATLAS; SPECIFICITY; MICROARRAY; DATABASE; DRAFT;
DOI
10.7717/peerj.1054
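Chinese Library Classification number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code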
07 ; 0710 ; 09 ;
Abstract
For tissues to carry out their functions, they rely on the right proteins to be present. Several high-throughput technologies have been used to map out which proteins are expressed in which tissues; however, the data have not previously been systematically compared and integrated. We present a comprehensive evaluation of tissue expression data from a variety of experimental techniques and show that these agree surprisingly well with each other and with results from literature curation and text mining. We further found that most datasets support the assumed but not demonstrated distinction between tissue-specific and ubiquitous expression. By developing comparable confidence scores for all types of evidence, we show that it is possible to improve both quality and coverage by combining the datasets. To facilitate use and visualization of our work, we have developed the TISSUES resource (http://tissues.jensenlab.org), which makes all the scored and integrated data available through a single user-friendly web interface.
Pages: 23