DeClUt: Decluttering differentially expressed genes through clustering of their expression profiles

被引:1
|
作者
Zanfardino, Mario [1 ]
Franzese, Monica [1 ]
Geraci, Filippo [2 ]
机构
[1] IRCCS Synlab SDN, Via E Gianturco 113, I-80143 Naples, Italy
[2] CNR, Inst Informat & Telemat, Via G Moruzzi 1, I-56124 Pisa, Italy
关键词
Differentially expressed genes; RNA-seq; Clustering; k-center; Biomarkers; PACKAGE; CANCER;
D O I
10.1016/j.cmpb.2024.108258
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objective: differential expression analysis is one of the most popular activities in transcriptomic studies based on next-generation sequencing technologies. In fact, differentially expressed genes (DEGs) between two conditions represent ideal prognostic and diagnostic candidate biomarkers for many pathologies. As a result, several algorithms, such as DESeq2 and edgeR, have been developed to identify DEGs. Despite their widespread use, there is no consensus on which model performs best for different types of data, and many existing methods suffer from high False Discovery Rates (FDR). Methods: we present a new algorithm, DeClUt, based on the intuition that the expression profile of differentially expressed genes should form two reasonably compact and well-separated clusters. This, in turn, implies that the bipartition induced by the two conditions being compared should overlap with the clustering. The clustering algorithm underlying DeClUt was designed to be robust to outliers typical of RNA-seq data. In particular, we used the average silhouette function to enforce membership assignment of samples to the most appropriate condition. Results: DeClUt was tested on real RNA-seq datasets and benchmarked against four of the most widely used methods (edgeR, DESeq2, NOISeq, and SAMseq). Experiments showed a higher self-consistency of results than the competitors as well as a significantly lower False Positive Rate (FPR). Moreover, tested on a real prostate cancer RNA-seq dataset, DeClUt has highlighted 8 DE genes, linked to neoplastic process according to DisGeNET database, that none of the other methods had identified. Conclusions: our work presents a novel algorithm that builds upon basic concepts of data clustering and exhibits greater consistency and significantly lower False Positive Rate than state-of-the-art methods. Additionally, DeClUt is able to highlight relevant differentially expressed genes not otherwise identified by other tools contributing to improve efficacy of differential expression analyses in various biological applications.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Genes differentially expressed in thyroid carcinoma identified by comparison of SAGE expression profiles
    Pauws, E
    Veenboer, GJM
    Smit, JWA
    de Vijlder, JJM
    Morreau, H
    Ris-Stalpers, C
    FASEB JOURNAL, 2004, 18 (01): : 560 - +
  • [2] Expression profiles of differentially expressed genes affecting fecundity in goat ovarian tissues
    Ling, Y. H.
    Quan, Q.
    Xiang, H.
    Zhu, L.
    Chu, M. X.
    Zhang, X. R.
    Han, C. Y.
    GENETICS AND MOLECULAR RESEARCH, 2015, 14 (04) : 18743 - 18752
  • [3] Integrated analysis of differentially expressed genes and miRNA expression profiles in dilated cardiomyopathy
    Chen, Yu
    Cai, Wen-Ke
    Yu, Jie
    Shen, Ming
    Zhou, Jin-Huan
    Yang, Sheng-Yu
    Liu, Wei
    Lu, Si
    Shi, Yan-Kun
    Yang, Li-Xia
    HELIYON, 2024, 10 (04)
  • [4] Prioritization of differentially expressed genes through integrating public expression data
    Xu, W.
    Li, S.
    Zhang, Z.
    Hu, J.
    Zhao, Y.
    ANIMAL GENETICS, 2019, 50 (06) : 726 - 732
  • [5] Gene Expression Profiles of Peripheral Blood Monocytes in Osteoarthritis and Analysis of Differentially Expressed Genes
    Shi, Ting
    Shen, Xiongjie
    Gao, Ge
    BIOMED RESEARCH INTERNATIONAL, 2019, 2019
  • [6] Clustering of Differentially Expressed Genes from Transcriptome of Vitis flexuosa
    Ahn, Soon Young
    Kim, Seon Ae
    Kim, Seung Heui
    Yun, Hae Keun
    HORTSCIENCE, 2013, 48 (09) : S261 - S261
  • [7] Molecular diagnosis of colorectal tumors by expression profiles of 50 genes expressed differentially in adenomas and carcinomas
    Lin, YM
    Furukawa, Y
    Tsunoda, T
    Yue, CT
    Yang, KC
    Nakamura, Y
    ONCOGENE, 2002, 21 (26) : 4120 - 4128
  • [8] Molecular diagnosis of colorectal tumors by expression profiles of 50 genes expressed differentially in adenomas and carcinomas
    Yu-Min Lin
    Yoichi Furukawa
    Tatsuhiko Tsunoda
    Chung-Tai Yue
    Kou-Ching Yang
    Yusuke Nakamura
    Oncogene, 2002, 21 : 4120 - 4128
  • [9] Comparing the continuous representation of time-series expression profiles to identify differentially expressed genes
    Bar-Joseph, Z
    Gerber, G
    Simon, L
    Gifford, DK
    Jaakkola, TS
    Jaakkola, TS
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (18) : 10146 - 10151
  • [10] Hybridization and gene expression: Beyond differentially expressed genes
    Runemark, Anna
    Moore, Emily C.
    Larson, Erica L.
    MOLECULAR ECOLOGY, 2024,