Leveraging gene correlations in single cell transcriptomic data

被引:0
作者
Silkwood, Kai [1 ,2 ]
Dollinger, Emmanuel [1 ,2 ,3 ]
Gervin, Joshua [1 ,2 ]
Atwood, Scott [1 ,2 ]
Nie, Qing [1 ,2 ,3 ]
Lander, Arthur D. [1 ,2 ]
机构
[1] Univ Calif Irvine, Ctr Complex Biol Syst, Irvine, CA 92697 USA
[2] Univ Calif Irvine, Dept Dev & Cell Biol, Irvine, CA 92697 USA
[3] Univ Calif Irvine, Dept Math, Irvine, CA USA
来源
BMC BIOINFORMATICS | 2024年 / 25卷 / 01期
基金
美国国家卫生研究院;
关键词
Single cell RNA sequencing; Gene-gene correlation; Gene regulatory network; Gene co-expression network; Melanoma; EXPRESSION ANALYSES; NOISE; SIGNATURES; STATES;
D O I
10.1186/s12859-024-05926-z
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Many approaches have been developed to overcome technical noise in single cell RNA-sequencing (scRNAseq). As researchers dig deeper into data-looking for rare cell types, subtleties of cell states, and details of gene regulatory networks-there is a growing need for algorithms with controllable accuracy and fewer ad hoc parameters and thresholds. Impeding this goal is the fact that an appropriate null distribution for scRNAseq cannot simply be extracted from data in which ground truth about biological variation is unknown (i.e., usually). Results We approach this problem analytically, assuming that scRNAseq data reflect only cell heterogeneity (what we seek to characterize), transcriptional noise (temporal fluctuations randomly distributed across cells), and sampling error (i.e., Poisson noise). We analyze scRNAseq data without normalization-a step that skews distributions, particularly for sparse data-and calculate p values associated with key statistics. We develop an improved method for selecting features for cell clustering and identifying gene-gene correlations, both positive and negative. Using simulated data, we show that this method, which we call BigSur (Basic Informatics and Gene Statistics from Unnormalized Reads), captures even weak yet significant correlation structures in scRNAseq data. Applying BigSur to data from a clonal human melanoma cell line, we identify thousands of correlations that, when clustered without supervision into gene communities, align with known cellular components and biological processes, and highlight potentially novel cell biological relationships. Conclusions New insights into functionally relevant gene regulatory networks can be obtained using a statistically grounded approach to the identification of gene-gene correlations.
引用
收藏
页数:43
相关论文
共 50 条
  • [31] Immune Cell-Related Genes in Juvenile Idiopathic Arthritis Identified Using Transcriptomic and Single-Cell Sequencing Data
    Zhang, Wenbo
    Cai, Zhe
    Liang, Dandan
    Han, Jiaochan
    Wu, Ping
    Shan, Jiayi
    Meng, Guangxun
    Zeng, Huasong
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (13)
  • [32] Single cell transcriptomic analysis reveals tumor immune infiltration by NK cells gene signature in lung adenocarcinoma
    Zhu, Yimin
    Wu, Xiuhua
    Zhang, Yunjiao
    Gu, Jie
    Zhou, Rongwei
    Guo, Zhong
    HELIYON, 2024, 10 (13)
  • [33] Robust discovery of gene regulatory networks from single-cell gene expression data by Causal Inference Using Composition of Transactions
    Shojaee, Abbas
    Huang, Shao-shan Carol
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)
  • [34] scDA: Single cell discriminant analysis for single-cell RNA sequencing data
    Shi, Qianqian
    Li, Xinxing
    Peng, Qirui
    Zhang, Chuanchao
    Chen, Luonan
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 3234 - 3244
  • [35] Joint learning of multiple gene networks from single-cell gene expression data
    Wu, Nuosi
    Yin, Fu
    Ou-Yang, Le
    Zhu, Zexuan
    Xie, Weixin
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 : 2583 - 2595
  • [36] Leveraging single-cell sequencing to classify and characterize tumor subgroups in bulk RNA-sequencing data
    Shetty, Arya
    Wang, Su
    Khan, A. Basit
    English, Collin W.
    Nouri, Shervin Hosseingholi
    Magill, Stephen T.
    Raleigh, David R.
    Klisch, Tiemo J.
    Harmanci, Arif O.
    Patel, Akash J.
    Harmanci, Akdes Serin
    JOURNAL OF NEURO-ONCOLOGY, 2024, 168 (03) : 515 - 524
  • [37] Identification of key biomarkers related to fibrocartilage chondrocytes for osteoarthritis based on bulk, single-cell transcriptomic data
    Pan, Bailin
    Yao, Peixiu
    Ma, Jinjin
    Lin, Xuanhao
    Zhou, Laixi
    Lin, Canzhen
    Zhang, Yufeng
    Lin, Bendan
    Lin, Chuangxin
    FRONTIERS IN IMMUNOLOGY, 2024, 15
  • [38] IDEIS: a tool to identify PTPRC/CD45 isoforms from single-cell transcriptomic data
    Michalik, Juraj
    Niederlova, Veronika
    Stepanek, Ondrej
    FRONTIERS IN IMMUNOLOGY, 2024, 15
  • [39] Studying the mechanism underlying lipid metabolism in osteosarcoma based on transcriptomic RNA sequencing and single-cell data
    Li, Zhe
    Jin, Chi
    Lu, Xinchang
    Zhang, Yan
    Zhang, Yi
    Wen, Jia
    Liu, Yongkui
    Liu, Xiaoting
    Li, Jiazhen
    JOURNAL OF GENE MEDICINE, 2023, 25 (06)
  • [40] Data Integration of Hybrid Microarray and Single Cell Expression Data to Enhance Gene Network Inference
    Zhang, Wei
    Li, Wenchao
    Zhang, Jianming
    Wang, Ning
    CURRENT BIOINFORMATICS, 2019, 14 (03) : 255 - 268