Geographically weighted linear combination test for gene-set analysis of a continuous spatial phenotype as applied to intratumor heterogeneity

被引:1
|
作者
Amini, Payam [1 ,2 ]
Hajihosseini, Morteza [3 ,4 ]
Pyne, Saumyadipta [5 ,6 ]
Dinu, Irina [3 ]
机构
[1] Iran Univ Med Sci, Sch Publ Hlth, Dept Biostat, Tehran, Iran
[2] Keele Univ, Sch Med, Keele, Staffs, England
[3] Univ Alberta, Sch Publ Hlth, Edmonton, AB, Canada
[4] Ctr Acad Med, Stanford Dept Urol, Palo Alto, CA USA
[5] Hlth Analyt Network, Pittsburgh, PA 15237 USA
[6] Univ Calif Santa Barbara, Santa Barbara, CA 93106 USA
来源
FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY | 2023年 / 11卷
关键词
intratumor heterogeneity; gene-set analysis; geographically weighted regression; linear combination test; Spatial single cell analysis; cancer-associated fibroblast; CARCINOMA-ASSOCIATED FIBROBLASTS; CANCER-ASSOCIATED FIBROBLASTS; STROMAL CELLS; GLOBAL TEST; EXPRESSION; EVOLUTION; HALLMARKS; INFORMATION; REGRESSION; SIGNATURES;
D O I
10.3389/fcell.2023.1065586
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
Background: The impact of gene-sets on a spatial phenotype is not necessarily uniform across different locations of cancer tissue. This study introduces a computational platform, GWLCT, for combining gene set analysis with spatial data modeling to provide a new statistical test for location-specific association of phenotypes and molecular pathways in spatial single-cell RNA-seq data collected from an input tumor sample.Methods: The main advantage of GWLCT consists of an analysis beyond global significance, allowing the association between the gene-set and the phenotype to vary across the tumor space. At each location, the most significant linear combination is found using a geographically weighted shrunken covariance matrix and kernel function. Whether a fixed or adaptive bandwidth is determined based on a cross-validation cross procedure. Our proposed method is compared to the global version of linear combination test (LCT), bulk and random-forest based gene-set enrichment analyses using data created by the Visium Spatial Gene Expression technique on an invasive breast cancer tissue sample, as well as 144 different simulation scenarios.Results: In an illustrative example, the new geographically weighted linear combination test, GWLCT, identifies the cancer hallmark gene-sets that are significantly associated at each location with the five spatially continuous phenotypic contexts in the tumors defined by different well-known markers of cancer-associated fibroblasts. Scan statistics revealed clustering in the number of significant gene-sets. A spatial heatmap of combined significance over all selected gene-sets is also produced. Extensive simulation studies demonstrate that our proposed approach outperforms other methods in the considered scenarios, especially when the spatial association increases.Conclusion: Our proposed approach considers the spatial covariance of gene expression to detect the most significant gene-sets affecting a continuous phenotype. It reveals spatially detailed information in tissue space and can thus play a key role in understanding the contextual heterogeneity of cancer cells.
引用
收藏
页数:15
相关论文
共 2 条
  • [1] Linear combination test for gene set analysis of a continuous phenotype
    Dinu, Irina
    Wang, Xiaoming
    Kelemen, Linda E.
    Vatanpour, Shabnam
    Pyne, Saumyadipta
    BMC BIOINFORMATICS, 2013, 14
  • [2] Longitudinal linear combination test for gene set analysis
    Moez, Elham Khodayari
    Hajihosseini, Morteza
    Andrews, Jeffrey L.
    Dinu, Irina
    BMC BIOINFORMATICS, 2019, 20 (01)