Bayesian tensor factorization-drive breast cancer subtyping by integrating multi-omics data

被引:16
|
作者
Liu, Qian [1 ,2 ]
Cheng, Bowen [3 ]
Jin, Yongwon [1 ]
Hu, Pingzhao [1 ,2 ,3 ,4 ]
机构
[1] Univ Manitoba, Dept Biochem & Med Genet, Room 308,Basic Med Sci Bldg,745 Bannatyne Ave, Winnipeg, MB R3E 0J9, Canada
[2] Univ Manitoba, Dept Comp Sci, Winnipeg, MB, Canada
[3] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
[4] CancerCare Manitoba Res Inst, Winnipeg, MB, Canada
关键词
Breast cancer subtyping; Multi-omics data; Bayesian tensor factorization; Consensus clustering; Survival analysis; GENE-EXPRESSION; CLASS DISCOVERY; PROGNOSIS; CLASSIFICATION; IDENTIFICATION; VALIDATION; BIOMARKERS; NETWORK; MODEL; RANK;
D O I
10.1016/j.jbi.2021.103958
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Breast cancer is a highly heterogeneous disease. Subtyping the disease and identifying the genomic features driving these subtypes are critical for precision oncology for breast cancer. This study focuses on developing a new computational approach for breast cancer subtyping. We proposed to use Bayesian tensor factorization (BTF) to integrate multi-omics data of breast cancer, which include expression profiles of RNA-sequencing, copy number variation, and DNA methylation measured on 762 breast cancer patients from The Cancer Genome Atlas. We applied a consensus clustering approach to identify breast cancer subtypes using the factorized latent features by BTF. Subtype-specific survival patterns of the breast cancer patients were evaluated using Kaplan-Meier (KM) estimators. The proposed approach was compared with other state-of-the-art approaches for cancer subtyping. The BTF-subtyping analysis identified 17 optimized latent components, which were used to reveal six major breast cancer subtypes. Out of all different approaches, only the proposed approach showed distinct survival patterns (p < 0.05). Statistical tests also showed that the identified clusters have statistically significant distributions. Our results showed that the proposed approach is a promising strategy to efficiently use publicly available multi-omics data to identify breast cancer subtypes.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] A Cascade Deep Forest Model for Breast Cancer Subtype Classification Using Multi-Omics Data
    El-Nabawy, Ala'a
    Belal, Nahla A.
    El-Bendary, Nashwa
    MATHEMATICS, 2021, 9 (13)
  • [42] Comprehensive Analysis of Metabolic Genes in Breast Cancer Based on Multi-Omics Data
    Hua, Yu
    Gao, Lihong
    Li, Xiaobo
    PATHOLOGY & ONCOLOGY RESEARCH, 2021, 27
  • [43] Multi-Omics Analysis Detects Novel Prognostic Subgroups of Breast Cancer
    Quang-Huy Nguyen
    Hung Nguyen
    Tin Nguyen
    Duc-Hau Le
    FRONTIERS IN GENETICS, 2020, 11
  • [44] Identifying Cancer Driver lncRNAs Bridged by Functional Effectors through Integrating Multi-omics Data in Human Cancers
    Zhang, Yong
    Liao, Gaoming
    Bai, Jing
    Zhang, Xinxin
    Xu, Liwen
    Deng, Chunyu
    Yan, Min
    Xie, Aimin
    Luo, Tao
    Long, Zhilin
    Xiao, Yun
    Li, Xia
    MOLECULAR THERAPY-NUCLEIC ACIDS, 2019, 17 : 362 - 373
  • [45] Multi -view spectral clustering with latent representation learning for applications on multi-omics cancer subtyping
    Ge, Shuguang
    Liu, Jian
    Cheng, Yuhu
    Meng, Xiaojing
    Wang, Xuesong
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (01)
  • [46] A deep learning-based framework for predicting survival-associated groups in colon cancer by integrating multi-omics and clinical data
    Salimy, Siamak
    Lanjanian, Hossein
    Abbasi, Karim
    Salimi, Mahdieh
    Najafi, Ali
    Tapak, Leili
    Masoudi-Nejad, Ali
    HELIYON, 2023, 9 (07)
  • [47] Recursive integration of synergised graph representations of multi-omics data for cancer subtypes identification
    Madhumita
    Dwivedi, Archit
    Paul, Sushmita
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [48] An extension of latent unknown clustering integrating multi-omics data (LUCID) incorporating incomplete omics data
    Zhao, Yinqi
    Jia, Qiran
    Goodrich, Jesse
    Darst, Burcu
    Conti, David, V
    BIOINFORMATICS ADVANCES, 2024, 4 (01):
  • [49] RETRACTED: Lung Cancer Stage Prediction Using Multi-Omics Data (Retracted Article)
    Li, Wei
    Liu, Binchun
    Wang, Weiqian
    Sun, Can
    Che, Jianpeng
    Yuan, Xuelian
    Zhai, Chunbo
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022
  • [50] Integrating multi-omics summary data using a Mendelian randomization framework
    Jin, Chong
    Lee, Brian
    Shen, Li
    Long, Qi
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (06)