Bayesian tensor factorization-drive breast cancer subtyping by integrating multi-omics data

被引:16
|
作者
Liu, Qian [1 ,2 ]
Cheng, Bowen [3 ]
Jin, Yongwon [1 ]
Hu, Pingzhao [1 ,2 ,3 ,4 ]
机构
[1] Univ Manitoba, Dept Biochem & Med Genet, Room 308,Basic Med Sci Bldg,745 Bannatyne Ave, Winnipeg, MB R3E 0J9, Canada
[2] Univ Manitoba, Dept Comp Sci, Winnipeg, MB, Canada
[3] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
[4] CancerCare Manitoba Res Inst, Winnipeg, MB, Canada
关键词
Breast cancer subtyping; Multi-omics data; Bayesian tensor factorization; Consensus clustering; Survival analysis; GENE-EXPRESSION; CLASS DISCOVERY; PROGNOSIS; CLASSIFICATION; IDENTIFICATION; VALIDATION; BIOMARKERS; NETWORK; MODEL; RANK;
D O I
10.1016/j.jbi.2021.103958
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Breast cancer is a highly heterogeneous disease. Subtyping the disease and identifying the genomic features driving these subtypes are critical for precision oncology for breast cancer. This study focuses on developing a new computational approach for breast cancer subtyping. We proposed to use Bayesian tensor factorization (BTF) to integrate multi-omics data of breast cancer, which include expression profiles of RNA-sequencing, copy number variation, and DNA methylation measured on 762 breast cancer patients from The Cancer Genome Atlas. We applied a consensus clustering approach to identify breast cancer subtypes using the factorized latent features by BTF. Subtype-specific survival patterns of the breast cancer patients were evaluated using Kaplan-Meier (KM) estimators. The proposed approach was compared with other state-of-the-art approaches for cancer subtyping. The BTF-subtyping analysis identified 17 optimized latent components, which were used to reveal six major breast cancer subtypes. Out of all different approaches, only the proposed approach showed distinct survival patterns (p < 0.05). Statistical tests also showed that the identified clusters have statistically significant distributions. Our results showed that the proposed approach is a promising strategy to efficiently use publicly available multi-omics data to identify breast cancer subtypes.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Integrating Multi-Omics for Uncovering the Architecture of Cross-Talking Pathways in Breast Cancer
    Wang, Li
    Xiao, Yun
    Ping, Yanyan
    Li, Jing
    Zhao, Hongying
    Li, Feng
    Hu, Jing
    Zhang, Hongyi
    Deng, Yulan
    Tian, Jiawei
    Li, Xia
    PLOS ONE, 2014, 9 (08):
  • [32] Multi-omics data integration for hepatocellular carcinoma subtyping with multi-kernel learning
    Wang, Jiaying
    Miao, Yuting
    Li, Lingmei
    Wu, Yongqing
    Ren, Yan
    Cui, Yuehua
    Cao, Hongyan
    FRONTIERS IN GENETICS, 2022, 13
  • [33] Multi-view multi-level contrastive graph convolutional network for cancer subtyping on multi-omics data
    Yang, Bo
    Cui, Chenxi
    Wang, Meng
    Ji, Hong
    Gao, Feiyue
    BRIEFINGS IN BIOINFORMATICS, 2025, 26 (01)
  • [34] Topological integration of RPPA proteomic data with multi-omics data for survival prediction in breast cancer via pathway activity inference
    Kim, Tae Rim
    Jeong, Hyun-Hwan
    Sohn, Kyung-Ah
    BMC MEDICAL GENOMICS, 2019, 12 (Suppl 5)
  • [35] PIntMF: Penalized Integrative Matrix Factorization method for multi-omics data
    Pierre-Jean, Morgane
    Mauger, Florence
    Deleuze, Jean-Francois
    Le Floch, Edith
    BIOINFORMATICS, 2022, 38 (04) : 900 - 907
  • [36] HONMF: integration analysis of multi-omics microbiome data via matrix factorization and hypergraph
    Ma, Yuanyuan
    Liu, Lifang
    Ma, Yingjun
    Zhang, Song
    BIOINFORMATICS, 2023, 39 (06)
  • [37] A Unified Bayesian Framework for Bi-overlapping-Clustering Multi-omics Data via Sparse Matrix Factorization
    Zhou, Fangting
    He, Kejun
    Cai, James J.
    Davidson, Laurie A.
    Chapkin, Robert S.
    Ni, Yang
    STATISTICS IN BIOSCIENCES, 2023, 15 (03) : 669 - 691
  • [38] Integrating multi-omics data through deep learning for accurate cancer prognosis prediction
    Chai, Hua
    Zhou, Xiang
    Zhang, Zhongyue
    Rao, Jiahua
    Zhao, Huiying
    Yang, Yuedong
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 134
  • [39] NESM: a network embedding method for tumor stratification by integrating multi-omics data
    Li, Feng
    Sun, Zhensheng
    Liu, Jin-Xing
    Shang, Junliang
    Dai, Lingyun
    Liu, Xikui
    Li, Yan
    G3-GENES GENOMES GENETICS, 2022, 12 (11):
  • [40] Classifying Breast Cancer Subtypes Using Deep Neural Networks Based on Multi-Omics Data
    Lin, Yuqi
    Zhang, Wen
    Cao, Huanshen
    Li, Gaoyang
    Du, Wei
    GENES, 2020, 11 (08) : 1 - 18