Bayesian Hybrid Matrix Factorisation for Data Integration

被引:0
|
作者
Brouwer, Thomas [1 ]
Lio, Pietro [1 ]
机构
[1] Univ Cambridge, Cambridge, England
基金
英国工程与自然科学研究理事会;
关键词
CANCER; SENSITIVITY; DISCOVERY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a novel Bayesian hybrid matrix factorisation model (HMF) for data integration, based on combining multiple matrix factorisation methods, that can be used for in- and out-of-matrix prediction of missing values. The model is very general and can be used to integrate many datasets across different entity types, including repeated experiments, similarity matrices, and very sparse datasets. We apply our method on two biological applications, and extensively compare it to state-of-the-art machine learning and matrix factorisation models. For in-matrix predictions on drug sensitivity datasets we obtain consistently better performances than existing methods. This is especially the case when we increase the sparsity of the datasets. Furthermore, we perform out-of-matrix predictions on methylation and gene expression datasets, and obtain the best results on two of the three datasets, especially when the predictivity of datasets is high.
引用
收藏
页码:557 / 566
页数:10
相关论文
共 50 条
  • [1] Bayesian Boolean Matrix Factorisation
    Rukat, Tammo
    Holmes, Chris C.
    Titsias, Michalis K.
    Yau, Christopher
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [2] Comparative Study of Inference Methods for Bayesian Nonnegative Matrix Factorisation
    Brouwer, Thomas
    Frellsen, Jes
    Lio, Pietro
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT I, 2017, 10534 : 513 - 529
  • [3] Bayesian Joint Matrix Decomposition for Data Integration with Heterogeneous Noise
    Zhang, Chihao
    Zhang, Shihua
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (04) : 1184 - 1196
  • [4] Matrix factorisation methods applied in microarray data analysis
    Kossenkov, Andrew V.
    Ochs, Michael F.
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2010, 4 (01) : 72 - 90
  • [5] Bayesian extensions to non-negative matrix factorisation for audio signal modelling
    Virtanen, Tuomas
    Cemgil, A. Taylan
    Godsill, Simon
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 1825 - 1828
  • [6] Multiresolution matrix factorisation as a compression method for smart meter data
    Ahmad, Arfah
    Datta, Amitava
    Sreeram, Victor
    Mishra, Yateendra
    JOURNAL OF ENGINEERING-JOE, 2020, 2020 (08): : 737 - 744
  • [7] Rank Matrix Factorisation
    Thanh Le Van
    van Leeuwen, Matthijs
    Nijssen, Siegfried
    De Raedt, Luc
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART I, 2015, 9077 : 734 - 746
  • [8] Performance Prediction via Bayesian Matrix Factorisation for Multilingual Natural Language Processing Tasks
    Schram, Viktoria
    Beck, Daniel
    Cohn, Trevor
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1790 - 1801
  • [9] Data Integration in Bayesian Phylogenetics
    Hassler, Gabriel W.
    Magee, Andrew F.
    Zhang, Zhenyu
    Baele, Guy
    Lemey, Philippe
    Ji, Xiang
    Fourment, Mathieu
    Suchard, Marc A.
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, 2023, 10 : 353 - 377
  • [10] A hybrid collaborative filtering recommendation algorithm: integrating content information and matrix factorisation
    Wang, Jing
    Sangaiah, Arun Kumar
    Liu, Wei
    INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2020, 11 (03) : 367 - 377