Machine learning for integrating data in biology and medicine: Principles, practice, and opportunities

被引:327
作者
Zitnik, Marinka [1 ]
Nguyen, Francis [2 ,3 ]
Wang, Bo [4 ]
Leskovec, Jure [1 ,5 ]
Goldenberg, Anna [6 ,7 ,8 ]
Hoffman, Michael M. [2 ,3 ,7 ,8 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[2] Univ Toronto, Dept Med Biophys, Toronto, ON, Canada
[3] Princess Margaret Canc Ctr, Toronto, ON, Canada
[4] Hikvis Res Inst, Santa Clara, CA USA
[5] Chan Zuckerberg Biohub, San Francisco, CA 94158 USA
[6] SickKids Res Inst, Genet & Genome Biol, Toronto, ON, Canada
[7] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
[8] Vector Inst, Toronto, ON, Canada
基金
加拿大自然科学与工程研究理事会; 美国国家科学基金会;
关键词
Computational biology; Personalized medicine; Systems biology; Heterogeneous data; Machine learning; DRUG-DRUG INTERACTION; GENOME-WIDE ASSOCIATION; DNA METHYLATION; DATA FUSION; TRANSCRIPTION FACTORS; CHROMATIN-STATE; CHIP-SEQ; PROBABILISTIC FUNCTIONS; MULTICELLULAR FUNCTION; HETEROGENEOUS NETWORK;
D O I
10.1016/j.inffus.2018.09.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
New technologies have enabled the investigation of biology and human health at an unprecedented scale and in multiple dimensions. These dimensions include a myriad of properties describing genome, epigenome, transcriptome, microbiome, phenotype, and lifestyle. No single data type, however, can capture the complexity of all the factors relevant to understanding a phenomenon such as a disease. Integrative methods that combine data from multiple technologies have thus emerged as critical statistical and computational approaches. The key challenge in developing such approaches is the identification of effective models to provide a comprehensive and relevant systems view. An ideal method can answer a biological or medical question, identifying important features and predicting outcomes, by harnessing heterogeneous data across several dimensions of biological variation. In this Review, we describe the principles of data integration and discuss current methods and available implementations. We provide examples of successful data integration in biology and medicine. Finally, we discuss current challenges in biomedical integrative methods and our perspective on the future development of the field.
引用
收藏
页码:71 / 91
页数:21
相关论文
共 50 条
  • [21] Application of Machine Learning in Translational Medicine: Current Status and Future Opportunities
    Terranova, Nadia
    Venkatakrishnan, Karthik
    Benincosa, Lisa J.
    AAPS JOURNAL, 2021, 23 (04)
  • [22] Machine Learning SNP Based Prediction for Precision Medicine
    Ho, Daniel Sik Wai
    Schierding, William
    Wake, Melissa
    Saffery, Richard
    O'Sullivan, Justin
    FRONTIERS IN GENETICS, 2019, 10
  • [23] Integrating Physical Principles with Machine Learning for Predicting Field-Enhanced Catalysis
    Zhao, Runze
    Li, Qiang
    Yang, Jiaqi
    Zhu, Cheng
    Che, Fanglin
    JACS AU, 2025, 5 (03): : 1121 - 1132
  • [24] The basics of data, big data, and machine learning in clinical practice
    Soriano-Valdez, David
    Pelaez-Ballestas, Ingris
    Manrique de Lara, Amaranta
    Gastelum-Strozzi, Alfonso
    CLINICAL RHEUMATOLOGY, 2021, 40 (01) : 11 - 23
  • [25] The basics of data, big data, and machine learning in clinical practice
    David Soriano-Valdez
    Ingris Pelaez-Ballestas
    Amaranta Manrique de Lara
    Alfonso Gastelum-Strozzi
    Clinical Rheumatology, 2021, 40 : 11 - 23
  • [26] Translational Medicine in the Era of Big Data and Machine Learning
    Weintraub, William S.
    Fahed, Akl C.
    Rumsfeld, John S.
    CIRCULATION RESEARCH, 2018, 123 (11) : 1202 - 1204
  • [27] Big Data and Machine Learning in Healthcare: Concepts, Technologies, and Opportunities
    Hiri, Mustafa
    Chrayah, Mohamed
    Ourdani, Nabil
    El Alamir, Taha
    EMERGING TRENDS IN INTELLIGENT SYSTEMS & NETWORK SECURITY, 2023, 147 : 123 - 135
  • [28] Integrating servopress acoustic monitoring with Machine Learning for enhanced predictive maintenance: opportunities and limitations
    Aradi, Attila
    Varga, Attila Karoly
    Takacs, Peter
    2024 25TH INTERNATIONAL CARPATHIAN CONTROL CONFERENCE, ICCC 2024, 2024,
  • [29] Machine Learning for Precision Psychiatry: Opportunities and Challenges
    Bzdok, Danilo
    Meyer-Lindenberg, Andreas
    BIOLOGICAL PSYCHIATRY-COGNITIVE NEUROSCIENCE AND NEUROIMAGING, 2018, 3 (03) : 223 - 230
  • [30] Transforming Agriculture: A Synergistic Approach Integrating Topology with Artificial Intelligence and Machine Learning for Sustainable and Data-Driven Practice
    Sriram, K. P.
    Sujatha, P. Kola
    Athinarayanan, S.
    Kanimozhi, G.
    Joel, M. Robinson
    2ND INTERNATIONAL CONFERENCE ON SUSTAINABLE COMPUTING AND SMART SYSTEMS, ICSCSS 2024, 2024, : 1350 - 1354