FL-QSAR: a federated learning-based QSAR prototype for collaborative drug discovery

被引:68
作者
Chen, Shaoqi [1 ,2 ]
Xue, Dongyu [1 ,2 ]
Chuai, Guohui [1 ,2 ]
Yang, Qiang [3 ,4 ]
Liu, Qi [1 ,2 ]
机构
[1] Tongji Univ, Translat Med Ctr Stem Cell Therapy, Shanghai 200092, Peoples R China
[2] Tongji Univ, Inst Regenerat Med, Shanghai East Hosp, Bioinformat Dept,Sch Life Sci & Technol, Shanghai 200092, Peoples R China
[3] WeBank, Dept AI, Shenzhen 518055, Peoples R China
[4] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Clear Water Bay, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1093/bioinformatics/btaa1006
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Quantitative structure-activity relationship (QSAR) analysis is commonly used in drug discovery. Collaborations among pharmaceutical institutions can lead to a better performance in QSAR prediction, however, intellectual property and related financial interests remain substantially hindering inter-institutional collaborations in QSAR modeling for drug discovery. Results: For the first time, we verified the feasibility of applying the horizontal federated learning (HFL), which is a recently developed collaborative and privacy-preserving learning framework to perform QSAR analysis. A prototype platform of federated-learning-based QSAR modeling for collaborative drug discovery, i.e. FL-QSAR, is presented accordingly. We first compared the HFL framework with a classic privacy-preserving computation framework, i.e. secure multiparty computation to indicate its difference from various perspective. Then we compared FL-QSAR with the public collaboration in terms of QSAR modeling. Our extensive experiments demonstrated that (i) collaboration by FL-QSAR outperforms a single client using only its private data, and (ii) collaboration by FL-QSAR achieves almost the same performance as that of collaboration via cleartext learning algorithms using all shared information. Taking together, our results indicate that FL-QSAR under the HFL framework provides an efficient solution to break the barriers between pharmaceutical institutions in QSAR modeling, therefore promote the development of collaborative and privacy-preserving drug discovery with extendable ability to other privacy-related biomedical areas.
引用
收藏
页码:5492 / 5498
页数:7
相关论文
共 25 条
[1]  
[Anonymous], 1982, S FDN COMP SCI
[2]  
Ben-Or M., 1988, P 20 ANN ACM S THEOR, P1, DOI [DOI 10.1145/62212.62213, 10.1145/62212.62213]
[3]  
Bogdanov D, 2008, LECT NOTES COMPUT SC, V5283, P192
[4]   Secure genome-wide association analysis using multiparty computation [J].
Cho, Hyunghoon ;
Wu, David J. ;
Berger, Bonnie .
NATURE BIOTECHNOLOGY, 2018, 36 (06) :547-+
[5]   MR fingerprinting Deep RecOnstruction NEtwork (DRONE) [J].
Cohen, Ouri ;
Zhu, Bo ;
Rosen, Matthew S. .
MAGNETIC RESONANCE IN MEDICINE, 2018, 80 (03) :885-894
[6]   Accurate and reproducible invasive breast cancer detection in whole-slide images: A Deep Learning approach for quantifying tumor extent [J].
Cruz-Roa, Angel ;
Gilmore, Hannah ;
Basavanhally, Ajay ;
Feldman, Michael ;
Ganesan, Shridar ;
Shih, Natalie N. C. ;
Tomaszewski, John ;
Gonzalez, Fabio A. ;
Madabhushi, Anant .
SCIENTIFIC REPORTS, 2017, 7
[7]  
de la Torre Lydia, 2018, A guide to the california consumer privacy act of 2018
[8]   Interpretable dimensionality reduction of single cell transcriptome data with deep generative models [J].
Ding, Jiarui ;
Condon, Anne ;
Shah, Sohrab P. .
NATURE COMMUNICATIONS, 2018, 9
[9]   Realizing private and practical pharmacological collaboration [J].
Hie, Brian ;
Cho, Hyunghoon ;
Berger, Bonnie .
SCIENCE, 2018, 362 (6412) :347-350
[10]  
Hitaj B, ARXIV PREPRINT ARXIV