共 50 条
ColdDTA: Utilizing data augmentation and attention-based feature fusion for drug-target binding affinity prediction
被引:12
|作者:
Fang, Kejie
[1
]
Zhang, Yiming
[2
]
Du, Shiyu
[2
,3
,4
]
He, Jian
[5
]
机构:
[1] Ningbo Univ, Fac Elect Engn & Comp Sci, Ningbo 315211, Peoples R China
[2] Chinese Acad Sci, Engn Lab Adv Energy Mat, Ningbo Inst Mat Technol & Engn, Ningbo 315201, Peoples R China
[3] China Univ Petr East China, Sch Mat Sci & Engn, Qingdao 266580, Peoples R China
[4] China Univ Petr East China, Sch Comp Sci, Qingdao 266580, Peoples R China
[5] Shanghai Jiao Tong Univ, Ctr Single Cell Omics, Sch Publ Hlth, State Key Lab Syst Med Canc,Sch Med, Shanghai 200025, Peoples R China
关键词:
Drug-target affinity;
Data augmentation;
Feature fusion;
Attention mechanism;
Graph neural networks;
D O I:
10.1016/j.compbiomed.2023.107372
中图分类号:
Q [生物科学];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
Accurate prediction of drug-target affinity (DTA) plays a crucial role in drug discovery and development. Recently, deep learning methods have shown excellent predictive performance on randomly split public datasets. However, verifications are still required on this splitting method to reflect real-world problems in practical applications. And in a cold-start experimental setup, where drugs or proteins in the test set do not appear in the training set, the performance of deep learning models often significantly decreases. This indicates that improving the generalization ability of the models remains a challenge. To this end, in this study, we propose ColdDTA: using data augmentation and attention-based feature fusion to improve the generalization ability of predicting drug-target binding affinity. Specifically, ColdDTA generates new drug-target pairs by removing subgraphs of drugs. The attention-based feature fusion module is also used to better capture the drug-target interactions. We conduct cold-start experiments on three benchmark datasets, and the consistency index (CI) and mean square error (MSE) results on the Davis and KIBA datasets show that ColdDTA outperforms the five state-of-the-art baseline methods. Meanwhile, the results of area under the receiver operating characteristic (ROC-AUC) on the BindingDB dataset show that ColdDTA also has better performance on the classification task. Furthermore, visualizing the model weights allows for interpretable insights. Overall, ColdDTA can better solve the realistic DTA prediction problem. The code has been available to the public.
引用
收藏
页数:10
相关论文