Enhancing Model Learning and Interpretation using Multiple Molecular Graph Representations for Compound Property and Activity Prediction

被引:1
作者
Kengkanna, Apakorn [1 ]
Ohue, Masahito [1 ]
机构
[1] Tokyo Inst Technol, Sch Comp, Dept Comp Sci, Kanagawa, Japan
来源
2023 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, CIBCB | 2023年
关键词
drug discovery; machine learning; graph neural network; molecular graph representation; interpretation; attention mechanism; NETWORK;
D O I
10.1109/CIBCB56990.2023.10264879
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Graph neural networks (GNNs) demonstrate great performance in compound property and activity prediction due to their capability to efficiently learn complex molecular graph structures. However, two main limitations persist including compound representation and model interpretability. While atom-level molecular graph representations are commonly used because of their ability to capture natural topology, they may not fully express important substructures or functional groups which significantly influence molecular properties. Consequently, recent research proposes alternative representations employing reduction techniques to integrate higher-level information and leverages both representations for model learning. However, there is still a lack of study about different molecular graph representations on model learning and interpretation. Interpretability is also crucial for drug discovery as it can offer chemical insights and inspiration for optimization. Numerous studies attempt to include model interpretation to explain the rationale behind predictions, but most of them focus solely on individual prediction with little analysis of the interpretation on different molecular graph representations. This research introduces multiple molecular graph representations that incorporate higher-level information and investigates their effects on model learning and interpretation from diverse perspectives. Several experiments are conducted across a broad range of datasets and an attention mechanism is applied to identify significant features. The results indicate that combining atom graph representation with reduced molecular graph representation can yield promising model performance. Furthermore, the interpretation results can provide significant features and potential substructures consistently aligning with background knowledge. These multiple molecular graph representations and interpretation analysis can bolster model comprehension and facilitate relevant applications in drug discovery.
引用
收藏
页码:191 / 198
页数:8
相关论文
共 40 条
[1]   Optuna: A Next-generation Hyperparameter Optimization Framework [J].
Akiba, Takuya ;
Sano, Shotaro ;
Yanase, Toshihiko ;
Ohta, Takeru ;
Koyama, Masanori .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :2623-2631
[2]   Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI [J].
Barredo Arrieta, Alejandro ;
Diaz-Rodriguez, Natalia ;
Del Ser, Javier ;
Bennetot, Adrien ;
Tabik, Siham ;
Barbado, Alberto ;
Garcia, Salvador ;
Gil-Lopez, Sergio ;
Molina, Daniel ;
Benjamins, Richard ;
Chatila, Raja ;
Herrera, Francisco .
INFORMATION FUSION, 2020, 58 :82-115
[3]  
Birchall K, 2011, METHODS MOL BIOL, V672, P197, DOI 10.1007/978-1-60761-839-3_8
[4]   Deep Learning-Based Prediction of Drug-Induced Cardiotoxicity [J].
Cai, Chuipu ;
Guo, Pengfei ;
Zhou, Yadi ;
Zhou, Jingwei ;
Wang, Qi ;
Zhang, Fengxue ;
Fang, Jiansong ;
Cheng, Feixiong .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (03) :1073-1084
[5]   Machine Learning in Drug Discovery: A Review [J].
Dara, Suresh ;
Dhamercherla, Swetha ;
Jadav, Surender Singh ;
Babu, C. H. Madhu ;
Ahsan, Mohamed Jawed .
ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (03) :1947-1999
[6]   Molecular representations in AI-driven drug discovery: a review and practical guide [J].
David, Laurianne ;
Thakkar, Amol ;
Mercado, Rocio ;
Engkvist, Ola .
JOURNAL OF CHEMINFORMATICS, 2020, 12 (01)
[7]   On the Art of Compiling and Using 'Drug-Like' Chemical Fragment Spaces [J].
Degen, Joerg ;
Wegscheid-Gerlach, Christof ;
Zaliani, Andrea ;
Rarey, Matthias .
CHEMMEDCHEM, 2008, 3 (10) :1503-1507
[8]   Utilizing graph machine learning within drug discovery and development [J].
Gaudelet, Thomas ;
Day, Ben ;
Jamasb, Arian R. ;
Soman, Jyothish ;
Regep, Cristian ;
Liu, Gertrude ;
Hayter, Jeremy B. R. ;
Vickers, Richard ;
Roberts, Charles ;
Tang, Jian ;
Roblin, David ;
Blundell, Tom L. ;
Bronstein, Michael M. ;
Taylor-King, Jake P. .
BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
[9]   Benchmark Data Set for in Silico Prediction of Ames Mutagenicity [J].
Hansen, Katja ;
Mika, Sebastian ;
Schroeter, Timon ;
Sutter, Andreas ;
ter Laak, Antonius ;
Steger-Hartmann, Thomas ;
Heinrich, Nikolaus ;
Mueller, Klaus-Robert .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2009, 49 (09) :2077-2081
[10]   Interpretation of Structure-Activity Relationships in Real-World Drug Design Data Sets Using Explainable Artificial Intelligence [J].
Harren, Tobias ;
Matter, Hans ;
Hessler, Gerhard ;
Rarey, Matthias ;
Grebner, Christoph .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (03) :447-462