Data-augmented machine learning scoring functions for virtual screening of YTHDF1 m6A reader protein

被引:0
|
作者
Junaid, Muhammad [1 ,2 ]
Wang, Bo [1 ]
Li, Wenjin [1 ]
机构
[1] Institute for Advanced Study, Shenzhen University, Shenzhen,518060, China
[2] College of Physics and Optoelectronics Engineering, Shenzhen University, Shenzhen,518060, China
关键词
Adversarial machine learning;
D O I
10.1016/j.compbiomed.2024.109268
中图分类号
学科分类号
摘要
Machine learning is rapidly advancing the drug discovery process, significantly enhancing speed and efficiency. Innovation in computer-aided drug design is primarily driven by structure- and ligand-based approaches. When the number of known inhibitors for a target is limited, data augmentation strategies are often preferred to enhance model performance. In this study, we developed predictive machine learning models for structure-based drug discovery leveraging multiple traditional machine learning algorithms trained with target and ligand dynamics-aware datasets. To illustrate our approach, we present a composite model that combines classification and regression to predict YTHDF1 inhibitors, utilizing PLEC features. YTHDF1, a key m6A reader protein involved in mRNA translation, is implicated in various cancers, making it a promising therapeutic target. Traditional structure-based virtual screening (SBVS) using generic scoring functions has struggled to identify potent YTHDF1 inhibitors due to the protein's unique binding characteristics. To overcome this, we developed YTHDF1-specific machine learning scoring functions (MLSFs) to enhance SBVS efficacy. We employed various data augmentation techniques to generate a comprehensive dataset, incorporating multiple conformations of ligands and the YTHDF1 protein. We have trained 64 YTHDF1-specific MLSFs using four machine learning algorithms and evaluated them on ten test sets, focusing on their predictive and ranking power. Our results demonstrate that the artificial neural network with protein-ligand extended connectivity fingerprints (ANN-PLEC) outperforms other MLSFs, consistently achieving high area under the precision-recall curve (PR-AUC) of 0.87. This method shows promise for targets with limited quantities of active molecules, providing a viable path forward for drug discovery research. The ANN-PLEC scoring function is made freely available on GitHub for other researchers to access and utilize https://github.com/JuniML/SBVS-YTHDF1/. © 2024
引用
收藏
相关论文
共 50 条
  • [1] The roles and mechanisms of the m6A reader protein YTHDF1 in tumor biology and human diseases
    Chen, Zuyao
    Zhong, Xiaolin
    Xia, Min
    Zhong, Jing
    MOLECULAR THERAPY-NUCLEIC ACIDS, 2021, 26 : 1270 - 1279
  • [2] Clinical significance of m6A reader YTHDF1 expression in colorectal cancer
    Nishizawa, Yujiro
    Konno, Masamitsu
    Asai, Ayumu
    Koseki, Jun
    Nishida, Naohiro
    Hata, Taishi
    Matsuda, Chu
    Mizushima, Tsunekazu
    Satoh, Taroh
    Doki, Yuichiro
    Mori, Masaki
    Ishii, Hideshi
    CANCER SCIENCE, 2018, 109 : 1250 - 1250
  • [3] m6A reader YTHDF1 is required for efficient intestinal regeneration and tumorigenesis
    Bing, H.
    Yan, S.
    Liu, K.
    Xiang, J.
    Li, T.
    Zhang, L.
    Gao, X.
    MOLECULAR BIOLOGY OF THE CELL, 2018, 29 (26)
  • [4] The m6A epitranscriptomic reader YTHDF1 is indispensable for minimizing ischemic brain damage
    Vemuganti, R.
    Chokkalla, A.
    Mehta, S.
    JOURNAL OF CEREBRAL BLOOD FLOW AND METABOLISM, 2022, 42 (1_SUPPL): : 8 - 8
  • [5] The RNA m6A Reader YTHDF1 Is Required for Acute Myeloid Leukemia Progression
    Hong, Yun-Guang
    Yang, Zhigang
    Chen, Yan
    Liu, Tian
    Zheng, Yuyuan
    Zhou, Chun
    Wu, Guo-Cai
    Chen, Yinhui
    Xia, Juan
    Wen, Ruiting
    Liu, Wenxin
    Zhao, Yi
    Chen, Jin
    Gao, Xiangwei
    Chen, Zhanghui
    CANCER RESEARCH, 2023, 83 (06) : 845 - 860
  • [6] m6A reader YTHDF1 promotes cardiac fibrosis by enhancing AXL translation
    Wu, Han
    Jiang, Weitao
    Pang, Ping
    Si, Wei
    Kong, Xue
    Zhang, Xinyue
    Xiong, Yuting
    Wang, Chunlei
    Zhang, Feng
    Song, Jinglun
    Yang, Yang
    Zeng, Linghua
    Liu, Kuiwu
    Jia, Yingqiong
    Wang, Zhuo
    Ju, Jiaming
    Diao, Hongtao
    Bian, Yu
    Yang, Baofeng
    FRONTIERS OF MEDICINE, 2024, 18 (03) : 499 - 515
  • [7] Epigenetic m6A RNA Modification Reader YTHDF1 in Merkel Cell Carcinoma
    Jaguri, Aayami
    Steinhoff, Martin
    Ahmad, Aamir
    CANCERS, 2023, 15 (16)
  • [8] m6A facilitates hippocampus-dependent learning and memory through YTHDF1
    Hailing Shi
    Xuliang Zhang
    Yi-Lan Weng
    Zongyang Lu
    Yajing Liu
    Zhike Lu
    Jianan Li
    Piliang Hao
    Yu Zhang
    Feng Zhang
    You Wu
    Jary Y. Delgado
    Yijing Su
    Meera J. Patel
    Xiaohua Cao
    Bin Shen
    Xingxu Huang
    Guo-li Ming
    Xiaoxi Zhuang
    Hongjun Song
    Chuan He
    Tao Zhou
    Nature, 2018, 563 : 249 - 253
  • [9] m6A facilitates hippocampus-dependent learning and memory through YTHDF1
    Shi, Hailing
    Zhang, Xuliang
    Weng, Yi-Lan
    Lu, Zongyang
    Liu, Yajing
    Lu, Zhike
    Li, Jianan
    Hao, Piliang
    Zhang, Yu
    Zhang, Feng
    Wu, You
    Delgado, Jary Y.
    Su, Yijing
    Patel, Meera J.
    Cao, Xiaohua
    Shen, Bin
    Huang, Xingxu
    Ming, Guo-li
    Zhuang, Xiaoxi
    Song, Hongjun
    He, Chuan
    Zhou, Tao
    NATURE, 2018, 563 (7730) : 249 - +
  • [10] m6A facilitates hippocampus-dependent learning and memory through Ythdf1
    Shi, Hailing
    Zhang, Xuliang
    Lu, Zongyang
    Liu, Yajing
    Weng, Yi-Lan
    Lu, Zhike
    Li, Jianan
    Hao, Piliang
    Zhang, Yu
    Delgado, Jary
    Patel, Meera
    Cao, Xiaohua
    Huang, Xingxu
    Su, Yijing
    Ming, Guo-Li
    Zhuang, Xiaoxi
    Song, Hongjun
    He, Chuan
    Zhou, Tao
    FASEB JOURNAL, 2018, 32 (01):