Traditional Machine and Deep Learning for Predicting Toxicity Endpoints

被引:3
|
作者
Norinder, Ulf [1 ]
机构
[1] Stockholm Univ, Dept Comp & Syst Sci, S-16407 Kista, Sweden
来源
MOLECULES | 2023年 / 28卷 / 01期
关键词
CATMoS dataset; CDDD; BERT; conformal prediction; random forest; RDKit; LANGUAGE;
D O I
10.3390/molecules28010217
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Molecular structure property modeling is an increasingly important tool for predicting compounds with desired properties due to the expensive and resource-intensive nature and the problem of toxicity-related attrition in late phases during drug discovery and development. Lately, the interest for applying deep learning techniques has increased considerably. This investigation compares the traditional physico-chemical descriptor and machine learning-based approaches through autoencoder generated descriptors to two different descriptor-free, Simplified Molecular Input Line Entry System (SMILES) based, deep learning architectures of Bidirectional Encoder Representations from Transformers (BERT) type using the Mondrian aggregated conformal prediction method as overarching framework. The results show for the binary CATMoS non-toxic and very-toxic datasets that for the former, almost equally balanced, dataset all methods perform equally well while for the latter dataset, with an 11-fold difference between the two classes, the MolBERT model based on a large pre-trained network performs somewhat better compared to the rest with high efficiency for both classes (0.93-0.94) as well as high values for sensitivity, specificity and balanced accuracy (0.86-0.87). The descriptor-free, SMILES-based, deep learning BERT architectures seem capable of producing well-balanced predictive models with defined applicability domains. This work also demonstrates that the class imbalance problem is gracefully handled through the use of Mondrian conformal prediction without the use of over- and/or under-sampling, weighting of classes or cost-sensitive methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Deep Learning in Predicting Preterm Birth: A Comparative Study of Machine Learning Algorithms
    Zhang, Fangchao
    Tong, Lingling
    Shi, Chen
    Zuo, Rui
    Wang, Liwei
    Wang, Yan
    MATERNAL-FETAL MEDICINE, 2024, 6 (03) : 141 - 146
  • [42] Deep Learning in Predicting Preterm Birth: A Comparative Study of Machine Learning Algorithms
    Zhang Fangchao
    Tong Lingling
    Shi Chen
    Zuo Rui
    Wang Liwei
    Wang Yan
    母胎医学杂志(英文), 2024, 06 (03)
  • [43] A fusion framework of deep learning and machine learning for predicting sgRNA cleavage efficiency
    Liu, Yu
    Fan, Rui
    Yi, Jingkun
    Cui, Qinghua
    Cui, Chunmei
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 165
  • [44] Predicting Potato Crop Yield with Machine Learning and Deep Learning for Sustainable Agriculture
    El-Kenawy, El-Sayed M.
    Alhussan, Amel Ali
    Khodadadi, Nima
    Mirjalili, Seyedali
    Eid, Marwa M.
    POTATO RESEARCH, 2024, : 759 - 792
  • [45] Ensemble learning of deep learning and traditional machine learning approaches for skin lesion segmentation and classification
    Khan, Adil H.
    Iskandar, Dayang NurFatimah Awang
    Al-Asad, Jawad F.
    Mewada, Hiren
    Sherazi, Muhammad Abid
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (13):
  • [46] Is deep learning superior to traditional techniques in machine health monitoring applications
    Wang, W.
    Vos, K.
    Taylor, J.
    Jenkins, C.
    Bala, B.
    Whitehead, L.
    Peng, Z.
    AERONAUTICAL JOURNAL, 2023, 127 (1318): : 2105 - 2117
  • [47] Chinese Multimodal Emotion Recognition in Deep and Traditional Machine Learning Approaches
    Miao, Haotian
    Zhang, Yifei
    Li, Weipeng
    Zhang, Haoran
    Wang, Daling
    Feng, Shi
    2018 FIRST ASIAN CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII ASIA), 2018,
  • [48] Deep Learning Versus Traditional Machine Learning Methods for Aggregated Energy Demand Prediction
    Paterakis, Nikolaos G.
    Mocanu, Elena
    Gibescu, Madeleine
    Stappers, Bart
    van Alst, Walter
    2017 IEEE PES INNOVATIVE SMART GRID TECHNOLOGIES CONFERENCE EUROPE (ISGT-EUROPE), 2017,
  • [49] Comparative Study between Traditional Machine Learning and Deep Learning Approaches for Text Classification
    Kamath, Cannannore Nidhi
    Bukhari, Syed Saqib
    Dengel, Andreas
    PROCEEDINGS OF THE ACM SYMPOSIUM ON DOCUMENT ENGINEERING (DOCENG 2018), 2018,
  • [50] Speech emotion recognition for psychotherapy: an analysis of traditional machine learning and deep learning techniques
    Shah, Nidhi
    Sood, Kanika
    Arora, Jayraj
    2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 718 - 723