Development of Ensemble Learning Method Considering Applicability Domains

被引:0
作者
Sato, Keigo [1 ]
Kaneko, Hiromasa [1 ]
机构
[1] Meiji Univ, Sch Sci & Technol, Dept Appl Chem, Tokyo, Japan
关键词
Ensemble learning; Regression; Applicability domain; QSAR; QSPR; MODELS; PREDICTION; REGRESSION;
D O I
10.2477/jccj.2019-0010
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In quantitative structure-activity relationship and quantitative structure-physical relationship quantitatively, regression models are constructed activities and properties y, and molecular descriptors x for compounds. To improve predictive performance of models, multiple sub-models are constructed and a final y-value is predicted by integrating y-values predicted with sub-models in ensemble learning. Although it was confirmed that predictive performance improved by considering the applicability domain (AD) of each sub-model and by using only the sub-models inside AD, ADs cannot be compared between sub-datasets with different x. It was impossible to predict a y-value by selecting and weighting sub-models for a new sample. In this study, we focused on the similarity-weighted root-mean-square distance (wRMSD), which is an index of AD, and developed wRMSD-based AD considering ensemble learning (WEL), an ensemble learning method based on wRMSD. Since wRMSD is represented as the scale of y, AD can be compared between sub-models with different x, and thus, it is possible to predict a y-value, weighting sub-models having low wRMSD-values, which means high reliability of prediction, for a new sample. It was confirmed that AD was enlarged and predictive performance improved by using WEL compared to the conventional ensemble learning method through data analysis using three datasets of compounds for which water solubility, toxicity and pharmacological activity were measured. Python code for WEL is available at https://github.com/hkaneko1985/wel.
引用
收藏
页码:187 / 193
页数:7
相关论文
共 50 条
  • [31] Monitoring Soil Salinity Classes through Remote Sensing-Based Ensemble Learning Concept: Considering Scale Effects
    Chen, Huifang
    Wu, Jingwei
    Xu, Chi
    REMOTE SENSING, 2024, 16 (04)
  • [32] Prediction of pipe performance with stacking ensemble learning based approaches
    Shi, Fang
    Liu, Yihao
    Liu, Zheng
    Li, Eric
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (06) : 3845 - 3855
  • [33] A MBCRF Algorithm Based on Ensemble Learning for Building Demand Response Considering the Thermal Comfort
    Li, Yuchun
    Han, Yinghua
    Wang, Jinkuan
    Zhao, Qiang
    ENERGIES, 2018, 11 (12)
  • [34] A Survey on ensemble learning under the era of deep learning
    Yang, Yongquan
    Lv, Haijun
    Chen, Ning
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (06) : 5545 - 5589
  • [35] Application of ensemble learning to genomic selection in chinese simmental beef cattle
    Liang, Mang
    Miao, Jian
    Wang, Xiaoqiao
    Chang, Tianpeng
    An, Bingxing
    Duan, Xinghai
    Xu, Lingyang
    Gao, Xue
    Zhang, Lupei
    Li, Junya
    Gao, Huijiang
    JOURNAL OF ANIMAL BREEDING AND GENETICS, 2021, 138 (03) : 291 - 299
  • [36] Novel Ensemble Forecasting of Streamflow Using Locally Weighted Learning Algorithm
    Adnan, Rana Muhammad
    Jaafari, Abolfazl
    Mohanavelu, Aadhityaa
    Kisi, Ozgur
    Elbeltagi, Ahmed
    SUSTAINABILITY, 2021, 13 (11)
  • [37] An empirical solution for over-pruning with a novel ensemble-learning method for fMRI decoding
    Hirose, Satoshi
    Nambu, Isao
    Naito, Eiichi
    JOURNAL OF NEUROSCIENCE METHODS, 2015, 239 : 238 - 245
  • [38] A novel ensemble deep learning model with dynamic error correction and multi-objective ensemble pruning for time series forecasting
    Zhang, Shuai
    Chen, Yong
    Zhang, Wenyu
    Feng, Ruijun
    INFORMATION SCIENCES, 2021, 544 : 427 - 445
  • [39] A Prediction Method of Cable Crosstalk in Electronic Systems with Ensemble Learning
    Xu Yang
    Dejian Zhou
    Wei Song
    Yulai She
    Xiaoyong Chen
    Arabian Journal for Science and Engineering, 2022, 47 : 2987 - 3000
  • [40] A stacked ensemble learning method for customer lifetime value prediction
    Asadi, Nader
    Kazerooni, Mehrdad
    KYBERNETES, 2024, 53 (07) : 2342 - 2360