Automatic Feature Selection for Atom-Centered Neural Network Potentials Using a Gradient Boosting Decision Algorithm

被引:0
|
作者
Li, Renzhe [1 ]
Wang, Jiaqi [1 ]
Singh, Akksay [1 ,2 ,3 ]
Li, Bai [1 ]
Song, Zichen [1 ,4 ]
Zhou, Chuan [1 ]
Li, Lei [1 ]
机构
[1] Southern Univ Sci & Technol, Dept Mat Sci & Engn, Shenzhen 518055, Peoples R China
[2] Univ Texas Austin, Dept Chem, Austin, TX 78712 USA
[3] Univ Texas Austin, Inst Computat Engn & Sci, Austin, TX 78712 USA
[4] City Univ Hong Kong, Dept Mat Sci & Engn, Kowloon, Hong Kong 999077, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
LIQUID-METAL; FORCE-FIELD; APPROXIMATION; DYNAMICS; PERFORMANCE; SIMULATION; MODEL;
D O I
10.1021/acs.jctc.4c01176
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Atom-centered neural network (ANN) potentials have shown high accuracy and computational efficiency in modeling atomic systems. A crucial step in developing reliable ANN potentials is the proper selection of atom-centered symmetry functions (ACSFs), also known as atomic features, to describe atomic environments. Inappropriate selection of ACSFs can lead to poor-quality ANN potentials. Here, we propose a gradient boosting decision tree (GBDT)-based framework for the automatic selection of optimal ACSFs. This framework takes uniformly distributed sets of ACSFs as input and evaluates their relative importance. The ACSFs with high average importance scores are selected and used to train an ANN potential. We applied this method to the Ge system, resulting in an ANN potential with root-mean-square errors (RMSE) of 10.2 meV/atom for energy and 84.8 meV/& Aring; for force predictions, utilizing only 18 ACSFs to achieve a balance between accuracy and computational efficiency. The framework is validated using the grid searching method, demonstrating that ACSFs selected with our framework are in the optimal region. Furthermore, we also compared our method with commonly used feature selection algorithms. The results show that our algorithm outperforms the others in terms of effectiveness and accuracy. This study highlights the significance of the ACSF parameter effect on the ANN performance and presents a promising method for automatic ACSF selection, facilitating the development of machine learning potentials.
引用
收藏
页码:10564 / 10573
页数:10
相关论文
共 41 条
  • [21] Computer vision-based classification of concrete spall severity using metaheuristic-optimized Extreme Gradient Boosting Machine and Deep Convolutional Neural Network
    Nguyen, Hieu
    Hoang, Nhat-Duc
    AUTOMATION IN CONSTRUCTION, 2022, 140
  • [22] Wind power forecast using wavelet neural network trained by improved Clonal selection algorithm
    Chitsaz, Hamed
    Amjady, Nima
    Zareipour, Hamidreza
    ENERGY CONVERSION AND MANAGEMENT, 2015, 89 : 588 - 598
  • [23] Induction motor bearing fault classification using deep neural network with particle swarm optimization-extreme gradient boosting
    Lee, Chun-Yao
    Maceren, Edu Daryl C.
    IET ELECTRIC POWER APPLICATIONS, 2024, 18 (03) : 297 - 311
  • [24] Minimization of Surface Deflection in Rectangular Embossing Using Automatic Training of Artificial Neural Network and Genetic Algorithm
    Cho, Sungmin
    Chung, Wanjin
    INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2019, 20 (01) : 57 - 66
  • [25] Two-stage short-term wind power probabilistic prediction using natural gradient boosting combined with neural network
    Zhang, Siyi
    Liu, Mingbo
    Xie, Min
    Lin, Shunjiang
    APPLIED SOFT COMPUTING, 2024, 159
  • [26] Prediction of the performance and exhaust emissions of a compression ignition engine using a wavelet neural network with a stochastic gradient algorithm
    Molkdaragh, R. Rahimi
    Jafarmadar, S.
    Khalilaria, Sh
    Saraee, H. Soukht
    ENERGY, 2018, 142 : 1128 - 1138
  • [27] Performance assessment of decision-making units using an adaptive neural network algorithm: one period case
    Mona Anvari
    Mohammad Saidi Mehrabad
    Ali Azadeh
    Morteza Saberi
    The International Journal of Advanced Manufacturing Technology, 2010, 46 : 1059 - 1069
  • [28] Performance assessment of decision-making units using an adaptive neural network algorithm: one period case
    Anvari, Mona
    Mehrabad, Mohammad Saidi
    Azadeh, Ali
    Saberi, Morteza
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2010, 46 (9-12): : 1059 - 1069
  • [29] Wind energy system fault classification and detection using deep convolutional neural network and particle swarm optimization-extreme gradient boosting
    Lee, Chun-Yao
    Maceren, Edu Daryl C.
    IET ENERGY SYSTEMS INTEGRATION, 2024, 6 (04) : 479 - 497
  • [30] Modified genetic algorithm-based feature selection combined with pre-trained deep neural network for demand forecasting in outpatient department
    Jiang, Shancheng
    Chin, Kwai-Sang
    Wang, Long
    Qu, Gang
    Tsui, Kwok L.
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 82 : 216 - 230