Automatic Feature Selection for Atom-Centered Neural Network Potentials Using a Gradient Boosting Decision Algorithm

被引:0
|
作者
Li, Renzhe [1 ]
Wang, Jiaqi [1 ]
Singh, Akksay [1 ,2 ,3 ]
Li, Bai [1 ]
Song, Zichen [1 ,4 ]
Zhou, Chuan [1 ]
Li, Lei [1 ]
机构
[1] Southern Univ Sci & Technol, Dept Mat Sci & Engn, Shenzhen 518055, Peoples R China
[2] Univ Texas Austin, Dept Chem, Austin, TX 78712 USA
[3] Univ Texas Austin, Inst Computat Engn & Sci, Austin, TX 78712 USA
[4] City Univ Hong Kong, Dept Mat Sci & Engn, Kowloon, Hong Kong 999077, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
LIQUID-METAL; FORCE-FIELD; APPROXIMATION; DYNAMICS; PERFORMANCE; SIMULATION; MODEL;
D O I
10.1021/acs.jctc.4c01176
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Atom-centered neural network (ANN) potentials have shown high accuracy and computational efficiency in modeling atomic systems. A crucial step in developing reliable ANN potentials is the proper selection of atom-centered symmetry functions (ACSFs), also known as atomic features, to describe atomic environments. Inappropriate selection of ACSFs can lead to poor-quality ANN potentials. Here, we propose a gradient boosting decision tree (GBDT)-based framework for the automatic selection of optimal ACSFs. This framework takes uniformly distributed sets of ACSFs as input and evaluates their relative importance. The ACSFs with high average importance scores are selected and used to train an ANN potential. We applied this method to the Ge system, resulting in an ANN potential with root-mean-square errors (RMSE) of 10.2 meV/atom for energy and 84.8 meV/& Aring; for force predictions, utilizing only 18 ACSFs to achieve a balance between accuracy and computational efficiency. The framework is validated using the grid searching method, demonstrating that ACSFs selected with our framework are in the optimal region. Furthermore, we also compared our method with commonly used feature selection algorithms. The results show that our algorithm outperforms the others in terms of effectiveness and accuracy. This study highlights the significance of the ACSF parameter effect on the ANN performance and presents a promising method for automatic ACSF selection, facilitating the development of machine learning potentials.
引用
收藏
页码:10564 / 10573
页数:10
相关论文
共 41 条
  • [31] Automatic feature selection using gray-level co-occurrence matrix and detection of mustard plant disease using feed forward neural networks
    Sharma, Anita
    Sharma, Chirag
    Sharma, Vikrant
    Vats, Satvik
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2023, 44 (06): : 1127 - 1138
  • [32] A Novel Ranking Algorithm of Enhanced Images using a Convolutional Neural Network and a Saliency-based Patch Selection Scheme
    Chetouani, Aladine
    Qureshi, Muhammad Ali
    Deriche, Mohamed
    Beghdadi, Azeddine
    2019 ELEVENTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2019,
  • [33] A hybrid group decision support system for supplier selection using analytic hierarchy process, fuzzy set theory and neural network
    Kar, Arpan Kumar
    JOURNAL OF COMPUTATIONAL SCIENCE, 2015, 6 : 23 - 33
  • [34] Breast cancer tumor type recognition using graph feature selection technique and radial basis function neural network with optimal structure
    Zarbakhsh, Payam
    Addeh, Abdoljalil
    JOURNAL OF CANCER RESEARCH AND THERAPEUTICS, 2018, 14 (03) : 625 - 633
  • [35] Influences of Soil Bulk Density and Texture on Estimation of Surface Soil Moisture Using Spectral Feature Parameters and an Artificial Neural Network Algorithm
    Diao, Wanying
    Liu, Gang
    Zhang, Huimin
    Hu, Kelin
    Jin, Xiuliang
    AGRICULTURE-BASEL, 2021, 11 (08):
  • [36] A diagnosis system by U-net and deep neural network enabled with optimal feature selection for liver tumor detection using CT images
    Rela, Munipraveena
    Suryakari, Nagaraja Rao
    Patil, Ramana Reddy
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 3185 - 3227
  • [37] Classification of white blood cells using deep features obtained from Convolutional Neural Network models based on the combination of feature selection methods
    Togacar, Mesut
    Ergen, Burhan
    Comert, Zafer
    APPLIED SOFT COMPUTING, 2020, 97
  • [38] Reduction of Insolvency Risk and Total Costs in Banking Sector using Partners Selection Approach with Genetic Algorithm and Multilayer Perceptron Neural Network
    Azarbad, M.
    Shojaie, A. A.
    Abdi, F.
    Ghezavati, V. R.
    Khalili-Damghani, K.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2024, 37 (08): : 1667 - 1690
  • [39] Streamflow Predictions in Ungauged Basins Using Recurrent Neural Network and Decision Tree-Based Algorithm: Application to the Southern Region of the Korean Peninsula
    Won, Jeongeun
    Seo, Jiyu
    Lee, Jeonghoon
    Choi, Jeonghyeon
    Park, Yoonkyung
    Lee, Okjeong
    Kim, Sangdan
    WATER, 2023, 15 (13)
  • [40] Improving DNA-Binding Protein Prediction Using Three-Part Sequence-Order Feature Extraction and a Deep Neural Network Algorithm
    Hu, Jun
    Zeng, Wen-Wu
    Jia, Ning-Xin
    Arif, Muhammad
    Yu, Dong-Jun
    Zhang, Gui-Jun
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (03) : 1044 - 1057