Comparison of Machine Learning Methods towards Developing Interpretable Polyamide Property Prediction

Cited by: 12
Authors
Lee, Franklin Langlang [1]
Park, Jaehong [2]
Goyal, Sushmit [1]
Qaroush, Yousef [1]
Wang, Shihu [1]
Yoon, Hong [3]
Rammohan, Aravind [1]
Shim, Youngseon [2]
Affiliations
[1] Corning Inc, Sci & Technol Div, Corning, NY 14831 USA
[2] Samsung Elect Co Ltd, Data & Informat Technol DIT Ctr, CSE Team, Hwaseong 18448, South Korea
[3] Corning Precis Mat Co Ltd, Corning Technol Ctr Korea, 212 Tangjeong Ro, Asan 31454, South Korea
Keywords
machine learning; polyamide; QSPR; hierarchical structure; regression; design
DOI
10.3390/polym13213653
Chinese Library Classification number
O63 [Polymer chemistry (high polymers)]
Discipline classification codes
070305; 080501; 081704
Abstract
Polyamides are often used for their superior thermal, mechanical, and chemical properties. They form a diverse set of materials whose properties vary widely from linear to aromatic compounds, which makes traditional quantitative structure-property relationship (QSPR) modeling challenging. We use extended connectivity fingerprints (ECFP) and traditional QSPR fingerprints to develop machine learning models for high-fidelity prediction of glass transition temperature (Tg), melting temperature (Tm), density (rho), and tensile modulus (E). The nonlinear random forest model is in general found to be more accurate than linear regression; however, with feature selection or regularization, the accuracy of the linear models improves significantly and becomes comparable to that of the more complex nonlinear algorithm. None of the models or fingerprints was able to accurately predict the tensile modulus E, which we hypothesize is due to heterogeneity in the data and data sources, as well as inherent challenges in measuring it. Finally, the QSPR models revealed that the fraction of rotatable bonds and the rotational degree of freedom affect polyamide properties most profoundly and can be used in back-of-the-envelope calculations for a quick estimate of polymer attributes (glass transition temperature, melting temperature, and density). These QSPR models, although slightly less accurate, show the most promise for the polymer chemist seeking to develop an intuition for how to modify the chemistry to enhance specific attributes.
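The abstract's point about regularizing linear models on fingerprint features can be illustrated with a minimal sketch. It is not the authors' pipeline: the fingerprints here are random synthetic bit vectors rather than ECFP computed from real polyamides, the target is an invented Tg-like number, and the helper names (`fit_linear`, `rmse`, `make_synthetic_fingerprints`) and the ridge strength `alpha=1.0` are assumptions chosen only for illustration. The sketch fits an unregularized least-squares model and a ridge model on the same wide, sparse feature matrix and reports their errors.

```python
import numpy as np


def fit_linear(X, y, alpha=0.0):
    """Fit y ~ X @ w + b on centered data.

    alpha == 0 gives the minimum-norm least-squares fit;
    alpha > 0 gives ridge (L2-regularized) regression.
    """
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean
    if alpha == 0.0:
        w, *_ = np.linalg.lstsq(Xc, yc, rcond=None)
    else:
        n_features = X.shape[1]
        w = np.linalg.solve(Xc.T @ Xc + alpha * np.eye(n_features), Xc.T @ yc)
    b = y_mean - x_mean @ w
    return w, b


def rmse(X, y, w, b):
    """Root-mean-square error of the linear model (w, b) on (X, y)."""
    return float(np.sqrt(np.mean((X @ w + b - y) ** 2)))


def make_synthetic_fingerprints(n_samples=120, n_bits=256, n_informative=10,
                                noise=5.0, seed=0):
    """Hypothetical stand-in for fingerprint data: sparse random bits,
    with only the first n_informative bits influencing the target."""
    rng = np.random.default_rng(seed)
    X = (rng.random((n_samples, n_bits)) < 0.1).astype(float)
    true_w = np.zeros(n_bits)
    true_w[:n_informative] = rng.normal(0.0, 20.0, size=n_informative)
    y = 350.0 + X @ true_w + rng.normal(0.0, noise, size=n_samples)
    return X, y


X, y = make_synthetic_fingerprints()
X_train, y_train = X[:80], y[:80]
X_test, y_test = X[80:], y[80:]

# More features (256) than training samples (80): the unregularized fit
# can reproduce the training targets almost exactly.
w_ols, b_ols = fit_linear(X_train, y_train, alpha=0.0)
# alpha=1.0 is an arbitrary, untuned choice for this sketch.
w_ridge, b_ridge = fit_linear(X_train, y_train, alpha=1.0)

for name, (w, b) in {"least squares": (w_ols, b_ols),
                     "ridge": (w_ridge, b_ridge)}.items():
    print(f"{name:>13}: train RMSE = {rmse(X_train, y_train, w, b):.2f}, "
          f"test RMSE = {rmse(X_test, y_test, w, b):.2f}, "
          f"|w| = {np.linalg.norm(w):.2f}")
```

In practice the regularization strength would be chosen by cross-validation, and the comparison would be run on measured polymer data rather than on synthetic bits.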
Pages: 15