The prediction of crystal densities of a big data set using 1D and 2D structure features

Cited by: 0
Authors
Li, Xianlan [1]
Kong, Dingling [1]
Luan, Yue [1]
Guo, Lili [1]
Lu, Yanhua [2]
Li, Wei [2]
Tang, Meng [3]
Zhang, Qingyou [1]
Pang, Aimin [2]
Affiliations
[1] Henan Univ, Henan Engn Res Ctr Ind Circulating Water Treatment, Henan Joint Int Res Lab Environm Pollut Control Ma, Kaifeng 475004, Peoples R China
[2] Hubei Inst Aerosp Chemotechnol, Sci & Technol Aerosp Chem Power Lab, Xiangyang 441003, Hubei, Peoples R China
[3] Harbin Inst Technol, Sch Phys, Harbin 150001, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Density; Quantitative structure-property relationships; Big data set; Partial least squares; Random forest; NITRATE ESTERS; IONIC LIQUIDS; QSPR; ENTHALPIES; VAPORIZATION; EXPLOSIVES; NITRAMINES; SURFACE; HEAT;
DOI
10.1007/s11224-024-02279-4
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
A large data set of more than 30,000 organic compounds containing carbon, nitrogen, oxygen, fluorine, and hydrogen was collected, and the density of each compound was predicted from 1D descriptors derived from its molecular formula and 2D descriptors derived from its constitutional structural features. The 2D structural features comprise Benson's groups, corrected groups, and 2D structural features of the whole molecular structure. All descriptors were extracted by an in-house Java program with a function that ensures each atom (or bond) of a molecule is represented by Benson's groups once for atom-based (or bond-based) descriptors. Partial least squares (PLS) and random forest (RF) methods were used separately to build models for predicting density. Variable selection of the descriptors was then performed using RF variable importance. For PLS, the combination of the models built from atom-based and bond-based descriptors achieved the best results in this paper: for cross-validation of the training set, the Pearson correlation coefficient (R) = 0.9270, mean absolute error (MAE) = 0.0270 g·cm⁻³, and root mean squared error (RMSE) = 0.0426 g·cm⁻³; for prediction of the test set, R = 0.9454, MAE = 0.0263 g·cm⁻³, and RMSE = 0.0375 g·cm⁻³.
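The three statistics quoted in the abstract (R, MAE, RMSE) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names and the four density values are hypothetical and chosen only to show how each metric is computed from observed and predicted densities.

```python
import math


def pearson_r(y_true, y_pred):
    """Pearson correlation coefficient between observed and predicted values."""
    n = len(y_true)
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    return cov / math.sqrt(var_t * var_p)


def mae(y_true, y_pred):
    """Mean absolute error, in the same units as the inputs."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error, in the same units as the inputs."""
    squared = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    return math.sqrt(squared / len(y_true))


# Made-up densities in g·cm⁻³, for illustration only (not data from the paper).
observed = [1.20, 1.35, 1.50, 1.65]
predicted = [1.22, 1.33, 1.53, 1.61]

print(f"R    = {pearson_r(observed, predicted):.4f}")
print(f"MAE  = {mae(observed, predicted):.4f} g·cm⁻³")
print(f"RMSE = {rmse(observed, predicted):.4f} g·cm⁻³")
```

RMSE squares each error before averaging, so it weights large errors more heavily than MAE and is never smaller than MAE. This is consistent with the reported RMSE values exceeding the MAE values.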
Pages: 1375-1385
Number of pages: 11
Related papers
50 in total
  • [21] Integrative Assessment of Mixture Toxicity of Three Ionic Liquids on Acetylcholinesterase Using a Progressive Approach from 1D Point, 2D Curve, to 3D Surface
    Ge, Huilin
    Tao, Shanshan
    Zhou, Min
    Han, Bingjun
    Yuan, Hongqiu
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (21)
  • [22] Network structural hardening of polypropylene matrix using hybrid of 0D, 1D and 2D carbon-ceramic nanoparticles with enhanced mechanical and thermomechanical properties
    Uyor, Uwa O.
    Popoola, Patricia A., I
    Popoola, Olawale M.
    JOURNAL OF POLYMER ENGINEERING, 2022, 42 (06) : 520 - 534
  • [23] Dimensional diversity (0D, 1D, 2D, and 3D) in perovskite solar cells: exploring the potential of mixed-dimensional integrations
    Li, Xin
    Aftab, Sikandar
    Hussain, Sajjad
    Kabir, Fahmid
    Henaish, A. M. A.
    Al-Sehemi, Abdullah G.
    Pallavolu, Mohan Reddy
    Koyyada, Ganesh
    JOURNAL OF MATERIALS CHEMISTRY A, 2024, 12 (08) : 4421 - 4440
  • [24] Intermolecular interaction-induced hierarchical transformation in 1D nanohybrids: Analysis of conformational changes by 2D correlation spectroscopy
    Ho, Seok Park
    Yeong, Suk Choi
    Young, Mee Jung
    Won, Hi Hong
    Journal of the American Chemical Society, 2008, 130 (03) : 845 - 852
  • [25] Self-assembly of NiTPP on Cu(111): a transition from disordered 1D wires to 2D chiral domains
    Fatayer, Shadi
    Veiga, Roberto G. A.
    Prieto, Mauricio J.
    Perim, Eric
    Landers, Richard
    Miwa, Roberto H.
    de Siervo, Abner
    PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2015, 17 (28) : 18344 - 18352
  • [26] Ambient synthesis of a multifunctional 1D/2D hierarchical Ag-Ag2S nanowire/nanosheet heterostructure with diverse applications
    Xiong, Jinyan
    Han, Chao
    Li, Weijie
    Sun, Qiao
    Chen, Jun
    Chou, Shulei
    Li, Zhen
    Dou, Shixue
    CRYSTENGCOMM, 2016, 18 (06) : 930 - 937
  • [27] Titanium oxide-based 1D nanofilaments, 2D sheets, and mesoporous particles: Synthesis, characterization, and ion intercalation
    Badr, Hussein O.
    Cope, Jacob
    Kono, Takayuki
    Torita, Takeshi
    Lagunas, Francisco
    Castiel, Emmanuel
    Klie, Robert F.
    Barsoum, Michel W.
    MATTER, 2023, 6 (10) : 3538 - 3554
  • [28] A geostatistical approach to estimating the parameters of a 3D Cox-Boolean discrete fracture network from 1D and 2D sampling observations
    Hekmatnejad, Amin
    Emery, Xavier
    Elmo, Davide
    INTERNATIONAL JOURNAL OF ROCK MECHANICS AND MINING SCIENCES, 2019, 113 : 183 - 190
  • [29] Integrated 2D photonic crystal stack filter fabricated using nanoreplica molding
    Yang, Fuchyi
    Yen, Gary
    Cunningham, Brian T.
    OPTICS EXPRESS, 2010, 18 (11) : 11846 - 11858
  • [30] Constructing 1D/2D BiOI/ZnWO4 p-n heterojunction photocatalyst with enhanced photocatalytic removal of NO
    Gong, Siwen
    Zhu, Gangqiang
    Bello, Isaac Asusheyi
    Rao, Fei
    Li, Shiping
    Gao, Jianzhi
    Zubairu, Siyaka Mj
    Peng, Jianhong
    Hojamberdiev, Mirabbos
    JOURNAL OF CHEMICAL TECHNOLOGY AND BIOTECHNOLOGY, 2020, 95 (06) : 1705 - 1716