Machine learning-based q-RASPR modeling of power conversion efficiency of organic dyes in dye-sensitized solar cells

被引:17
作者
Pore, Souvik [1 ]
Banerjee, Arkaprava [1 ]
Roy, Kunal [1 ]
机构
[1] Jadavpur Univ, Dept Pharmaceut Technol, Drug Theoret & Chemoinformat Lab, 188 Raja SC Mullick Rd, Kolkata 700032, India
关键词
PHOTOVOLTAIC PERFORMANCE; SELECTION METHODS; QSAR; DESIGN; VALIDATION; PREDICTION;
D O I
10.1039/d3se00457k
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Different computational tools are now popularly used as an alternative to experiments for predicting several property endpoints of industrial importance. Recently, read-across and quantitative structure-property relationship (QSPR) have been merged to develop a new modeling technique read-across structure-property relationship (RASPR) which appears to have much potential in predictive modeling. This approach is also promising for modeling relatively smaller data sets as the similarity-based RASPR descriptors are computed from multiple structural and physicochemical features. To understand the potential of RASPR in data gap filling, we have undertaken a case study of modeling Power Conversion Efficiency (PCE) of different classes of organic dyes used in Dye-Sensitized Solar Cells (DSSCs) for renewable energy generation. We have used a large dataset of 429 compounds covering 4 classes of organic dyes. We initially performed read-across analysis using different similarity measures with structural analogues for query compounds and calculated the weighted average predictions. Based on the read-across optimized settings, RASPR descriptors were calculated, and these were then merged with the chemical descriptors, and finally, a single partial least squares (PLS) model was developed for each of the dye classes after feature selection, followed by additional Machine Learning (ML) models. The external prediction quality of the final RASPR models superseded those of the previously developed QSPR models using the same level of chemical information. The important structural features and similarity measures contributing to the PCE have been extracted using the RASPR method which can be used to enhance the PCE values in the newly designed dyes. The RASPR method may also be efficiently applied in modeling other properties of interest in a similar manner.
引用
收藏
页码:3412 / 3431
页数:20
相关论文
共 71 条
[1]   What Is the Best Anchoring Group for a Dye in a Dye-Sensitized Solar Cell? [J].
Ambrosio, Francesco ;
Martsinovich, Natalia ;
Troisi, Alessandro .
JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2012, 3 (11) :1531-1535
[2]   A comprehensive study of renewable energy sources: Classifications, challenges and suggestions [J].
Ang, Tze-Zhang ;
Salem, Mohamed ;
Kamarol, Mohamad ;
Das, Himadry Shekhar ;
Nazari, Mohammad Alhuyi ;
Prabaharan, Natarajan .
ENERGY STRATEGY REVIEWS, 2022, 43
[3]   Comprehensive study on dye sensitized solar cell in subsystem level to excel performance potential: A review [J].
Arka, Girija Nandan ;
Prasad, Shashi Bhushan ;
Singh, Subhash .
SOLAR ENERGY, 2021, 226 :192-213
[4]   Fluorene-Based Sensitizers with a Phenothiazine Donor: Effect of Mode of Donor Tethering on the Performance of Dye-Sensitized Solar Cells [J].
Baheti, Abhishek ;
Thomas, K. R. Justin ;
Li, Chun-Ting ;
Lee, Chuan-Pei ;
Ho, Kuo-Chuan .
ACS APPLIED MATERIALS & INTERFACES, 2015, 7 (04) :2249-2262
[5]  
Banerjee, 2023, CHEMRXIV, DOI [10.26434/chemrxiv-2023-20v0k, DOI 10.26434/CHEMRXIV-2023-20V0K]
[6]   Machine-learning-based similarity meets traditional QSAR: "q-RASAR" for the enhancement of the external predictivity and detection of prediction confidence outliers in an hERG toxicity dataset [J].
Banerjee, Arkaprava ;
Roy, Kunal .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2023, 237
[7]   Efficient predictions of cytotoxicity of TiO2-based multi-component nanoparticles using a machine learning-based q-RASAR approach [J].
Banerjee, Arkaprava ;
Kar, Supratik ;
Pore, Souvik ;
Roy, Kunal .
NANOTOXICOLOGY, 2023, 17 (01) :78-93
[8]   A machine learning q-RASPR approach for efficient predictions of the specific surface area of perovskites [J].
Banerjee, Arkaprava ;
Gajewicz-Skretna, Agnieszka ;
Roy, Kunal .
MOLECULAR INFORMATICS, 2023, 42 (04)
[9]   Quantitative predictions from chemical read-across and their confidence measures [J].
Banerjee, Arkaprava ;
Chatterjee, Mainak ;
De, Priyanka ;
Roy, Kunal .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2022, 227
[10]   First report of q-RASAR modeling toward an approach of easy interpretability and efficient transferability [J].
Banerjee, Arkaprava ;
Roy, Kunal .
MOLECULAR DIVERSITY, 2022, 26 (05) :2847-2862