Machine learning prediction of lignin content in poplar with Raman spectroscopy

被引:59
作者
Gao, Wenli [1 ,2 ]
Zhou, Liang [1 ,2 ]
Liu, Shengquan [1 ,2 ]
Guan, Ying [1 ,2 ]
Gao, Hui [1 ,2 ]
Hui, Bin [3 ]
机构
[1] Anhui Agr Univ, Sch Forestry & Landscape Architecture, Hefei 230036, Anhui, Peoples R China
[2] Key Lab State Forest & Grassland Adm Wood Qual Im, Hefei 230036, Anhui, Peoples R China
[3] Qingdao Univ, Sch Mat Sci & Engn, Inst Marine Biobased Mat, State Key Lab Biofibers & Ecotext, Qingdao 266071, Peoples R China
基金
中国国家自然科学基金;
关键词
Raman spectroscopy; Lignin content; Machine learning; XGBoost; Gradient boosting machine; CELL-WALLS; WOOD; MICROSCOPY;
D O I
10.1016/j.biortech.2022.126812
中图分类号
S2 [农业工程];
学科分类号
0828 ;
摘要
Based on features extracted from Raman spectra, regularization algorithms, SVR, DT, RF, LightGBM, CatBoost, and XGBoost were used to develop prediction models for lignin content in poplar. Firstly, Raman features extracted from FT-Raman spectra after data processing were used as input of models and determined lignin contents were output. Secondly, grid-search combined with cross-validation was used to adjust the hyperparameters of models. Finally, the predictive models were built by aforementioned algorithms. The results indicated regularization algorithms, SVR, DT held test R-2 were >0.80 which means the predictive values from model still deviate from measured ones. Meanwhile, RF, LightGBM, CatBoost, and XGBoost were better than above algorithms, and their test R-2 were >0.91 which suggesting the predictive values was nearly close to measured ones. Therefore, fast and accurate methods for predicting lignin content were obtained and will be useful for screening suitable lignocellulosic resource with expected lignin content.
引用
收藏
页数:9
相关论文
共 48 条
[1]  
Adeosun S.O., 2019, Sustainable Lignin for Carbon Fibers: Principles, Techniques, and Applications, P193, DOI [10.1007/978-3-030-18792-75, DOI 10.1007/978-3-030-18792-75]
[2]   Analysis of Cellulose and Lignocellulose Materials by Raman Spectroscopy: A Review of the Current Status [J].
Agarwal, Umesh P. .
MOLECULES, 2019, 24 (09)
[3]   FT-Raman spectroscopy of wood: Identifying contributions of lignin and carbohydrate polymers in the spectrum of black spruce (Picea mariana) [J].
Agarwal, UP ;
Ralph, SA .
APPLIED SPECTROSCOPY, 1997, 51 (11) :1648-1655
[4]  
Agarwal UP, 2003, TAPPI J, V2, P22
[5]   INSITU RAMAN MICROPROBE STUDIES OF PLANT-CELL WALLS - MACROMOLECULAR ORGANIZATION AND COMPOSITIONAL VARIABILITY IN THE SECONDARY WALL OF PICEA-MARIANA (MILL) BSP [J].
AGARWAL, UP ;
ATALLA, RH .
PLANTA, 1986, 169 (03) :325-332
[6]  
Anghel A, 2019, Arxiv, DOI arXiv:1809.04559
[7]  
[Anonymous], Regression Diagnostics1980, P85, DOI DOI 10.1002/04717255153.CH3
[8]   A review on biopolymer production via lignin valorization [J].
Banu, J. Rajesh ;
Kavitha, S. ;
Kannah, R. Yukesh ;
Devi, T. Poornima ;
Gunasekaran, M. ;
Kim, Sang-Hyoun ;
Kumar, Gopalakrishnan .
BIORESOURCE TECHNOLOGY, 2019, 290
[9]   CatBoost model and artificial intelligence techniques for corporate failure prediction [J].
Ben Jabeur, Sami ;
Gharib, Cheima ;
Mefteh-Wali, Salma ;
Ben Arfi, Wissal .
TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2021, 166
[10]   Investigation of eucalypt and pine wood acid-soluble lignin by Py-GC-MS [J].
Brumano, Gabriel Castro ;
Colodette, Jorge Luiz ;
Fernandes, Sergio A. ;
Barbosa, Bianca Moreira ;
Borges Gomes, Fernando Jose .
HOLZFORSCHUNG, 2020, 74 (02) :149-155