Machine learning prediction of lignin content in poplar with Raman spectroscopy

被引:52
作者
Gao, Wenli [1 ,2 ]
Zhou, Liang [1 ,2 ]
Liu, Shengquan [1 ,2 ]
Guan, Ying [1 ,2 ]
Gao, Hui [1 ,2 ]
Hui, Bin [3 ]
机构
[1] Anhui Agr Univ, Sch Forestry & Landscape Architecture, Hefei 230036, Anhui, Peoples R China
[2] Key Lab State Forest & Grassland Adm Wood Qual Im, Hefei 230036, Anhui, Peoples R China
[3] Qingdao Univ, Sch Mat Sci & Engn, Inst Marine Biobased Mat, State Key Lab Biofibers & Ecotext, Qingdao 266071, Peoples R China
基金
中国国家自然科学基金;
关键词
Raman spectroscopy; Lignin content; Machine learning; XGBoost; Gradient boosting machine; CELL-WALLS; WOOD; MICROSCOPY;
D O I
10.1016/j.biortech.2022.126812
中图分类号
S2 [农业工程];
学科分类号
0828 ;
摘要
Based on features extracted from Raman spectra, regularization algorithms, SVR, DT, RF, LightGBM, CatBoost, and XGBoost were used to develop prediction models for lignin content in poplar. Firstly, Raman features extracted from FT-Raman spectra after data processing were used as input of models and determined lignin contents were output. Secondly, grid-search combined with cross-validation was used to adjust the hyperparameters of models. Finally, the predictive models were built by aforementioned algorithms. The results indicated regularization algorithms, SVR, DT held test R-2 were >0.80 which means the predictive values from model still deviate from measured ones. Meanwhile, RF, LightGBM, CatBoost, and XGBoost were better than above algorithms, and their test R-2 were >0.91 which suggesting the predictive values was nearly close to measured ones. Therefore, fast and accurate methods for predicting lignin content were obtained and will be useful for screening suitable lignocellulosic resource with expected lignin content.
引用
收藏
页数:9
相关论文
共 48 条
  • [1] Adeosun S.O., 2019, Sustainable Lignin for Carbon Fibers: Principles, Techniques, and Applications, P193, DOI [10.1007/978-3-030-18792-75, DOI 10.1007/978-3-030-18792-75]
  • [2] Analysis of Cellulose and Lignocellulose Materials by Raman Spectroscopy: A Review of the Current Status
    Agarwal, Umesh P.
    [J]. MOLECULES, 2019, 24 (09):
  • [3] FT-Raman spectroscopy of wood: Identifying contributions of lignin and carbohydrate polymers in the spectrum of black spruce (Picea mariana)
    Agarwal, UP
    Ralph, SA
    [J]. APPLIED SPECTROSCOPY, 1997, 51 (11) : 1648 - 1655
  • [4] Agarwal UP, 2003, TAPPI J, V2, P22
  • [5] INSITU RAMAN MICROPROBE STUDIES OF PLANT-CELL WALLS - MACROMOLECULAR ORGANIZATION AND COMPOSITIONAL VARIABILITY IN THE SECONDARY WALL OF PICEA-MARIANA (MILL) BSP
    AGARWAL, UP
    ATALLA, RH
    [J]. PLANTA, 1986, 169 (03) : 325 - 332
  • [6] Anghel A, 2019, Arxiv, DOI arXiv:1809.04559
  • [7] A review on biopolymer production via lignin valorization
    Banu, J. Rajesh
    Kavitha, S.
    Kannah, R. Yukesh
    Devi, T. Poornima
    Gunasekaran, M.
    Kim, Sang-Hyoun
    Kumar, Gopalakrishnan
    [J]. BIORESOURCE TECHNOLOGY, 2019, 290
  • [8] Belsley D.A., 1980, REGRESSION DIAGNOSTI, P85, DOI [10.1002/04717255153.ch3, DOI 10.1002/04717255153.CH3]
  • [9] CatBoost model and artificial intelligence techniques for corporate failure prediction
    Ben Jabeur, Sami
    Gharib, Cheima
    Mefteh-Wali, Salma
    Ben Arfi, Wissal
    [J]. TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2021, 166
  • [10] Investigation of eucalypt and pine wood acid-soluble lignin by Py-GC-MS
    Brumano, Gabriel Castro
    Colodette, Jorge Luiz
    Fernandes, Sergio A.
    Barbosa, Bianca Moreira
    Borges Gomes, Fernando Jose
    [J]. HOLZFORSCHUNG, 2020, 74 (02) : 149 - 155