FIBER QUALITY PREDICTION USING NIR SPECTRAL DATA: TREE-BASED ENSEMBLE LEARNING VS DEEP NEURAL NETWORKS

被引:6
作者
Nasir, Vahid [1 ,2 ]
Ali, Syed Danish [3 ,4 ]
Mohammadpanah, Ahmad [5 ]
Raut, Sameen
Nabavi, Mohamad
Dahlen, Joseph [6 ]
Schimleck, Laurence [2 ]
机构
[1] Univ British Columbia, Vancouver, BC, Canada
[2] Oregon State Univ, Dept Wood Sci & Engn, Corvallis, OR 97331 USA
[3] Forest Serv, USDA, Forest Prod Lab, Madison, WI USA
[4] Univ Wisconsin, Dept Biol Syst Engn, Madison, WI USA
[5] Loyola Marymount Univ, Dept Mech Engn, Los Angeles, CA USA
[6] Univ Georgia, Warnell Sch Forestry & Nat Resources, Athens, GA USA
来源
WOOD AND FIBER SCIENCE | 2023年 / 55卷 / 01期
基金
美国国家科学基金会;
关键词
Convolutional neural network (CNN); deep learning; ensemble learning; gradient-boosting (TreeNet); near infrared (NIR) spectroscopy; random forest; wood; NEAR-INFRARED SPECTROSCOPY; PLANTED LOBLOLLY-PINE; WOOD SPECIFIC-GRAVITY; REGIONAL-VARIATION; CLASSIFICATION; PERFORMANCE; SECTIONS; MODELS; LENGTH;
D O I
10.22382/wfs-2023-10
中图分类号
S7 [林业];
学科分类号
0829 ; 0907 ;
摘要
The growing applications of near infrared (NIR) spectroscopy in wood quality control and monitoring necessitates focusing on data-driven methods to develop predictive models. Despite the advancements in analyzing NIR spectral data, literature on wood science and engineering has mainly uti-lized the classic model development methods, such as principal component analysis (PCA) regression or partial least squares (PLS) regression, with relatively limited studies conducted on evaluating machine learning (ML) models, and specifically, artificial neural networks (ANNs). This could potentially limit the performance of predictive models, specifically for some wood properties, such as tracheid width that are both time-consuming to measure and challenging to predict using spectral data. This study aims to enhance the prediction accuracy for tracheid width using deep neural networks and tree-based ensemble learning algorithms on a dataset consisting of 2018 samples and 692 features (NIR spectra wavelengths). Accord-ingly, NIR spectra were fed into multilayer perceptron (MLP), 1 dimensional-convolutional neural net-works (1D-CNNs), random forest, TreeNet gradient-boosting, extreme gradient-boosting (XGBoost), and light gradient-boosting machine (LGBM). It was of interest to study the performance of the models with and without applying PCA to assess how effective they would perform when analyzing NIR spectra with-out employing dimensionality reduction on data. It was shown that gradient-boosting machines outper-formed the ANNs regardless of the number of features (data dimension). All the models performed better without PCA. It is concluded that tree-based gradient-boosting machines could be effectively used for wood characterization utilizing a medium-sized NIR spectral dataset.
引用
收藏
页码:100 / 115
页数:16
相关论文
共 61 条
  • [1] Optuna: A Next-generation Hyperparameter Optimization Framework
    Akiba, Takuya
    Sano, Shotaro
    Yanase, Toshihiko
    Ohta, Takeru
    Koyama, Masanori
    [J]. KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 2623 - 2631
  • [2] Review of deep learning: concepts, CNN architectures, challenges, applications, future directions
    Alzubaidi, Laith
    Zhang, Jinglan
    Humaidi, Amjad J.
    Al-Dujaili, Ayad
    Duan, Ye
    Al-Shamma, Omran
    Santamaria, J.
    Fadhel, Mohammed A.
    Al-Amidie, Muthana
    Farhan, Laith
    [J]. JOURNAL OF BIG DATA, 2021, 8 (01)
  • [3] Modeling the longitudinal variation in wood specific gravity of planted loblolly pine (Pinus taeda) in the United States
    Antony, F.
    Schimleck, L. R.
    Daniels, R. F.
    Clark, A., III
    Hall, D. B.
    [J]. CANADIAN JOURNAL OF FOREST RESEARCH, 2010, 40 (12) : 2439 - 2451
  • [4] Effect of wood surface roughness on prediction of structural timber properties by infrared spectroscopy using ANFIS, ANN and PLS regression
    Ayanleye, Samuel
    Nasir, Vahid
    Avramidis, Stavros
    Cool, Julie
    [J]. EUROPEAN JOURNAL OF WOOD AND WOOD PRODUCTS, 2021, 79 (01) : 101 - 115
  • [5] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [6] Predicting and Correlating the Strength Properties of Wood Composite Process Parameters by Use of Boosted Regression Tree Models
    Carty, Dillon M.
    Young, Timothy M.
    Zaretzki, Russell L.
    Guess, Frank M.
    Petutschnigg, Alexander
    [J]. FOREST PRODUCTS JOURNAL, 2015, 65 (7-8) : 365 - 371
  • [7] Carvalho MG, 1997, TAPPI J, V80, P137
  • [8] A survey on feature selection methods
    Chandrashekar, Girish
    Sahin, Ferat
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (01) : 16 - 28
  • [9] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [10] Artificial neural network and partial least square regressions for rapid estimation of cellulose pulp dryness based on near infrared spectroscopic data
    Costa, Livia Ribeiro
    Denzin Tonoli, Gustavo Henrique
    Milagres, Flaviana Reis
    Gherardi Hein, Paulo Ricardo
    [J]. CARBOHYDRATE POLYMERS, 2019, 224