Variable selection and data fusion for diesel cetane number prediction

被引:3
作者
Buendia-Garcia, J. [1 ,3 ]
Lacoue-Negre, M. [1 ,3 ]
Gornay, J. [1 ]
Mas-Garcia, S. [2 ,3 ]
Bendoula, R. [2 ,3 ]
Roger, J. M. [2 ,3 ]
机构
[1] IFP Energies Nouvelles, Solaize, France
[2] Univ Montpellier, Inst Agro, ITAP, INRAE, Montpellier, France
[3] ChemHouse Res Grp, Montpellier, France
关键词
Variable selection; Near-Infrared (NIR); Process variables; Data fusion; Hydrocracking; Diesel fuel; Cetane number; MULTIVARIATE CALIBRATION; SPECTROSCOPY; ALGORITHM; MODEL;
D O I
10.1016/j.fuel.2022.126297
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
This study evaluates the potential of variable selection to improve the performance of data fusion modelling to estimate diesel cetane number from NIR spectroscopy information acquired on total effluent samples obtained from the hydrocracking process and their operating variables. The evaluation conducted in this research was divided into four steps. First, predictive models were developed using each data block separately. Next, seven variable selection methods were applied on the NIR block, and eleven methods were applied on the process variable block. Then, with each data set generated from the variable selection analysis, single prediction models were generated and compared with those developed in the first step. Finally, data fusion was performed once the best variable selection method was defined for each data block. Two data fusion models were generated, a first using all the variables in the two blocks and a second using only the previously selected variables. In addition, the potential of the sequential and orthogonalized covariance selection (SO-CovSel) method was also analyzed. The results showed that the data fusion modelling using all variables from each data block improves the estimation of the diesel cetane number compared to single models (about 20% reduction of the RMSEP). However, using variable selection analysis before data fusion significantly improves the estimation of this property and leads to greater model stability regarding the RMSE's and r's (about 47% of the RMSEP). The Covariance Selection (CovSel) method was the most efficient in the NIR data block, while for the process variable data block, it was the sequential backward floating feature selection method (SBFFS) that gave the best performance. The advantages offered by the variable selection resulted not only in having a more accurate prediction of the property but also in improving the analysis and understanding of the process by determining the variables that significantly impact the property studied.
引用
收藏
页数:12
相关论文
共 54 条
[41]   VSN: Variable sorting for normalization [J].
Rabatel, Gilles ;
Marini, Federico ;
Walczak, Beata ;
Roger, Jean-Michel .
JOURNAL OF CHEMOMETRICS, 2020, 34 (02)
[42]   Biomarker discovery in mass spectral profiles by means of selectivity ratio plot [J].
Rajalahti, Tarja ;
Arneberg, Reidar ;
Berven, Frode S. ;
Myhr, Kjell-Morten ;
Ulvik, Rune J. ;
Kvalheim, Olav M. .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2009, 95 (01) :35-48
[43]   CovSel: Variable selection for highly multivariate and multi-response calibration Application to IR spectroscopy [J].
Roger, J. M. ;
Palagos, B. ;
Bertrand, D. ;
Fernandez-Ahumada, E. .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2011, 106 (02) :216-223
[44]   SMOOTHING + DIFFERENTIATION OF DATA BY SIMPLIFIED LEAST SQUARES PROCEDURES [J].
SAVITZKY, A ;
GOLAY, MJE .
ANALYTICAL CHEMISTRY, 1964, 36 (08) :1627-&
[45]  
Shukla A, 2020, Variable selection and modelling from NIR spectra data: A case study of diesel quality prediction using LASSO and Regression Tree
[46]  
Smolinska A., 2019, Data Handling in Science and Technology, V31, P51
[47]   Fusing NIR and Process Sensors Data for Polymer Production Monitoring [J].
Strani, Lorenzo ;
Mantovani, Erik ;
Bonacini, Francesco ;
Marini, Federico ;
Cocchi, Marina .
FRONTIERS IN CHEMISTRY, 2021, 9
[49]   Variable selection, outlier detection, and figures of merit estimation in a partial least-squares regression multivariate calibration model. A case study for the determination of quality parameters in the alcohol industry by near-infrared spectroscopy [J].
Valderrama, Patricia ;
Braga, Jez Willian B. ;
Poppi, Ronei Jesus .
JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY, 2007, 55 (21) :8331-8338
[50]   Optimization of the multivariate calibration of a Vis-NIR sensor for the on-line monitoring of marine diesel engine lubricating oil by variable selection methods [J].
Villar, Alberto ;
Fernandez, Santiago ;
Gorritxategi, Eneko ;
Ciria, Jose I. ;
Fernandez, Luis A. .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2014, 130 :68-75