Variable selection and data fusion for diesel cetane number prediction

被引:2
|
作者
Buendia-Garcia, J. [1 ,3 ]
Lacoue-Negre, M. [1 ,3 ]
Gornay, J. [1 ]
Mas-Garcia, S. [2 ,3 ]
Bendoula, R. [2 ,3 ]
Roger, J. M. [2 ,3 ]
机构
[1] IFP Energies Nouvelles, Solaize, France
[2] Univ Montpellier, Inst Agro, ITAP, INRAE, Montpellier, France
[3] ChemHouse Res Grp, Montpellier, France
关键词
Variable selection; Near-Infrared (NIR); Process variables; Data fusion; Hydrocracking; Diesel fuel; Cetane number; MULTIVARIATE CALIBRATION; SPECTROSCOPY; ALGORITHM; MODEL;
D O I
10.1016/j.fuel.2022.126297
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
This study evaluates the potential of variable selection to improve the performance of data fusion modelling to estimate diesel cetane number from NIR spectroscopy information acquired on total effluent samples obtained from the hydrocracking process and their operating variables. The evaluation conducted in this research was divided into four steps. First, predictive models were developed using each data block separately. Next, seven variable selection methods were applied on the NIR block, and eleven methods were applied on the process variable block. Then, with each data set generated from the variable selection analysis, single prediction models were generated and compared with those developed in the first step. Finally, data fusion was performed once the best variable selection method was defined for each data block. Two data fusion models were generated, a first using all the variables in the two blocks and a second using only the previously selected variables. In addition, the potential of the sequential and orthogonalized covariance selection (SO-CovSel) method was also analyzed. The results showed that the data fusion modelling using all variables from each data block improves the estimation of the diesel cetane number compared to single models (about 20% reduction of the RMSEP). However, using variable selection analysis before data fusion significantly improves the estimation of this property and leads to greater model stability regarding the RMSE's and r's (about 47% of the RMSEP). The Covariance Selection (CovSel) method was the most efficient in the NIR data block, while for the process variable data block, it was the sequential backward floating feature selection method (SBFFS) that gave the best performance. The advantages offered by the variable selection resulted not only in having a more accurate prediction of the property but also in improving the analysis and understanding of the process by determining the variables that significantly impact the property studied.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Predicting cetane number in diesel fuels using FTIR spectroscopy and PLS regression
    Barra, Issam
    Kharbach, Mourad
    Qannari, El Mostafa
    Hanafi, Mohamed
    Cherrah, Yahia
    Bouklouze, Abdelaziz
    VIBRATIONAL SPECTROSCOPY, 2020, 111
  • [22] Impact of dicyclopentadiene addition to diesel on cetane number, sooting propensity, and soot characteristics
    Alrefaai, Mhd Maher
    Pena, Gerardo D. J. Guerrero
    Raj, Abhijeet
    Stephen, Samuel
    Anjana, Tharalekshmy
    Dindi, Abdallah
    FUEL, 2018, 216 : 110 - 120
  • [23] EQUATIONS FOR PREDICTING THE CETANE NUMBER OF DIESEL FUELS FROM THEIR PHYSICAL-PROPERTIES
    LADOMMATOS, N
    GOACHER, J
    FUEL, 1995, 74 (07) : 1083 - 1093
  • [24] Relationship between hydrocarbons reaction and cetane number in diesel hydro-upgrading
    Zhang, Y. (zhangyongkui.sjlh@sinopec.com), 1600, Science Press (29): : 376 - 382
  • [25] A COMPREHENSIVE MODEL FOR CETANE NUMBER PREDICTION USING MACHINE LEARNING
    Jameel, Abdul Gani Abdul
    PROCEEDINGS OF ASME TURBO EXPO 2021: TURBOMACHINERY TECHNICAL CONFERENCE AND EXPOSITION, VOL 3B, 2021,
  • [26] Artificial neural networks used for the prediction of the cetane number of biodiesel
    Ramadhas, A. S.
    Jayaraj, S.
    Muraleedharan, C.
    Padmakumari, K.
    RENEWABLE ENERGY, 2006, 31 (15) : 2524 - 2533
  • [27] Prediction of the cetane number of biodiesel using artificial neural networks and multiple linear regression
    Piloto-Rodriguez, Ramon
    Sanchez-Borroto, Yisel
    Lapuerta, Magin
    Goyos-Perez, Leonardo
    Verhelst, Sebastian
    ENERGY CONVERSION AND MANAGEMENT, 2013, 65 : 255 - 261
  • [28] Effect of fuel cetane number and injection pressure on a DI Diesel engine performance and emissions
    Içingür, Y
    Altiparmak, D
    ENERGY CONVERSION AND MANAGEMENT, 2003, 44 (03) : 389 - 397
  • [29] VARIABLE SELECTION AND PREDICTION WITH INCOMPLETE HIGH-DIMENSIONAL DATA
    Liu, Ying
    Wang, Yuanjia
    Feng, Yang
    Wall, Melanie M.
    ANNALS OF APPLIED STATISTICS, 2016, 10 (01) : 418 - 450
  • [30] Development of a Mathematical Model for Calculating the Cetane Number of Diesel Fuel Based on Their Hydrocarbon Composition and Intermolecular Interactions of Mixture Components
    Maylin, M., V
    Frantsina, E., V
    Grinko, A. A.
    COMBUSTION SCIENCE AND TECHNOLOGY, 2021, 193 (07) : 1140 - 1153