Evaluation of the sample size requirements of machine learning models used in engine design and research

被引:0
作者
Ravindran, Arun C. [1 ]
Kokjohn, Sage L. [1 ]
机构
[1] Univ Wisconsin, Dept Mech Engn, 1513 Univ Ave, Madison, WI 53706 USA
关键词
Engine research; machine learning; Gaussian process regression; sample size requirements; CDC; RCCI; DISI; GAUSSIAN PROCESS REGRESSION; OPTIMIZATION;
D O I
10.1177/14680874221137185
中图分类号
O414.1 [热力学];
学科分类号
摘要
Machine Learning (ML) techniques have been effectively used to learn the intricate relationships between variables that play a significant role in the field of engine design. However, there are two challenges to this approach - (1) Identifying ML regression models that could capture the trends of a response variable, given a non-parametric training set of relatively small size, with an acceptable accuracy and response time, and (2) identifying the size of the dataset, with respect to the input parameters, to be used for training and validation of the ML models. There is not enough information in the literature to reach a consensus on the sampling size to be used for an engine design problem that would yield an acceptable measure of goodness-of-fit. This is evident from the varied size of the training/validation data size used within the engine research community, as will be elaborated on in the following sections. The objective of this paper is to provide an insight into the sampling size required by the different ML models to achieve an acceptable fit between the model and the data, to be used in three types of engine design/optimization problems - (1) conventional diesel combustion (CDC) engine performance over a wide range of speed and load, (2) cold-start operation of a direct-injected spark-ignition (DISI) engine, and (3) high-load performance of a dual-fuel reactivity-controlled compression ignition (RCCI) engine.
引用
收藏
页码:2973 / 2990
页数:18
相关论文
共 26 条
[1]   Combining neural networks and genetic algorithms to predict and reduce diesel engine emissions [J].
Alonso, Jose M. ;
Alvarruiz, Fernando ;
Desantes, Jose M. ;
Hernandez, Leonor ;
Hernandez, Vicente ;
Molto, German .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2007, 11 (01) :46-55
[2]  
Amsden A.A., 1999, KIVA-3V, Release 2, Improvements to KIVA-3V
[3]  
Anguita D., 2012, 20 EUROPEAN S ARTIFI, P441
[4]  
Badra J., 2019, INTERNAL COMBUSTION
[5]  
Bertram AM., 2019, SAE TECHNICAL PAPER
[6]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[7]  
Clifton L, 2012, IEEE ENG MED BIO, P6161, DOI 10.1109/EMBC.2012.6347400
[8]   A Numerical Methodology For The Multi-Objective Optimization Of The DI Diesel Engine Combustion [J].
Costa, Marco ;
Bianchi, Gian Marco ;
Forte, Claudio ;
Cazzoli, Giulio .
ATI 2013 - 68TH CONFERENCE OF THE ITALIAN THERMAL MACHINES ENGINEERING ASSOCIATION, 2014, 45 :711-720
[9]  
Dürichen R, 2014, 2014 IEEE-EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS (BHI), P492, DOI 10.1109/BHI.2014.6864410
[10]  
He Y., 2002, SAE TRANSACTIONS, P1532