Reducing the effect of sample bias for small data sets with double-weighted support vector transfer regression

被引:30
作者
Luo, Huan [1 ]
Paal, Stephanie German [1 ]
机构
[1] Texas A&M Univ, Zachry Dept Civil & Environm Engn, College Stn, TX 77843 USA
关键词
SHEAR-STRENGTH; MACHINE; ROBUSTNESS; PREDICTION; ALGORITHM;
D O I
10.1111/mice.12617
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Small data sets are an extremely challenging problem in the machine learning (ML) realm, and in specific, in regression scenarios, as the lack of relevant data can lead to ML models that have large bias. However, there are many applications for which a purely data-driven procedure would be advantageous, but a large amount of data are not available. This article proposes a novel regression-based transfer learning (TL) model to address this challenge, where TL is defined as knowledge transfer from a large, relevant data set (source domain data) to a small data set (target domain data). The proposed TL model is termed double-weighted support vector transfer regression (DW-SVTR), which couples least squares support vector machines for regression (LS-SVMR) with two weight functions. The first weight function uses kernel mean matching (KMM) to reweight the source domain data such that the mean values of the source and target domain data in a reproduced kernel Hilbert space (RKHS) are close. In this way, the source domain data points relevant to the target domain points have a larger weight than irrelevant source domain points. The second weight is a function of estimated residuals, which aims to further reduce the negative interference of irrelevant source domain points. The proposed approach is assessed and validated via simulated data and by enhanced shear strength prediction of nonductile columns based on limited availability of nonductile column data. Specifically, the results for the latter show that the proposed DW-SVTR can reduce the root mean square error (RMSE) by 34% and enhance the coefficient of determination (R-2) by 229%. These numerical results demonstrate that the DW-SVTR significantly reduces the effect of small sample bias and improves prediction performance compared to standard ML methods.
引用
收藏
页码:248 / 263
页数:16
相关论文
共 46 条
[1]   Neural networks in civil engineering: 1989-2000 [J].
Adeli, H .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2001, 16 (02) :126-142
[2]   Modelling mechanical behaviour of rubber concrete using evolutionary polynomial regression [J].
Ahangar-Asr, Alireza ;
Faramarzi, Asaad ;
Javadi, Akbar A. ;
Giustolisi, Orazio .
ENGINEERING COMPUTATIONS, 2011, 28 (3-4) :492-507
[3]   A robust predictive model for base shear of steel frame structures using a hybrid genetic programming and simulated annealing method [J].
Aminian, Pejman ;
Javid, Mohamad Reza ;
Asghari, Abazar ;
Gandomi, Amir Hossein ;
Esmaeili, Milad Arab .
NEURAL COMPUTING & APPLICATIONS, 2011, 20 (08) :1321-1332
[4]  
[Anonymous], 2013, INTRO STAT LEARNING
[5]  
[Anonymous], 2007, NIPS
[6]  
[Anonymous], 2007, P 24 INT C ICML 2007, DOI [10.1145/1273496.1273521, DOI 10.1145/1273496.1273521]
[7]   Convex multi-task feature learning [J].
Argyriou, Andreas ;
Evgeniou, Theodoros ;
Pontil, Massimiliano .
MACHINE LEARNING, 2008, 73 (03) :243-272
[8]   Autonomous Structural Visual Inspection Using Region-Based Deep Learning for Detecting Multiple Damage Types [J].
Cha, Young-Jin ;
Choi, Wooram ;
Suh, Gahyun ;
Mahmoudkhani, Sadegh ;
Buyukozturk, Oral .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2018, 33 (09) :731-747
[9]   Deep Learning-Based Crack Damage Detection Using Convolutional Neural Networks [J].
Cha, Young-Jin ;
Choi, Wooram ;
Buyukozturk, Oral .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2017, 32 (05) :361-378
[10]   Evolutionary multivariate adaptive regression splines for estimating shear strength in reinforced-concrete deep beams [J].
Cheng, Min-Yuan ;
Cao, Minh-Tu .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 28 :86-96