Multiple-v support vector regression based on spectral risk measure minimization

被引:3
作者
Wang, Yongqiao [1 ]
Ni, He [1 ]
Wang, Shouyang [2 ]
机构
[1] Zhejiang Gongshang Univ, Sch Finance, Hangzhou 310018, Zhejiang, Peoples R China
[2] Chinese Acad Sci, Acad Math & Syst Sci, Inst Syst Sci, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
Conditional value-at-risk; Spectral risk measure; Support vector regression; Kernel methods; VALUE-AT-RISK; CONDITIONAL VALUE; NU; FRAMEWORK; MACHINES; TUTORIAL; MODELS;
D O I
10.1016/j.neucom.2012.09.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Statistical learning theory provides the justification of the c-insensitive loss in support vector regression, but suggests little guidance on the determination of the critical hyper-parameter c. Instead of predefining c, v-support vector regression automatically selects c by making the percent of deviations larger than c be asymptotically equal to v. In stochastic programming terminology, the goal of v-support vector regression is to minimize the conditional Value-at-Risk measure of deviations, i.e. the expectation of the larger v-percent deviations. This paper tackles the determination of the critical hyper-parameter v in v-support vector regression when the error term follows a complex distribution. Instead of one singleton v, the paper assumes v to be a combination of multiple, finite or infinite, candidate choices. Thus, the cost function becomes a weighted sum of component conditional value-at-risk measures associated with these base vs. This paper shows that this cost function can be represented with a spectral risk measure and its minimization can be reformulated to a linear programming problem. Experiments on three artificial data sets show that this multiple-v support vector regression has great advantage over the classical v-support vector regression when the error terms follow mixed polynomial distributions. Experiments on 10 real-world data sets also clearly demonstrate that this new method can achieve better performance than c-support vector regression and v-support vector regression. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:217 / 228
页数:12
相关论文
共 46 条
[1]   On the coherence of expected shortfall [J].
Acerbi, C ;
Tasche, D .
JOURNAL OF BANKING & FINANCE, 2002, 26 (07) :1487-1503
[2]  
Acerbi C., 2002, TECHNICAL REPORT
[3]   Spectral risk measures and portfolio selection [J].
Adam, Alexandre ;
Houkari, Mohamed ;
Laurent, Jean-Paul .
JOURNAL OF BANKING & FINANCE, 2008, 32 (09) :1870-1882
[4]  
Aizerman M. A., 1964, Automation and Remote Control, V25, P821
[5]   Coherent measures of risk [J].
Artzner, P ;
Delbaen, F ;
Eber, JM ;
Heath, D .
MATHEMATICAL FINANCE, 1999, 9 (03) :203-228
[6]  
Bach F., 2004, P 21 INT C MACH LEAR
[7]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[8]   Experimentally optimal ν in support vector regression for different noise models and parameter settings [J].
Chalimourda, A ;
Schölkopf, B ;
Smola, AJ .
NEURAL NETWORKS, 2004, 17 (01) :127-141
[9]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[10]   A tutorial on v-support vector machines [J].
Chen, PH ;
Lin, CJ ;
Schölkopf, B .
APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2005, 21 (02) :111-136