Fast Cross-Validation

被引:0
作者
Liu, Yong [1 ]
Lin, Hailun [1 ]
Ding, Lizhong [2 ]
Wang, Weiping [1 ]
Liao, Shizhong [3 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] King Abdullah Univ Sci & Technol KAUST, Thuwal, Saudi Arabia
[3] Tianjin Univ, Tianjin, Peoples R China
来源
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2018年
基金
中国国家自然科学基金;
关键词
MODEL SELECTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-validation (CV) is the most widely adopted approach for selecting the optimal model. However, the computation of CV has high complexity due to multiple times of learner training, making it disabled for large scale model selection. In this paper, we present an approximate approach to CV based on the theoretical notion of Bouligand influence function (BIF) and the Nystrom method for kernel methods. We first establish the relationship between the theoretical notion of BIF and CV, and propose a method to approximate the CV via the Taylor expansion of BIF. Then, we provide a novel computing method to calculate the BIF for general distribution, and evaluate BIF for sample distribution. Finally, we use the Nystrom method to accelerate the computation of the BIF matrix for giving the finally approximate CV criterion. The proposed approximate CV requires training only once and is suitable for a wide variety of kernel methods. Experimental results on lots of datasets show that our approximate CV has no statistical discrepancy with the original CV, but can significantly improve the efficiency.
引用
收藏
页码:2497 / 2503
页数:7
相关论文
共 25 条
  • [1] [Anonymous], 2011, 3 ASIAN C MACHINE LE
  • [2] Efficient approximate leave-one-out cross-validation for kernel logistic regression
    Cawley, Gavin C.
    Talbot, Nicola L. C.
    [J]. MACHINE LEARNING, 2008, 71 (2-3) : 243 - 264
  • [3] Cawley GC, 2007, J MACH LEARN RES, V8, P841
  • [4] Cawley GC, 2006, IEEE IJCNN, P1661
  • [5] Fast exact leave-one-out cross-validation of sparse least-squares support vector machines
    Cawley, GC
    Talbot, NLC
    [J]. NEURAL NETWORKS, 2004, 17 (10) : 1467 - 1475
  • [6] Choosing multiple parameters for support vector machines
    Chapelle, O
    Vapnik, V
    Bousquet, O
    Mukherjee, S
    [J]. MACHINE LEARNING, 2002, 46 (1-3) : 131 - 159
  • [7] Christmann A, 2008, J MACH LEARN RES, V9, P915
  • [8] De Vito E, 2004, J MACH LEARN RES, V5, P1363
  • [9] Debruyne M, 2008, J MACH LEARN RES, V9, P2377
  • [10] Ding JP, 2014, 2014 9TH INTERNATIONAL SYMPOSIUM ON COMMUNICATION SYSTEMS, NETWORKS & DIGITAL SIGNAL PROCESSING (CSNDSP), P1159, DOI 10.1109/CSNDSP.2014.6924005