Active learning in regression, with application to stochastic dynamic programming

被引:0
作者
Teytaud, Olivier [1 ]
Gelly, Sylvain [1 ]
Mary, Jeremie [2 ]
机构
[1] Univ Paris Sud, INRIA, CNRS, UMR 8623,TAO, Paris, France
[2] Univ Lille, Inria, Grappa, Villeneuve Dascq, France
来源
ICINCO 2007: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL ICSO: INTELLIGENT CONTROL SYSTEMS AND OPTIMIZATION | 2007年
关键词
intelligent control systems and optimization; machine learning in control applications; active learning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study active learning as a derandomized form of sampling. We show that full derandomization is not suitable in a robust framework, propose partially derandomized samplings, and develop new active learning methods (i) in which expert knowledge is easy to integrate (ii) with a parameter for the exploration/exploitation dilemma (iii) less randomized than the full-random sampling (yet also not deterministic). Experiments are performed in the case of regression for value-function learning on a continuous domain. Our main results are (i) efficient partially derandomized point sets (ii) moderate-derandomization theorems (iii) experimental evidence of the importance of the frontier (iv) a new regression-specific user-friendly sampling tool less-robust than blind samplers but that sometimes works very efficiently in large dimensions. All experiments can be reproduced by downloading the source code and running the provided command line.
引用
收藏
页码:198 / +
页数:2
相关论文
共 30 条
  • [1] [Anonymous], 1998, MONTE CARLO METHODS, DOI DOI 10.1515/MCMA.1998.4.1.87
  • [2] [Anonymous], 1994, SIGIR
  • [3] Baeck T., 1995, EVOLUTIONARY ALGORIT
  • [4] BARTO AG, 1993, UMCS1993002
  • [5] Bertsekas D. P., 1996, Neuro Dynamic Programming, V1st
  • [6] CERVELLERA C, 2003, P WIRN 03, P53
  • [7] CHAPEL L, 2006, KERNEL MACHINES REIN
  • [8] Cohn D. A., 1995, Advances in Neural Information Processing Systems 7, P705
  • [9] SVMTorch: Support vector machines for large-scale regression problems
    Collobert, R
    Bengio, S
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) : 143 - 160
  • [10] DEVROYE L, 1994, STRONG UNIVERSAL CON