Sequential Tests for Large-Scale Learning

被引:1
作者
Korattikara, Anoop [1 ]
Chen, Yutian [2 ,3 ]
Welling, Max [4 ]
机构
[1] Univ Calif Irvine, Dept Comp Sci, Irvine, CA 92697 USA
[2] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
[3] Univ Calif Irvine, Irvine, CA 92697 USA
[4] Univ Amsterdam, Inst Informat, NL-1098 XH Amsterdam, Netherlands
基金
美国国家科学基金会;
关键词
CHAIN MONTE-CARLO; ALGORITHMS;
D O I
10.1162/NECO_a_00796
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We argue that when faced with big data sets, learning and inference algorithms should compute updates using only subsets of data items. We introduce algorithms that use sequential hypothesis tests to adaptively select such a subset of data points. The statistical properties of this subsampling process can be used to control the efficiency and accuracy of learning or inference. In the context of learning by optimization, we test for the probability that the update direction is no more than 90 degrees in the wrong direction. In the context of posterior inference using Markov chain Monte Carlo, we test for the probability that our decision to accept or reject a sample is wrong. We experimentally evaluate our algorithms on a number of models and data sets.
引用
收藏
页码:45 / 70
页数:26
相关论文
共 33 条
[1]   Noisy Monte Carlo: convergence of Markov chains with approximate transition kernels [J].
Alquier, P. ;
Friel, N. ;
Everitt, R. ;
Boland, A. .
STATISTICS AND COMPUTING, 2016, 26 (1-2) :29-47
[2]  
Amari S, 1996, ADV NEUR IN, V8, P757
[3]  
[Anonymous], ADV NEURAL INFORM PR
[4]  
[Anonymous], P INT C MACH LEARN
[5]  
[Anonymous], ARXIV14111690V2
[6]  
[Anonymous], P INT C MACH LEARN
[7]  
[Anonymous], 2007, Large Scale Kernel Machines
[8]  
[Anonymous], THESIS
[9]  
[Anonymous], 2014, ADV NEURAL INFORM PR
[10]  
[Anonymous], 2011, P 12 INT SOC MUS INF, DOI DOI 10.7916/D8NZ8J07