Coreset-based Conformal Prediction for Large-scale Learning

被引:0
作者
Riquelme-Granada, Nery [1 ]
Khuong An Nguyen [1 ]
Luo, Zhiyuan [1 ]
机构
[1] Royal Holloway Univ London, Dept Comp Sci, Egham TW20 0EX, Surrey, England
来源
CONFORMAL AND PROBABILISTIC PREDICTION AND APPLICATIONS, VOL 105 | 2019年 / 105卷
关键词
Coreset; logistic regression; importance sampling; conformal predictors; ALGORITHMS; SETS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the volume of data increase rapidly, most traditional machine learning algorithms become computationally prohibitive. Furthermore, the available data can be so big that a single machine's memory can easily be overflown. We propose Coreset-Based Conformal Prediction, a strategy for dealing with big data by applying conformal predictors to a weighted summary of data - namely the coreset. We compare our approach against standalone inductive conformal predictors over three large competition-grade datasets to demonstrate that our coreset-based strategy may not only significantly improve the learning speed, but also retains predictions validity and the predictors' efficiency.
引用
收藏
页数:21
相关论文
共 30 条
[1]  
Ackermann MR, 2012, J EXP ALGORITHM, V17, P2, DOI [10.1145/2133803.2184450, DOI 10.1145/2133803.2184450]
[2]  
Agarwal PK, 2010, PROC APPL MATH, V135, P1481
[3]   Approximating extent measures of points [J].
Agarwal, PK ;
Har-Peled, S ;
Varadarajan, KR .
JOURNAL OF THE ACM, 2004, 51 (04) :606-635
[4]  
Bachem O, 2017, Arxiv, DOI arXiv:1703.06476
[5]  
Badoiu M, 2003, SIAM PROC S, P801
[6]   Optimal core-sets for balls [J].
Badoiu, Mihai ;
Clarkson, Kenneth L. .
COMPUTATIONAL GEOMETRY-THEORY AND APPLICATIONS, 2008, 40 (01) :14-22
[7]  
Balasubramanian Vineeth, 2014, Conformal prediction for reliable machine learning: theory, adaptations and applications
[8]   Comparative accuracies of artificial neural networks and discriminant analysis in predicting forest cover types from cartographic variables [J].
Blackard, JA ;
Dean, DJ .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 1999, 24 (03) :131-151
[9]  
Braverman V, 2020, Arxiv, DOI arXiv:1612.00889
[10]   Faster core-set constructions and data-stream algorithms in fixed dimensions [J].
Chan, Timothy M. .
COMPUTATIONAL GEOMETRY-THEORY AND APPLICATIONS, 2006, 35 (1-2) :20-35