Multi-class Gaussian Process Classification with Noisy Inputs

被引:0
作者
Villacampa-Calvo, Carlos [1 ]
Zaldivar, Bryan [2 ,3 ]
Garrido-Merchan, Eduardo C. [1 ]
Hernandez-Lobato, Daniel [1 ]
机构
[1] Univ Autonoma Madrid, Comp Sci Dept, Madrid 28049, Spain
[2] Univ Autonoma Madrid, Theoret Phys Dept, Madrid 28049, Spain
[3] Inst Fis Teor CA, Madrid 28049, Spain
关键词
Gaussian processes; Multi-class classification; Input dependent noise; VARIATIONAL INFERENCE; BAYESIAN-INFERENCE; REGRESSION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is a common practice in the machine learning community to assume that the observed data are noise-free in the input attributes. Nevertheless, scenarios with input noise are common in real problems, as measurements are never perfectly accurate. If this input noise is not taken into account, a supervised machine learning method is expected to perform sub-optimally. In this paper, we focus on multi-class classification problems and use Gaussian processes (GPs) as the underlying classifier. Motivated by a data set coming from the astrophysics domain, we hypothesize that the observed data may contain noise in the inputs. Therefore, we devise several multi-class GP classifiers that can account for input noise. Such classifiers can be efficiently trained using variational inference to approximate the posterior distribution of the latent variables of the model. Moreover, in some situations, the amount of noise can be known before-hand. If this is the case, it can be readily introduced in the proposed methods. This prior information is expected to lead to better performance results. We have evaluated the proposed methods by carrying out several experiments, involving synthetic and real data. These include several data sets from the UCI repository, the MNIST data set and a data set coming from astrophysics. The results obtained show that, although the classification error is similar across methods, the predictive distribution of the proposed methods is better, in terms of the test log-likelihood, than the predictive distribution of a classifier based on GPs that ignores input noise.
引用
收藏
页数:52
相关论文
共 70 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]   Fermi Large Area Telescope Fourth Source Catalog [J].
Abdollahi, S. ;
Acero, F. ;
Ackermann, M. ;
Ajello, M. ;
Atwood, W. B. ;
Axelsson, M. ;
Baldini, L. ;
Ballet, J. ;
Barbiellini, G. ;
Bastieri, D. ;
Becerra Gonzalez, J. ;
Bellazzini, R. ;
Berretta, A. ;
Bissaldi, E. ;
Blandford, R. D. ;
Bloom, E. D. ;
Bonino, R. ;
Bottacini, E. ;
Brandt, T. J. ;
Bregeon, J. ;
Bruel, P. ;
Buehler, R. ;
Burnett, T. H. ;
Buson, S. ;
Cameron, R. A. ;
Caputo, R. ;
Caraveo, P. A. ;
Casandjian, J. M. ;
Castro, D. ;
Cavazzuti, E. ;
Charles, E. ;
Chaty, S. ;
Chen, S. ;
Cheung, C. C. ;
Chiaro, G. ;
Ciprini, S. ;
Cohen-Tanugi, J. ;
Cominsky, L. R. ;
Coronado-Blazquez, J. ;
Costantin, D. ;
Cuoco, A. ;
Cutini, S. ;
D'Ammando, F. ;
DeKlotz, M. ;
Luque, P. de la Tone ;
de Palma, F. ;
Desai, A. ;
Digel, S. W. ;
Di Lalla, N. ;
Di Mauro, M. .
ASTROPHYSICAL JOURNAL SUPPLEMENT SERIES, 2020, 247 (01)
[3]  
[Anonymous], 2018, Advances in Neural Information Processing Systems
[4]  
[Anonymous], 2014, PROC INT C LEARN REP, Patent No. 13126114
[5]  
[Anonymous], 2005, P 22 INT C MACHINE L, DOI DOI 10.1145/1102351.1102413
[6]  
[Anonymous], 2000, ACM SIGKDD Explor. Newsl.
[7]  
[Anonymous], 2006, Springer google schola, DOI [10.1117/1.2819119, DOI 10.18637/JSS.V017.B05]
[8]  
[Anonymous], 2012, MACHINE LEARNING PRO
[9]  
Armand A, 2013, IEEE INT C INTELL TR, P1650, DOI 10.1109/ITSC.2013.6728466
[10]  
Barford N.C., 1985, EXPT MEASUREMENTS PR