Posterior consistency of semi-supervised regression on graphs

被引:6
作者
Bertozzi, Andrea L. [1 ]
Hosseini, Bamdad [2 ]
Li, Hao [1 ]
Miller, Kevin [1 ]
Stuart, Andrew M. [2 ]
机构
[1] Univ Calif Los Angeles, Dept Math, Los Angeles, CA 90095 USA
[2] CALTECH, Comp & Math Sci, Pasadena, CA 91125 USA
基金
加拿大自然科学与工程研究理事会;
关键词
semi-supervised learning; classification; consistency; graph Laplacian; Bayesian inference; SPECTRAL PARTITIONING WORKS; PLANAR GRAPHS; CLASSIFICATION; KERNEL; RATES; REGULARIZATION; CONTRACTION;
D O I
10.1088/1361-6420/ac1e80
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Graph-based semi-supervised regression (SSR) involves estimating the value of a function on a weighted graph from its values (labels) on a small subset of the vertices; it can be formulated as a Bayesian inverse problem. This paper is concerned with the consistency of SSR in the context of classification, in the setting where the labels have small noise and the underlying graph weighting is consistent with well-clustered vertices. We present a Bayesian formulation of SSR in which the weighted graph defines a Gaussian prior, using a graph Laplacian, and the labeled data defines a likelihood. We analyze the rate of contraction of the posterior measure around the ground truth in terms of parameters that quantify the small label error and inherent clustering in the graph. We obtain bounds on the rates of contraction and illustrate their sharpness through numerical experiments. The analysis also gives insight into the choice of hyperparameters that enter the definition of the prior.
引用
收藏
页数:32
相关论文
共 54 条
[1]   Posterior contraction rates for the Bayesian approach to linear ill-posed inverse problems [J].
Agapiou, Sergios ;
Larsson, Stig ;
Stuart, Andrew M. .
STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 2013, 123 (10) :3828-3860
[2]  
[Anonymous], 2005, SEMISUPERVISED LEARN
[3]  
[Anonymous], 2007, International Conference on Artificial Intelligence and Statistics
[4]  
[Anonymous], 2003, ICML 2003 WORKSH CON
[5]  
[Anonymous], 2007, An Introduction to Bayesian Scientific Computing: Ten Lectures on Subjective Computing
[6]   Solving inverse problems using data-driven models [J].
Arridge, Simon ;
Maass, Peter ;
Oktem, Ozan ;
Schonlieb, Carola-Bibiane .
ACTA NUMERICA, 2019, 28 :1-174
[7]   Regularization and semi-supervised learning on large graphs [J].
Belkin, M ;
Matveeva, I ;
Niyogi, P .
LEARNING THEORY, PROCEEDINGS, 2004, 3120 :624-638
[8]  
Belkin M, 2002, ADV NEUR IN, V14, P585
[9]  
Belkin M, 2006, J MACH LEARN RES, V7, P2399
[10]   Uncertainty Quantification in Graph-Based Classification of High Dimensional Data [J].
Bertozzi, Andrea L. ;
Luo, Xiyang ;
Stuart, Andrew M. ;
Zygalakis, Konstantinos C. .
SIAM-ASA JOURNAL ON UNCERTAINTY QUANTIFICATION, 2018, 6 (02) :568-595