Low-Rank Approximation of Structural Redundancy for Self-Supervised Learning

被引:0
作者
Du, Kang [1 ]
Xiang, Yu [1 ]
机构
[1] Univ Utah, Salt Lake City, UT 84112 USA
来源
CAUSAL LEARNING AND REASONING, VOL 236 | 2024年 / 236卷
关键词
Self-supervised learning; redundancy; low-rank approximation; ridge regression; BOUNDS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the data-generating mechanism for reconstructive SSL to shed light on its effectiveness. With an infinite amount of labeled samples, we provide a sufficient and necessary condition for perfect linear approximation. The condition reveals a full-rank component that preserves the label classes of Y, along with a redundant component. Motivated by the condition, we propose to approximate the redundant component by a low-rank factorization and measure the approximation quality by introducing a new quantity epsilon(s), parameterized by the rank of factorization s. We incorporate epsilon(s) into the excess risk analysis under both linear regression and ridge regression settings, where the latter regularization approach is to handle scenarios when the dimension of the learned features is much larger than the number of labeled samples n for downstream tasks. We design three stylized experiments to compare SSL with supervised learning under different settings to support our theoretical findings.
引用
收藏
页码:1008 / 1032
页数:25
相关论文
共 28 条
[1]  
Alec RadfordKarthik Narasimhan., 2018, IMPROVING LANGUAGE U
[2]  
Arora Sanjeev, 2019, P MACHINE LEARNING R, V97
[3]  
Candes E, 2007, ANN STAT, V35, P2313, DOI 10.1214/009053606000001523
[4]  
Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
[5]  
Du K, 2022, EUR SIGNAL PR CONF, P1387
[6]  
Du Kang, 2023, IEEE Journal on Selected Areas in Information Theory
[7]  
Du Kang, 2023, PROC IEEE INT C ACOU, P1
[8]   ON A PROBLEM OF WEIGHTED LOW-RANK APPROXIMATION OF MATRICES [J].
Dutta, Aritra ;
Li, Xin .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2017, 38 (02) :530-553
[10]  
Gidaris S, 2018, Arxiv, DOI arXiv:1803.07728