Memory Bounds for Continual Learning

被引:5
作者
Chen, Xi [1 ]
Papadimitriou, Christos [1 ]
Peng, Binghui [1 ]
机构
[1] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
来源
2022 IEEE 63RD ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS) | 2022年
关键词
Continual learning; lifelong learning; foundation of machine learning; COMPLEXITY;
D O I
10.1109/FOCS54457.2022.00056
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Continual learning, or lifelong learning, is a formidable current challenge to machine learning. It requires the learner to solve a sequence of k different learning tasks, one after the other, while retaining its aptitude for earlier tasks; the continual learner should scale better than the obvious solution of developing and maintaining a separate learner for each of the k tasks. We embark on a complexity-theoretic study of continual learning in the PAC framework. We make novel uses of communication complexity to establish that any continual learner, even an improper one, needs memory that grows linearly with k, strongly suggesting that the problem is intractable. When logarithmically many passes over the learning tasks are allowed, we provide an algorithm based on multiplicative weights update whose memory requirement scales well; we also establish that improper learning is necessary for such performance. We conjecture that these results may lead to new promising approaches to continual learning.
引用
收藏
页码:519 / 530
页数:12
相关论文
共 55 条
  • [1] Rusu AA, 2016, Arxiv, DOI arXiv:1606.04671
  • [2] Limitations of quantum advice and one-way communication
    Aaronson, S
    [J]. 19TH IEEE ANNUAL CONFERENCE ON COMPUTATIONAL COMPLEXITY, PROCEEDINGS, 2004, : 320 - 332
  • [3] [Anonymous], 2012, P C LEARN THEOR
  • [4] Arora S, 2012, Theory Comput., V8, P121
  • [5] Balcan, 2015, P C LEARNING THEORY, P191
  • [6] HOW TO COMPRESS INTERACTIVE COMMUNICATION
    Barak, Boaz
    Braverman, Mark
    Chen, Xi
    Rao, Anup
    [J]. SIAM JOURNAL ON COMPUTING, 2013, 42 (03) : 1327 - 1363
  • [7] Beimel A, 2019, J MACH LEARN RES, V20
  • [8] Blum A., 2017, Proceedings of the 31st International Conference on Neural Information Processing Systems, V30, P2389
  • [9] Braverman M, 2021, PR MACH LEARN RES, V134, P724
  • [10] INTERACTIVE INFORMATION COMPLEXITY
    Braverman, Mark
    [J]. SIAM JOURNAL ON COMPUTING, 2015, 44 (06) : 1698 - 1739