Memory Efficient Experience Replay for Streaming Learning

被引:61
作者
Hayes, Tyler L. [1 ]
Cahill, Nathan D. [1 ]
Kanan, Christopher [1 ]
机构
[1] Rochester Inst Technol, Carlson Ctr Imaging Sci, Rochester, NY 14623 USA
来源
2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2019年
关键词
WEIGHTED MAJORITY; NEURAL-NETWORK; ALGORITHM; ARTMAP; CLASSIFICATION;
D O I
10.1109/icra.2019.8793982
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In supervised machine learning, an agent is typically trained once and then deployed. While this works well for static settings, robots often operate in changing environments and must quickly learn new things from data streams. In this paradigm, known as streaming learning, a learner is trained online, in a single pass, from a data stream that cannot be assumed to be independent and identically distributed (iid). Streaming learning will cause conventional deep neural networks (DNNs) to fail for two reasons: 1) they need multiple passes through the entire dataset; and 2) non-iid data will cause catastrophic forgetting. An old fix to both of these issues is rehearsal. To learn a new example, rehearsal mixes it with previous examples, and then this mixture is used to update the DNN. Full rehearsal is slow and memory intensive because it stores all previously observed examples, and its effectiveness for preventing catastrophic forgetting has not been studied in modern DNNs. Here, we describe the ExStream algorithm for memory efficient rehearsal and compare it to alternatives. We find that full rehearsal can eliminate catastrophic forgetting in a variety of streaming learning settings, with ExStream performing well using far less memory and computation.
引用
收藏
页码:9769 / 9776
页数:8
相关论文
共 75 条
[1]   Memory retention - the synaptic stability versus plasticity dilemma [J].
Abraham, WC ;
Robins, A .
TRENDS IN NEUROSCIENCES, 2005, 28 (02) :73-78
[2]  
Aggarwal C., 2004, P 30 INT C VER LARG, V30, P852
[3]  
[Anonymous], 2004, Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, DOI DOI 10.1145/1014052.1014110
[4]  
[Anonymous], 2018, STREAM INFRASTRUCTUR
[5]  
[Anonymous], 1996, SIGMOD REC ACM SPEC, DOI DOI 10.1145/235968.233324
[6]  
[Anonymous], 2011, Technical Report CNS-TR-2011-001
[7]  
[Anonymous], 2018, P AUSTR COMP SCI WEE
[8]  
[Anonymous], 2017, INFORM FUSION, DOI DOI 10.1016/j.inffus.2017.02.004
[9]  
[Anonymous], 2007, PROC 24 INT C MACHIN, DOI DOI 10.1145/1273496.1273521
[10]  
[Anonymous], 2003, P 29 INT C VER LARG