Recurrent Neural Networks and Their Memory Behavior: A Survey

被引:20
作者
Su, Yuanhang [1 ]
Kuo, C. -C. Jay [1 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
关键词
Recurrent neural networks; long short-term memory; gated recurrent unit; bidirectional recurrent neural networks; natural language processing; PROBABILISTIC FUNCTIONS;
D O I
10.1561/116.00000123
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
After their inception in the late 1980s, recurrent neural networks (RNNs) as a sequence computing model have seen mushrooming interests in communities of natural language processing, speech recognition, computer vision, etc. Recent variations of RNNs have made breakthroughs in fields such as machine translation where machines can achieve human level quality. RNNs assisted speech recognition technology is providing services on subtitles for live streaming videos. In this survey, we will offer a historical perspective by walking through the early years of RNNs all the way to their modern forms, detailing their most popular architectural designs and, perhaps more importantly, demystify the mathematical aspect behind their memory behaviors.
引用
收藏
页数:38
相关论文
共 79 条
[1]  
[Anonymous], 1997, Advances in Psychology
[2]  
[Anonymous], 2014, P 2014 C EMP METH NA, DOI DOI 10.3115/V1/D14-1179
[3]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[4]  
Balle J., 2018, P INT C LEARN REPR, P1
[5]   Hierarchical Boundary-Aware Neural Encoder for Video Captioning [J].
Baraldi, Lorenzo ;
Grana, Costantino ;
Cucchiara, Rita .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3185-3194
[6]  
Baskar MK, 2017, INT CONF ACOUST SPEE, P4810, DOI 10.1109/ICASSP.2017.7953070
[7]   STATISTICAL INFERENCE FOR PROBABILISTIC FUNCTIONS OF FINITE STATE MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T .
ANNALS OF MATHEMATICAL STATISTICS, 1966, 37 (06) :1554-&
[8]   A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T ;
SOULES, G ;
WEISS, N .
ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) :164-&
[9]  
Bengio Y, 2001, ADV NEUR IN, V13, P932
[10]  
Bertasius G, 2021, PR MACH LEARN RES, V139