An Attention-Based Predictive Agent for Static and Dynamic Environments

被引:5
作者
Baruah, Murchana [1 ,2 ]
Banerjee, Bonny [1 ,2 ]
Nagar, Atulya K. [3 ]
机构
[1] Univ Memphis, Inst Intelligent Syst, Memphis, TN 38152 USA
[2] Univ Memphis, Dept Elect & Comp Engn, Memphis, TN 38152 USA
[3] Liverpool Hope Univ, Sch Math Comp Sci & Engn, Liverpool L16 9JD, Merseyside, England
关键词
Predictive models; Videos; Visualization; Data models; Computational modeling; Decoding; Mathematical models; Agent; attention; handwriting generation; multimodal; perception; proprioception;
D O I
10.1109/ACCESS.2022.3149585
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real-world applications of intelligent agents demand accuracy and efficiency, and seldom provide reinforcement signals. Currently, most agent models are reinforcement-based and concentrate exclusively on accuracy. We propose a general-purpose agent model consisting of proprioceptive and perceptual pathways. The agent actively samples its environment via a sequence of glimpses. It completes the partial propriocept and percept sequences observed till each sampling instant, and learns where and what to sample by minimizing prediction error, without reinforcement or supervision (class labels). The model is evaluated by exposing it to two kinds of stimuli: images of fully-formed handwritten numerals and alphabets, and videos of gradual formation of numerals. It yields state-of-the-art prediction accuracy upon sampling only 22.6% of the scene on average. The model saccades when exposed to images and tracks when exposed to videos. This is the first known attention-based agent to generate realistic handwriting with state-of-the-art accuracy and efficiency by interacting with and learning end-to-end from static and dynamic environments.
引用
收藏
页码:17310 / 17317
页数:8
相关论文
共 31 条
[1]  
Baruah M., 2020, P COGSCI VIRT 29 JUL, P1171
[2]  
Chung J., 2015, Neural Information Processing Systems, V28, P2980
[3]  
Cohen G, 2017, IEEE IJCNN, P2921, DOI 10.1109/IJCNN.2017.7966217
[4]  
de Jong E. D, ARXIV161103068
[5]  
Elsayed G., 2019, NeurIPS, V32, P702
[6]  
Eslami S. M. A., 2016, ARXIV160308575, V10, P2022
[7]   Perceptions as hypotheses: saccades as experiments [J].
Friston, Karl J. ;
Adams, Rick A. ;
Perrinet, Laurent ;
Breakspear, Michael .
FRONTIERS IN PSYCHOLOGY, 2012, 3
[8]   The free-energy principle: a unified brain theory? [J].
Friston, Karl J. .
NATURE REVIEWS NEUROSCIENCE, 2010, 11 (02) :127-138
[9]   Reinforcement Learning or Active Inference? [J].
Friston, Karl J. ;
Daunizeau, Jean ;
Kiebel, Stefan J. .
PLOS ONE, 2009, 4 (07)
[10]   Attention Branch Network: Learning of Attention Mechanism for Visual Explanation [J].
Fukui, Hiroshi ;
Hirakawa, Tsubasa ;
Yamashita, Takayoshi ;
Fujiyoshi, Hironobu .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :10697-10706