Solving Partially Observable Environments with Universal Search Using Dataflow Graph-Based Programming Model

被引:1
作者
Paul, Swarna Kamal [1 ,2 ]
Bhaumik, Parama [2 ]
机构
[1] Tata Consultancy Serv, Kolkata, India
[2] Jadavpur Univ, Dept Informat Technol, Kolkata 700032, India
关键词
Artificial intelligence; dataflow graph programming; functional programming; POMDP; reinforcement learning; universal search; FORMAL THEORY; ALGORITHM;
D O I
10.1080/03772063.2021.2004461
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Universal search allows one to create an asymptotically optimal intelligent agent, which can act in a wide range of computable environments. However combinatorial explosion is still lurking behind though its dependency shifted from problem size to solution size. To combat this scenario, we propose a dataflow graph-based functional programming model to be used in the universal search for solution program generation. We have justified the superiority of our proposed model compared to sequential token-based languages when used in universal search-based intelligent agents. We have shown how applying an equivalent program pruning strategy can handle the problem of semantically redundant program generation. An incremental learning strategy based on gradient ascent is also proposed for our designed agent. Experimental results positively reinforced the theoretical justifications. We used our agent to solve some partially observable environments and compared them with the current state of art methods and it reveals the exceptional performance of our agent.
引用
收藏
页码:6137 / 6151
页数:15
相关论文
共 33 条
  • [1] Akidau T, 2015, PROC VLDB ENDOW, V8, P1792
  • [2] Bhaumik, 2018, ICIBCA, P310
  • [3] Bird Richard, 1988, Introduction to Functional Programming
  • [4] Computation semantics of the functional scientific workflow language Cuneiform
    Brandt, Joergen
    Reisig, Wolfgang
    Leser, Ulf
    [J]. JOURNAL OF FUNCTIONAL PROGRAMMING, 2017, 27
  • [5] Cover T. M., 1999, Elements of Information Theory
  • [6] Glasmachers Tobias, 2011, Artificial General Intelligence. Proceedings 4th International Conference, AGI 2011, P52, DOI 10.1007/978-3-642-22887-2_6
  • [7] Grunwald P. D., 2008, Handbook of the Philosophy of Information, P281
  • [8] Hausknecht M., 2015, 2015 AAAI FALL S SER, P29
  • [9] Generalising monads to arrows
    Hughes, J
    [J]. SCIENCE OF COMPUTER PROGRAMMING, 2000, 37 (1-3) : 67 - 111
  • [10] Hutter M., 2003, A gentle introduction to the universal algorithmic agent AIXI