Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies

Cited by: 0
Authors
Seyde, Tim [1]
Gilitschenski, Igor [2]
Schwarting, Wilko [1]
Stellato, Bartolomeo [3]
Riedmiller, Martin [4]
Wulfmeier, Markus [4]
Rus, Daniela [1]
Affiliations
[1] MIT CSAIL, Cambridge, MA 02139 USA
[2] Univ Toronto, Toronto, ON, Canada
[3] Princeton Univ, Princeton, NJ 08544 USA
[4] DeepMind, London, England
Source
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021 / Vol. 34
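Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract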
Reinforcement learning (RL) for continuous control typically employs distributions whose support covers the entire action space. In this work, we investigate the colloquially known phenomenon that trained agents often prefer actions at the boundaries of that space. We draw theoretical connections to the emergence of bang-bang behavior in optimal control, and provide extensive empirical evaluation across a variety of recent RL algorithms. We replace the normal Gaussian by a Bernoulli distribution that solely considers the extremes along each action dimension - a bang-bang controller. Surprisingly, this achieves state-of-the-art performance on several continuous control benchmarks - in contrast to robotic hardware, where energy and maintenance cost affect controller choices. Since exploration, learning, and the final solution are entangled in RL, we provide additional imitation learning experiments to reduce the impact of exploration on our analysis. Finally, we show that our observations generalize to environments that aim to model real-world challenges and evaluate factors to mitigate the emergence of bang-bang solutions. Our findings emphasise challenges for benchmarking continuous control algorithms, particularly in light of potential real-world applications.
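The Bernoulli parameterization mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `sigmoid` and `bang_bang_action`, the per-dimension logit input, and the bounds `low` and `high` are assumptions made for illustration. Each action dimension draws from a Bernoulli distribution and is mapped to one of the two extremes of its range.

```python
import math
import random


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def bang_bang_action(logits, low, high, rng=random):
    """Sample an action whose every dimension is either low[i] or high[i].

    logits: hypothetical per-dimension policy outputs; sigmoid(logit) is
    taken as the probability of choosing the upper bound in that dimension.
    """
    action = []
    for logit, lo, hi in zip(logits, low, high):
        p_high = sigmoid(logit)
        # Bernoulli draw: upper extreme with probability p_high, else lower.
        action.append(hi if rng.random() < p_high else lo)
    return action


# Example: a 3-dimensional action space bounded by [-1, 1] in each dimension.
rng = random.Random(0)
sample = bang_bang_action([0.0, 2.0, -2.0], [-1.0] * 3, [1.0] * 3, rng)
```

Every sampled component lies on the boundary of the action range, which is the bang-bang behavior the abstract describes; a Gaussian policy, by contrast, would place probability mass on interior values as well.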
Pages: 13
Related papers
50 in total
  • [41] To robust bang-bang body attitude bounded control
    Vorotnikov, V.I.
    Doklady Akademii Nauk, 2001, 381 (04) : 487 - 492
  • [42] Fuzzy bang-bang control of a switching voltage regulator
    Bizon, N.
    Oproescu, M.
    Raducu, M.
    2008 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS (AQTR 2008), THETA 16TH EDITION, VOL II, PROCEEDINGS, 2008, : 192 - 197
  • [43] Modified bang-bang piezoelectric control of vibrating beams
    Bruch, JC
    Sloss, JM
    Adali, S
    Sadek, IS
    SMART MATERIALS & STRUCTURES, 1999, 8 (05): : 647 - 653
  • [44] NOVEL BANG-BANG ALGORITHM FOR DIRECT DIGITAL CONTROL
    ZOSS, LM
    VETTER, KH
    MORTIMER, K
    INSTRUMENTATION TECHNOLOGY, 1970, 17 (04): : 59 - &
  • [46] SIMPLIFYING AND IMPROVING THE ACCURACY OF A BANG-BANG CONTROL LOOP
    REDFERN, T
    CONTROL ENGINEERING, 1987, 34 (02) : 98 - 101
  • [47] Multistage Uncertain Random Bang-Bang Optimal Control
    Chen, Xin
    Yan, Hongyan
    Zhu, Yuanguo
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 2077 - 2082
  • [48] Thermal equilibrium control by frequent bang-bang modulation
    Yang, Cheng-Xi
    Wang, Xiang-Bin
    PHYSICAL REVIEW E, 2010, 81 (05):
  • [49] Modified bang-bang piezoelectric control of vibrating beams
    Dept. of Mech. and Environ. Eng., University of California, Santa Barbara, CA, United States
    Unknown
    Unknown
    Unknown
    Smart Mater Struct, 5 (647-653):
  • [50] Bang-Bang Control Applied in Airfoil Roll Control with Plasma Actuators
    Wei, Qingkai
    Niu, Zhongguo
    Chen, Bao
    Huang, Xun
    JOURNAL OF AIRCRAFT, 2013, 50 (02): : 670 - 677