Learning Interpretable, High-Performing Policies for Autonomous Driving

被引:0
|
作者
Paleja, Rohan [1 ]
Niu, Yam [1 ]
Silva, Andrew [1 ]
Ritchie, Chace [1 ]
Choi, Sugju [1 ]
Gombolay, Matthew [1 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
基金
美国国家科学基金会;
关键词
TREE REGULARIZATION; DECISION TREES; BLACK-BOX; MODELS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gradient-based approaches in reinforcement learning (RL) have achieved tremendous success in learning policies for autonomous vehicles. While the performance of these approaches warrants real-world adoption, these policies lack interpretability, limiting deployability in the safety-critical and legally-regulated domain of autonomous driving (AD). AD requires interpretable and verifiable control policies that maintain high performance. We propose Interpretable Continuous Control Trees (ICCTs), a tree-based model that can be optimized via modern, gradient-based, RL approaches to produce high-performing, interpretable policies. The key to our approach is a procedure for allowing direct optimization in a sparse decision-tree-like representation. We validate ICCTs against baselines across six domains, showing that ICCTs are capable of learning interpretable policy representations that parity or outperform baselines by up to 33% in AD scenarios while achieving a 300x-600x reduction in the number of policy parameters against deep learning baselines. Furthermore, we demonstrate the interpretability and utility of our ICCTs through a 14-car physical robot demonstration.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Building high-performing and integrated project teams
    Ahiaga-Dagbui, Dominic D.
    Tokede, Olubukola
    Morrison, John
    Chirnside, Anthony
    ENGINEERING CONSTRUCTION AND ARCHITECTURAL MANAGEMENT, 2020, 27 (10) : 3341 - 3361
  • [42] High-performing lubricants based on renewable resources
    Legrand, J
    AGRO FOOD INDUSTRY HI-TECH, 1998, 9 (05): : 16 - 18
  • [43] Creating high-performing software development teams
    Pattit, JM
    Wilemon, D
    R & D MANAGEMENT, 2005, 35 (04) : 375 - 393
  • [44] High-performing lubricants based on renewable resources
    Legrand, J
    Dürr, K
    BIOMASS FOR ENERGY AND INDUSTRY, 1998, : 90 - 92
  • [45] High-performing teams: Is collective intelligence the answer?
    Rowe, Luke I.
    Hattie, John
    Munro, John
    PLOS ONE, 2024, 19 (08):
  • [46] Project teams and high-performing project culture
    Jokinen, Tauno
    Muhos, Matti
    Peltoniemi, Mirja
    Proceedings of IRNOP VII Project Research Conference, 2006, : 176 - 185
  • [47] Automatic Searching of Lightweight and High-Performing CNN Architectures for EEG-Based Driving Fatigue Detection
    Li, Qingqing
    Luo, Zhirui
    Qi, Ruobin
    Zheng, Jun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 11
  • [48] Can Low-Performing Hospitals Train High-Performing Residents?
    Legnini, Mark W.
    AMERICAN JOURNAL OF MEDICAL QUALITY, 2011, 26 (05) : 408 - 410
  • [49] Exploring the Learning Experience of High-Performing Preclinical Undergraduate Dental Students: A Qualitative Study
    Lin, Galvin Sim Siang
    Tan, Wen Wu
    Afrashtehfar, Kelvin I.
    EDUCATION SCIENCES, 2022, 12 (11):
  • [50] Using Semantic Information to improve Generalization of Reinforcement Learning Policies for Autonomous Driving
    Carton, Florence
    Filliat, David
    Rabarisoa, Jaonary
    Quoc Cuong Pham
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2021), 2021, : 144 - 151