Learning-Based Probabilistic LTL Motion Planning With Environment and Motion Uncertainties

被引:32
|
作者
Cai, Mingyu [1 ]
Peng, Hao [2 ]
Li, Zhijun [3 ]
Kan, Zhen [3 ]
机构
[1] Univ Iowa, Dept Mech Engn, Iowa City, IA 52246 USA
[2] ApexAI Inc, Palo Alto, CA 94303 USA
[3] Univ Sci & Technol China, Dept Automat, Hefei 230052, Peoples R China
关键词
Uncertainty; Probabilistic logic; Task analysis; Planning; Learning automata; Markov processes; Autonomous agents; Linear temporal logic (LTL); Markov decision process (MDP); motion planning; reinforcement learning; MARKOV DECISION-PROCESSES; LOGIC; FRAMEWORK;
D O I
10.1109/TAC.2020.3006967
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article considers control synthesis of an autonomous agent with linear temporal logic (LTL) specifications subject to environment and motion uncertainties. Specifically, the probabilistic motion of the agent is modeled by a Markov decision process (MDP) with unknown transition probabilities. The operating environment is assumed to be partially known, where the desired LTL specifications might be partially infeasible. A relaxed product MDP is constructed that allows the agent to revise its motion plan without strictly following the desired LTL constraints. A utility function composed of violation cost and state rewards is developed. Rigorous analysis shows that, if there almost surely (i.e., with probability 1) exists a policy that satisfies the relaxed product MDP, any algorithm that optimizes the expected utility is guaranteed to find such a policy. A reinforcement learning-based approach is then developed to generate policies that fulfill the desired LTL specifications as much as possible by optimizing the expected discount utility of the relaxed product MDP.
引用
收藏
页码:2386 / 2392
页数:7
相关论文
共 50 条
  • [1] Optimal Probabilistic Motion Planning With Potential Infeasible LTL Constraints
    Cai, Mingyu
    Xiao, Shaoping
    Li, Zhijun
    Kan, Zhen
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (01) : 301 - 316
  • [2] A survey of learning-based robot motion planning
    Wang, Jiankun
    Zhang, Tianyi
    Ma, Nachuan
    Li, Zhaoting
    Ma, Han
    Meng, Fei
    Meng, Max Q. -H.
    IET CYBER-SYSTEMS AND ROBOTICS, 2021, 3 (04) : 302 - 314
  • [3] Learning-based Adaptive Sampling for Manipulator Motion Planning
    Gaebert, Carl
    Thomas, Ulrike
    2022 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2022, : 715 - 721
  • [4] Probabilistic roadmaps: A motion planning approach based on active learning
    Latombe, Jean-Claude
    PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2, 2006, : 1 - +
  • [5] Motion Planning Networks: Bridging the Gap Between Learning-Based and Classical Motion Planners
    Qureshi, Ahmed Hussain
    Miao, Yinglong
    Simeonov, Anthony
    Yip, Michael C.
    IEEE TRANSACTIONS ON ROBOTICS, 2021, 37 (01) : 48 - 66
  • [6] Reinforcement Learning-Based Motion Planning for Automatic Parking System
    Zhang, Jiren
    Chen, Hui
    Song, Shaoyu
    Hu, Fengwei
    IEEE ACCESS, 2020, 8 : 154485 - 154501
  • [7] Parting with Misconceptions about Learning-based Vehicle Motion Planning
    Dauner, Daniel
    Hallgarten, Marcel
    Geiger, Andreas
    Chitta, Kashyap
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [8] Temporal Logic Based Motion Planning with Infeasible LTL Specification
    Xie, Guoshan
    Yin, Zhihong
    Li, Jianqing
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 4899 - 4904
  • [9] A Graphical Language for LTL Motion and Mission Planning
    Srinivas, Shashank
    Kermani, Ramtin
    Kim, Kangjin
    Kobayashi, Yoshihiro
    Fainekos, Georgios
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2013, : 704 - 709
  • [10] A review of mobile robot motion planning methods: from classical motion planning workflows to reinforcement learning-based architectures
    Dong, Lu
    He, Zichen
    Song, Chunwei
    Sun, Changyin
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2023, 34 (02) : 439 - 459