High-Level Decision Making in a Hierarchical Control Framework: Integrating HMDP and MPC for Autonomous Systems

Cited by: 0

Authors:

Wang, Xue-Fang [1]

Jiang, Jingjing [2]

Chen, Wen-Hua [2]

Affiliations:

[1] Univ Leicester, Sch Engn, Leicester LE1 7RH, England

[2] Loughborough Univ, Dept Aeronaut & Automot Engn, Loughborough LE11 3TU, England

Source:

IEEE TRANSACTIONS ON CYBERNETICS | 2025 / Vol. 55 / Issue 04

Funding:

UK Engineering and Physical Sciences Research Council (EPSRC);

Keywords:

Decision making; Vehicle dynamics; Safety; Control systems; Dynamical systems; Autonomous vehicles; Automobiles; Uncertainty; Trajectory; Switched systems; Autonomous decision making; hybrid Markov decision process (HMDP); model predictive control (MPC); safety and optimality; unified hierarchical control framework; MODEL;

DOI:

10.1109/TCYB.2025.3535159

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Subject classification code:

0812 ;

Abstract:

This article addresses the challenges of autonomous decision making influenced by discrete system states, underlying continuous dynamics, and evolving operational environments. A comprehensive framework is proposed, encompassing new modeling, problem formulation, control design, and stability analysis. The framework integrates continuous system dynamics, used for low-level control, with discrete Markov decision processes (MDP) for high-level decision making. To capture the interactions between these domains, the decision-making system is modeled as a hybrid system consisting of a controlled MDP and autonomous (uncontrolled) continuous dynamics, collectively referred to as the hybrid Markov decision process (HMDP). The design focuses on ensuring safety and optimality by accounting for both discrete and continuous state variables across different levels. With the help of the model predictive control (MPC) concept, a decision-making scheme is developed for the hybrid model, with guarantees for recursive feasibility and stability. The proposed framework is applied to an autonomous lane-changing system for intelligent vehicles, and simulation shows its capability to handle diverse behaviors in dynamic and complex environments.


Pages: 1903-1916

Page count: 14

References: 45 in total

