RL-AKF: An Adaptive Kalman Filter Navigation Algorithm Based on Reinforcement Learning for Ground Vehicles

Cited by: 54

Authors:

Gao, Xile [1]

Luo, Haiyong [1]

Ning, Bokun [2]

Zhao, Fang [2]

Bao, Linfeng [1]

Gong, Yilin [2]

Xiao, Yimin [2]

Jiang, Jinguang [3]

Affiliations:

[1] Chinese Academy of Sciences, Institute of Computing Technology, Beijing Key Laboratory of Mobile Computing and Pervasive Device, Beijing 100190, People's Republic of China

[2] Beijing University of Posts and Telecommunications, School of Software Engineering, Beijing 100876, People's Republic of China

[3] Wuhan University, GNSS Research Center, Wuhan 430072, People's Republic of China

Source:

REMOTE SENSING | 2020 / Vol. 12 / Issue 11

Funding:

National Natural Science Foundation of China; Beijing Natural Science Foundation

Keywords:

integrated navigation; Kalman filter; process noise covariance estimation; reinforcement learning; deep deterministic policy gradient; MONOCULAR VISION; GNSS; IDENTIFICATION; COVARIANCE

DOI:

10.3390/rs12111704

Chinese Library Classification (CLC) number:

X [Environmental Science, Safety Science]

Discipline classification code:

08; 0830

Abstract:

The Kalman filter is a commonly used method in Global Navigation Satellite System (GNSS)/Inertial Navigation System (INS) integrated navigation systems. Its process noise covariance matrix has a significant influence on positioning accuracy, and a process noise covariance matrix with large errors can even cause the filter to diverge. Although many studies have addressed process noise covariance estimation, existing methods still adapt poorly to dynamic and complex environments. To obtain accurate and robust localization results in such environments, we propose an adaptive Kalman filter navigation algorithm, called RL-AKF, which adaptively estimates the process noise covariance matrix using a reinforcement learning approach. Taking the integrated navigation system as the environment and the negative of the current positioning error as the reward, RL-AKF uses the deep deterministic policy gradient to obtain the optimal process noise covariance matrix estimate from a continuous action space. Extensive experimental results show that the proposed algorithm accurately estimates the process noise covariance matrix and is robust across different data collection times, different GNSS outage durations, and different integrated navigation fusion schemes. RL-AKF achieves an average positioning error of 0.6517 m within a 10 s GNSS outage for the GNSS/INS integrated navigation system, and 14.9426 m and 15.3380 m within a 300 s GNSS outage for the GNSS/INS/Odometer (ODO) and GNSS/INS/Non-Holonomic Constraint (NHC) integrated navigation systems, respectively.
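The environment/reward framing described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a scalar random-walk Kalman filter instead of a GNSS/INS error-state filter, it does not include a DDPG agent, and every name (`ScalarKalmanEnv`, `mean_reward`, `true_q`, `r`) is hypothetical. It only shows the interface idea: the action is the process noise variance `q` fed to the filter, and the reward is the negative of the resulting positioning error.

```python
import random


class ScalarKalmanEnv:
    """Toy stand-in for the navigation filter (hypothetical, not from the paper).

    The agent's action is the process noise variance q used in the predict
    step; the reward is the negative absolute positioning error.
    """

    def __init__(self, true_q=0.04, r=1.0, seed=0):
        self.true_q = true_q          # variance of the simulated process noise
        self.r = r                    # measurement noise variance (assumed known)
        self.rng = random.Random(seed)
        self.reset()

    def reset(self):
        self.truth = 0.0              # simulated true position
        self.x = 0.0                  # filter state estimate
        self.p = 1.0                  # filter error variance
        return (self.x, self.p)

    def step(self, q):
        if q <= 0.0:
            raise ValueError("process noise variance must be positive")
        # Simulate the true random walk and a noisy position measurement.
        self.truth += self.rng.gauss(0.0, self.true_q ** 0.5)
        z = self.truth + self.rng.gauss(0.0, self.r ** 0.5)
        # Predict: the state model is the identity, so only the variance grows.
        p_pred = self.p + q
        # Update with the scalar Kalman gain.
        k = p_pred / (p_pred + self.r)
        self.x = self.x + k * (z - self.x)
        self.p = (1.0 - k) * p_pred
        # Reward is the negative of the current positioning error.
        reward = -abs(self.x - self.truth)
        return (self.x, self.p), reward


def mean_reward(q, steps=2000, seed=0):
    """Average reward obtained when the same q is applied at every step."""
    env = ScalarKalmanEnv(seed=seed)
    total = 0.0
    for _ in range(steps):
        _, reward = env.step(q)
        total += reward
    return total / steps


if __name__ == "__main__":
    for q in (0.04, 1.0, 100.0):
        print(q, mean_reward(q))
```

In this toy setting a learning agent would replace the fixed `q` passed to `step` with the output of a policy; the abstract states that RL-AKF uses the deep deterministic policy gradient for that role, since the action space is continuous.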

Pages: 25