Reinforcement Learning Based Data Fusion Method for Multi-Sensors

被引:37
作者
Zhou, Tongle [1 ]
Chen, Mou [1 ]
Zou, Jie [2 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Automat Engn, Nanjing 211106, Peoples R China
[2] Luoyang Inst Electroopt Equipment Av, Sci & Technol Elect Opt Control Lab, Luoyang 471023, Peoples R China
关键词
Air combat; cubic B-spline interpolation; data fusion; reinforcement learning; MULTISENSOR DATA FUSION; ITERATIVE APPROXIMATION; EXOSKELETON; ROBOT;
D O I
10.1109/JAS.2020.1003180
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In order to improve detection system robustness and reliability, multi-sensors fusion is used in modern air combat. In this paper, a data fusion method based on reinforcement learning is developed for multi-sensors. Initially, the cubic B-spline interpolation is used to solve time alignment problems of multi-source data. Then, the reinforcement learning based data fusion (RLBDF) method is proposed to obtain the fusion results. With the case that the priori knowledge of target is obtained, the fusion accuracy reinforcement is realized by the error between fused value and actual value. Furthermore, the Fisher information is instead used as the reward if the priori knowledge is unable to be obtained. Simulations results verify that the developed method is feasible and effective for the multi-sensors data fusion in air combat.
引用
收藏
页码:1489 / 1497
页数:9
相关论文
共 32 条
[1]  
[Anonymous], 2018, IEEE T IND ELECT, DOI DOI 10.1093/JIPM/PMX029
[2]  
[Anonymous], 2015, IEEE T INTELLIGENT T
[3]  
[Anonymous], 2017, REINFORCEMENT LEARNI
[4]   Progressive iterative approximation for triangular Bezier surfaces [J].
Chen, Jie ;
Wang, Guo-Jin .
COMPUTER-AIDED DESIGN, 2011, 43 (08) :889-895
[5]   Distributed Filtering Algorithm Based on Tunable Weights Under Untrustworthy Dynamics [J].
Chen, Shiming ;
Chen, Xiaoling ;
Pei, Zhengkai ;
Zhang, Xingxing ;
Fang, Huajing .
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2016, 3 (02) :225-232
[6]   Progressive and iterative approximation for least squares B-spline curve and surface fitting [J].
Deng, Chongyang ;
Lin, Hongwei .
COMPUTER-AIDED DESIGN, 2014, 47 :32-44
[7]   Adaptive Neural Network-Based Control for a Class of Nonlinear Pure-Feedback Systems With Time-Varying Full State Constraints [J].
Gao, Tingting ;
Liu, Yan-Jun ;
Liu, Lei ;
Li, Dapeng .
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2018, 5 (05) :923-933
[8]  
Hall DL, 1997, P IEEE, V85, P6, DOI 10.1109/ISCAS.1998.705329
[9]   B-spline surface fitting by iterative geometric interpolation/approximation algorithms [J].
Kineri, Yuki ;
Wang, Mingsi ;
Lin, Hongwei ;
Maekawa, Takashi .
COMPUTER-AIDED DESIGN, 2012, 44 (07) :697-708
[10]  
Lascara B. J., 2013, 2013 INT COMM NAV SU, P1