Learning to Construct a Solution for the Agile Satellite Scheduling Problem With Time-Dependent Transition Times

被引:18
作者
Chen, Ming [1 ]
Du, Yonghao [1 ]
Tang, Ke [2 ,3 ]
Xing, Lining [4 ]
Chen, Yuning [1 ]
Chen, Yingwu [1 ]
机构
[1] Natl Univ Def Technol, Coll Syst Engn, Changsha 410073, Peoples R China
[2] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Shenzhen 518055, Peoples R China
[3] Southern Univ Sci & Technol, Res Inst Trustworthy Autonomous Syst, Shenzhen 518055, Peoples R China
[4] Xidian Univ, Coll Elect Engn, Xian 710126, Peoples R China
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2024年 / 54卷 / 10期
基金
中国国家自然科学基金;
关键词
Deep learning; deep reinforcement learning (DRL); feature engineering; satellite scheduling; MULTISATELLITE;
D O I
10.1109/TSMC.2024.3411640
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The agile earth observation satellite scheduling problem (AEOSSP) with time-dependent transition times is a complex combinational optimization problem that has emerged from the development of large-scale satellite management techniques. To address this problem, we propose a deep reinforcement learning-based construction model (DRL-CM) that consists of five parts: 1) a Markov decision process (MDP); 2) a feature engineering; 3) a constructive heuristic neural network (CHNN); 4) an RL training method; and 5) an evaluation system. Specifically, the CHNN comprises six modules containing three special components that we propose: a dynamic encoder, a dynamic global layer, and a two-stage attention layer. First, we build the MDP of the AEOSSP and the feature engineering with effective features required for decision-making. Second, we design the CHNN to function as the MDP policy and train it with an RL model. Finally, we propose a comprehensive evaluation system for the validation of our model. The experimental results indicate that the proposed DRL-CM outperforms the state-of-the-art algorithm in terms of both optimization speed and quality. In addition, the feature engineering and network architecture built in our model are verified to be effective in comprehensive experiments.
引用
收藏
页码:5949 / 5963
页数:15
相关论文
共 39 条
[1]  
Bi JY, 2022, ADV NEUR IN
[2]   A heuristic for the multi-satellite, multi-orbit and multi-user management of Earth observation satellites [J].
Bianchessi, Nicola ;
Cordeau, Jean-Francois ;
Desrosiers, Jacques ;
Laporte, Gilbert ;
Raymond, Vincent .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 177 (02) :750-762
[3]  
Chen, 2019, P ADV NEUR INF PROC, V32, P564
[4]   A Heuristic Construction Neural Network Method for the Time-Dependent Agile Earth Observation Satellite Scheduling Problem [J].
Chen, Jiawei ;
Chen, Ming ;
Wen, Jun ;
He, Lei ;
Liu, Xiaolu .
MATHEMATICS, 2022, 10 (19)
[5]   Heuristic algorithms based on deep reinforcement learning for quadratic unconstrained binary optimization [J].
Chen, Ming ;
Chen, Yuning ;
Du, Yonghao ;
Wei, Luona ;
Chen, Yingwu .
KNOWLEDGE-BASED SYSTEMS, 2020, 207
[6]  
Chen M, 2019, 2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), P126, DOI 10.1109/SSCI44817.2019.9002957
[7]   A mixed integer linear programming model for multi-satellite scheduling [J].
Chen, Xiaoyu ;
Reinelt, Gerhard ;
Dai, Guangming ;
Spitz, Andreas .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2019, 275 (02) :694-707
[8]   Optimization-Based Scheduling Method for Agile Earth-Observing Satellite Constellation [J].
Cho, Doo-Hyun ;
Kim, Jun-Hong ;
Choi, Han-Lim ;
Ahn, Jaemyung .
JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2018, 15 (11) :611-626
[9]   Deep Reinforcement Learning for the Agile Earth Observation Satellite Scheduling Problem [J].
Chun, Jie ;
Yang, Wenyuan ;
Liu, Xiaolu ;
Wu, Guohua ;
He, Lei ;
Xing, Lining .
MATHEMATICS, 2023, 11 (19)
[10]  
EDDY M, 2020, IEEE AEROSP C, P1