Generalized Model and Deep Reinforcement Learning-Based Evolutionary Method for Multitype Satellite Observation Scheduling

Cited by: 32
Authors
Song, Yanjie [1 ,2 ]
Ou, Junwei [3 ,4 ]
Pedrycz, Witold [5 ,6 ,7 ]
Suganthan, Ponnuthurai Nagaratnam [8 ]
Wang, Xinwei [9 ]
Xing, Lining [10 ]
Zhang, Yue [11 ]
Affiliations
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[2] Natl Def Univ, Beijing 100091, Peoples R China
[3] Xiangtan Univ, Dept Comp Sci, Xiangtan 411105, Peoples R China
[4] Xiangtan Univ, Cyberspace Secur Coll, Xiangtan 411105, Peoples R China
[5] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6R 2G7, Canada
[6] Polish Acad Sci, Syst Res Inst, PL-00901 Warsaw, Poland
[7] Istinye Univ, Fac Engn & Nat Sci, Dept Comp Engn, TR-34010 Sariyer Istanbul, Turkiye
[8] Qatar Univ, Coll Engn, KINDI Ctr Comp Res, Doha, Qatar
[9] Delft Univ Technol, Dept Cognit Robot, NL-2628 CD Delft, Netherlands
[10] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[11] Beihang Univ, Sch Reliabil & Syst Engn, Beijing 100191, Peoples R China
Source
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2024 / Vol. 54 / No. 04
Keywords
Combinatorial optimization problem; deep reinforcement learning (DRL); evolutionary algorithm (EA); generalized model; multitype; satellite observation; scheduling; CONSTELLATION; ALGORITHM; SYSTEM; AREA;
DOI
10.1109/TSMC.2023.3345928
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
Multitype satellite observation, including optical observation satellites, synthetic aperture radar (SAR) satellites, and electromagnetic satellites, has become an important direction in integrated satellite applications due to its ability to cope with various complex situations. In the multitype satellite observation scheduling problem (MTSOSP), the constraints involved in different types of satellites make the problem challenging. This article proposes a mixed-integer programming model and a generalized profit representation method in the model to effectively cope with the situation of multiple types of satellite observations. To obtain a suitable observation plan, a deep reinforcement learning-based genetic algorithm (DRL-GA) is proposed by combining the learning method and genetic algorithm. The DRL-GA adopts a solution generation method to obtain the initial population and assist with local search. In this method, a set of statistical indicators that consider resource utilization and task arrangement performance are regarded as states. By using deep neural networks to estimate the $Q$ value of each action, this method can determine the preferred order of task scheduling. An individual update strategy and an elite strategy are used to enhance the search performance of DRL-GA. Simulation results verify that DRL-GA can effectively solve the MTSOSP and outperforms the state-of-the-art algorithms in several aspects. This work reveals the advantages of the proposed generalized model and scheduling method, which exhibit good scalability for various types of observation satellite scheduling problems.
Pages: 2576-2589
Page count: 14
Related papers
44 in total
[1]   QUEST - A new quadratic decision model for the multi-satellite scheduling problem [J].
Berger, J. ;
Lo, N. ;
Barkaoui, M. .
COMPUTERS & OPERATIONS RESEARCH, 2020, 115
[2]   The Earth-Observing Satellite Constellation: A review from a meteorological perspective of a complex, interconnected global system with extensive applications [J].
Boukabara, Sid-Ahmed ;
Eyre, John ;
Anthes, Richard A. ;
Holmlund, Kenneth ;
St. Germain, Karen M. ;
Hoffman, Ross N. .
IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2021, 9 (03) :26-42
[3]   Integrated scheduling problem for earth observation satellites based on three modeling frameworks: an adaptive bi-objective memetic algorithm [J].
Chang, Zhongxiang ;
Zhou, Zhongbao ;
Xing, Lining ;
Yao, Feng .
MEMETIC COMPUTING, 2021, 13 (02) :203-226
[4]  
Chang Ziyi, 2022, ARXIV
[5]   A mixed integer linear programming model for multi-satellite scheduling [J].
Chen, Xiaoyu ;
Reinelt, Gerhard ;
Dai, Guangming ;
Spitz, Andreas .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2019, 275 (02) :694-707
[6]  
Chen Y., 2012, P AITIA, P441, DOI 10.1007/978-3-642-26001-858
[7]  
Cheng Q, 2013, PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON MATERIAL SCIENCE AND ENVIRONMENTAL ENGINEERING (MSEE 2013), P1
[8]   Historical background and current developments for mapping burned area from satellite Earth observation [J].
Chuvieco, Emilio ;
Mouillot, Florent ;
van der Werf, Guido R. ;
San Miguel, Jesus ;
Tanase, Mihai ;
Koutsias, Nikos ;
Garcia, Mariano ;
Yebra, Marta ;
Padilla, Marc ;
Gitas, Ioannis ;
Heil, Angelika ;
Hawbaker, Todd J. ;
Giglio, Louis .
REMOTE SENSING OF ENVIRONMENT, 2019, 225 :45-64
[9]  
Du Y, 2023, IEEE T EM TOP COMP I, V7, P1036, DOI [10.1109/TETCI.2022.3145706, 10.1109/IECON49645.2022.9968766]
[10]   Large Region Targets Observation Scheduling by Multiple Satellites Using Resampling Particle Swarm Optimization [J].
Gu, Yi ;
Han, Chao ;
Chen, Yuhan ;
Liu, Shenggang ;
Wang, Xinwei .
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (02) :1800-1815