Generalized Model and Deep Reinforcement Learning-Based Evolutionary Method for Multitype Satellite Observation Scheduling

Journal Article (2024)
Author(s)

Yanjie Song (Xidian University)

J. Ou (Xiangtan University, Xiangtan)

Witold Pedrycz (University of Alberta, Istinye University, Polish Academy of Sciences)

Ponnuthurai Nagaratnam Suganthan (Qatar University)

X. Wang (TU Delft - Learning & Autonomous Control)

Lining Xing (Xidian University)

Yue Zhang (Beihang University)

Research Group
Learning & Autonomous Control
DOI related publication
https://doi.org/10.1109/TSMC.2023.3345928
More Info
expand_more
Publication Year
2024
Language
English
Research Group
Learning & Autonomous Control
Issue number
4
Volume number
54
Pages (from-to)
2576-2589
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Multitype satellite observation, including optical observation satellites, synthetic aperture radar (SAR) satellites, and electromagnetic satellites, has become an important direction in integrated satellite applications due to its ability to cope with various complex situations. In the multitype satellite observation scheduling problem (MTSOSP), the constraints involved in different types of satellites make the problem challenging. This article proposes a mixed-integer programming model and a generalized profit representation method in the model to effectively cope with the situation of multiple types of satellite observations. To obtain a suitable observation plan, a deep reinforcement learning-based genetic algorithm (DRL-GA) is proposed by combining the learning method and genetic algorithm. The DRL-GA adopts a solution generation method to obtain the initial population and assist with local search. In this method, a set of statistical indicators that consider resource utilization and task arrangement performance are regarded as states. By using deep neural networks to estimate the <inline-formula> <tex-math notation="LaTeX">$Q$</tex-math> </inline-formula> value of each action, this method can determine the preferred order of task scheduling. An individual update strategy and an elite strategy are used to enhance the search performance of DRL-GA. Simulation results verify that DRL-GA can effectively solve the MTSOSP and outperforms the state-of-the-art algorithms in several aspects. This work reveals the advantages of the proposed generalized model and scheduling method, which exhibit good scalability for various types of observation satellite scheduling problems.

Files

Generalized_Model_and_Deep_Rei... (pdf)
(pdf | 2.05 Mb)
- Embargo expired in 15-07-2024
License info not available