Simultaneous task and energy planning using deep reinforcement learning

被引:14
|
作者
Wang, Di [1 ]
Hu, Mengqi [1 ]
Weir, Jeffery D. [2 ]
机构
[1] Univ Illinois, Dept Mech & Ind Engn, Chicago, IL 60607 USA
[2] Air Force Inst Technol, Dept Operat Sci, Wright Patterson AFB, OH 45433 USA
基金
美国国家科学基金会;
关键词
Simultaneous task and energy planning; Neural combinatorial optimization; Deep reinforcement learning; End-to-end learning; Sequence-to-sequence decision; VEHICLE-ROUTING PROBLEM; ALGORITHM; ENVIRONMENT; NETWORK;
D O I
10.1016/j.ins.2022.06.015
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To improve energy awareness of unmanned autonomous vehicles, it is critical to co optimize task planning and energy scheduling. To the best of our knowledge, most of the existing task planning algorithms either ignore energy constraints or make energy scheduling decisions based on simple rules. To bridge these research gaps, we propose a combinatorial optimization model for the simultaneous task and energy planning (STEP) problem. In this paper, we propose three variants of STEP problems (i) the vehicle can visit stationary charging stations multiple times at various locations; (ii) the vehicle can efficiently coordinate with mobile charging stations to achieve zero waiting time for recharging, and (iii) the vehicle can maximally harvest solar energy by considering time variance in solar irradiance. Besides, in order to obtain fast and reliable solutions to STEP problems, we propose a neural combinatorial optimizer using the deep reinforcement learning algorithm with a proposed link information filter. The near-optimal solutions can be obtained very fast without solving the problem from scratch when environments change. Our simulation results demonstrate that (i) our proposed neural optimizer can find solutions close to the optimum and outperform the exact and heuristic algorithms in terms of computational cost; (ii) the end-to-end learning (directly mapping from perceptions to control) model outperforms the traditional learning (mapping from perception to prediction to control) model. (c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:931 / 946
页数:16
相关论文
共 50 条
  • [1] Deep Reinforcement Learning for Task Planning of Virtual Characters
    Souza, Caio
    Velhor, Luiz
    INTELLIGENT COMPUTING, VOL 2, 2021, 284 : 694 - 711
  • [2] Task Planning in "Block World" with Deep Reinforcement Learning
    Ayunts, Edward
    Panov, Alekasndr I.
    BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES (BICA) FOR YOUNG SCIENTISTS, 2018, 636 : 3 - 9
  • [3] Hierarchical Task and Motion Planning through Deep Reinforcement Learning
    Newaz, Abdullah Al Redwan
    Alam, Tauhidul
    2021 FIFTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2021), 2021, : 100 - 105
  • [4] Simultaneous Planning for Item Picking and Placing by Deep Reinforcement Learning
    Tanaka, Tatsuya
    Kaneko, Toshimitsu
    Sekine, Masahiro
    Tangkaratt, Voot
    Sugiyama, Masashi
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 9705 - 9711
  • [5] Task Scheduling in Cloud Using Deep Reinforcement Learning
    Swarup, Shashank
    Shakshuki, Elhadi M.
    Yasar, Ansar
    12TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT) / THE 4TH INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40) / AFFILIATED WORKSHOPS, 2021, 184 : 42 - 51
  • [6] Advanced planning for autonomous vehicles using reinforcement learning and deep inverse reinforcement learning
    You, Changxi
    Lu, Jianbo
    Filev, Dimitar
    Tsiotras, Panagiotis
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2019, 114 : 1 - 18
  • [7] Optimizing Robotic Task Sequencing and Trajectory Planning on the Basis of Deep Reinforcement Learning
    Dong, Xiaoting
    Wan, Guangxi
    Zeng, Peng
    Song, Chunhe
    Cui, Shijie
    BIOMIMETICS, 2024, 9 (01)
  • [8] Deep Reinforcement Learning for Task Allocation in Energy Harvesting Mobile Crowdsensing
    Dongare, Sumedh
    Ortiz, Andrea
    Klein, Anja
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 269 - 274
  • [9] Deep reinforcement learning with planning guardrails for building energy demand response
    Jang, Doseok
    Spangher, Lucas
    Nadarajah, Selvaprabu
    Spanos, Costas
    ENERGY AND AI, 2023, 11
  • [10] Laser power planning in directed energy deposition by deep reinforcement learning
    Ren, Kai
    Liu, Ning
    Zhang, Wei
    Chew, Youxiang
    Zhang, Yunfeng
    Fuh, Jerry Yinghsi
    Bi, Guijun
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2024, 135 (9-10): : 4683 - 4694