A novel approach for self-driving car in partially observable environment using life long reinforcement learning

被引:1
|
作者
Quadir, Md Abdul [1 ]
Jaiswal, Dibyanshu [1 ]
Mohan, Senthilkumar [2 ]
Innab, Nisreen [3 ]
Sulaiman, Riza [4 ]
Alaoui, Mohammed Kbiri [5 ]
Ahmadian, Ali [6 ,7 ]
机构
[1] Vellore Inst Technol, Sch Comp Sci & Engn, Chennai 600127, India
[2] Vellore Inst Technol, Sch Comp Sci Engn & Informat Syst, Vellore 632014, Tamilnadu, India
[3] AlMaarefa Univ, Coll Appl Sci, Dept Comp Sci & Informat Syst, Riyadh, Saudi Arabia
[4] Univ Kebangsaan Malaysia, Inst Visual Informat, Bangi 43600, Malaysia
[5] King Khalid Univ, Coll Sci, Dept Math, Abha 61413, POB 9004, Saudi Arabia
[6] Mediterranea Univ Reggio Calabria, Decis Lab, Reggio Di Calabria, Italy
[7] Istanbul Okan Univ, Fac Engn & Nat Sci, Istanbul, Turkiye
关键词
Reinforcement Learning; Lifelong Learning; Self-driving car; Lifelong reinforcement learning; Partially observable Environment; POLICY; GAMES;
D O I
10.1016/j.segan.2024.101356
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Despite ground-breaking advancements in robotics, gaming, and other challenging domains, reinforcement learning still faces significant challenges in solving dynamic, open-world problems. Since reinforcement learning algorithms usually perform poorly when exposed to new tasks outside of their data distribution, continuous learning algorithms have drawn significant attention. In parallel with work on lifelong learning algorithms, there is a need for challenging environments, properly planned trials, and metrics to measure research success. In this context, a Deep Asynchronous Autonomous Learning System (DAALS) is proposed in this paper for training a selfdriving car in a partially observable environment for real-world scenarios in a continuous state-action space. To cater to three different use cases, three different algorithms were used. To train their agents for learning and upgrading discrete state policies, DAALS used the Asynchronous Advantage Stager Reviewer (AASR) algorithm. To train its agent for continuous state spaces, DAALS also uses an Extensive Deterministic Policy Gradient (EDPG) algorithm. To train the agent in a lifelong form of learning for partially observable environments, DAALS uses a Deep Deterministic Policy Gradient Novel Lifelong Learning Algorithm (DDPGNLLA). The system offers flexibility to the user to train the agents for both discrete and continuous state-action spaces. Compared to previous models in continuous state-action spaces, Deep deterministic policy gradient lifelong learning algorithm outperforms previous models by 46.09%. Furthermore, the Deep Asynchronous Autonomous System tends to outperform all previous reinforcement learning algorithms, making our proposed approach a real-world solution. As DAALS has tested on number of different environments it provides the insights on how modern Artificial Intelligence (AI) solutions can be generalized making it one of the better solutions for AI general domain problems.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] A decision-making model for self-driving vehicles based on GPT-4V, federated reinforcement learning, and blockchain
    Alam, Tanweer
    Gupta, Ruchi
    Ahamed, N. Nasurudeen
    Ullah, Arif
    Neural Computing and Applications, 2024, 36 (34) : 21545 - 21560
  • [42] Simulated Autonomous Driving in a Realistic Driving Environment using Deep Reinforcement Learning and a Deterministic Finite State Machine
    Klose, Patrick
    Mester, Rudolf
    PROCEEDINGS OF 2ND INTERNATIONAL CONFERENCE ON APPLICATIONS OF INTELLIGENT SYSTEMS (APPIS 2019), 2019,
  • [43] Toward Learning Human-Like, Safe and Comfortable Car-Following Policies With a Novel Deep Reinforcement Learning Approach
    Yavas, M. Ugur
    Kumbasar, Tufan
    Ure, Nazim Kemal
    IEEE ACCESS, 2023, 11 : 16843 - 16854
  • [44] Self-learning swimming of a three-disk microrobot in a viscous and stochastic environment using reinforcement learning
    Abdi, Hossein
    Pishkenari, Hossein Nejat
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [45] A novel physical activity recognition approach using deep ensemble optimized transformers and reinforcement learning
    Ahmadian, Sajad
    Rostami, Mehrdad
    Farrahi, Vahid
    Oussalah, Mourad
    NEURAL NETWORKS, 2024, 173
  • [46] Fatigue life prognosis of composite structures using a transferable deep reinforcement learning-based approach
    Liu, Cheng
    Chen, Yan
    Xu, Xuebing
    COMPOSITE STRUCTURES, 2025, 353
  • [47] Mean line aerodynamic design of an axial compressor using a novel design approach based on reinforcement learning
    Liu, Yi
    Chen, Jiang
    Cheng, Jinxin
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2022, 236 (12) : 2433 - 2446
  • [48] Self-Organizing Sustainable Spectrum Management Methodology in Cognitive Radio Vehicular Adhoc Network (CRAVENET) Environment: A Reinforcement Learning Approach
    Ghanshala, Kamal Kumar
    Sharma, Sachin
    Mohan, Seshadri
    Nautiyal, Lata
    Mishra, Preeti
    Joshi, R. C.
    2018 FIRST INTERNATIONAL CONFERENCE ON SECURE CYBER COMPUTING AND COMMUNICATIONS (ICSCCC 2018), 2018, : 168 - 172
  • [49] Self-Organizing Networks: A Packet Scheduling Approach for Coverage/Capacity Optimization in 4G Networks Using Reinforcement Learning
    Tiwana, Moazzam Islam
    Nawaz, Syed Junaid
    Ikram, Ataul Aziz
    Tiwana, Mohsin Islam
    ELEKTRONIKA IR ELEKTROTECHNIKA, 2014, 20 (09) : 59 - 64
  • [50] Optimizing Relay Selection in D2D Communication for Next-Generation Wireless Networks Using Multi-Agent Reinforcement Learning: A Novel Approach
    Muharrem Sirma
    Adnan Kavak
    A. Burak Inner
    Wireless Personal Communications, 2025, 140 (3) : 945 - 969