Optimization of Neuroprosthetic Vision via End-to-End Deep Reinforcement Learning

被引:15
作者
Kucukoglu, Burcu [1 ]
Rueckauer, Bodo [1 ]
Ahmad, Nasir [1 ]
van Steveninck, Jaap de Ruyter [1 ]
Guclu, Umut [1 ]
van Gerven, Marcel [1 ]
机构
[1] Radboud Univ Nijmegen, Donders Inst Brain Cognit & Behav, Dept Artificial Intelligence, Heyendaalseweg 135, NL-6525 AJ Nijmegen, Gelderland, Netherlands
关键词
Visual neuroprosthesis; phosphene vision; deep reinforcement learning; end-to-end optimization; ENVIRONMENT;
D O I
10.1142/S0129065722500526
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual neuroprostheses are a promising approach to restore basic sight in visually impaired people. A major challenge is to condense the sensory information contained in a complex environment into meaningful stimulation patterns at low spatial and temporal resolution. Previous approaches considered task-agnostic feature extractors such as edge detectors or semantic segmentation, which are likely suboptimal for specific tasks in complex dynamic environments. As an alternative approach, we propose to optimize stimulation patterns by end-to-end training of a feature extractor using deep reinforcement learning agents in virtual environments. We present a task-oriented evaluation framework to compare different stimulus generation mechanisms, such as static edge-based and adaptive end-to-end approaches like the one introduced here. Our experiments in Atari games show that stimulation patterns obtained via task-dependent end-to-end optimized reinforcement learning result in equivalent or improved performance compared to fixed feature extractors on high difficulty levels. These findings signify the relevance of adaptive reinforcement learning for neuroprosthetic vision in complex environments.
引用
收藏
页数:16
相关论文
共 50 条
[31]   End-to-End Reinforcement Learning of Curative Curtailment with Partial Measurement Availability [J].
Wolf, Hinrikus ;
Boetcher, Luis ;
Bouchkati, Sarra ;
Lutat, Philipp ;
Breitung, Jens ;
Jung, Bastian ;
Moellemann, Tina ;
Todosijevic, Viktor ;
Schiefelbein-Lach, Jan ;
Pohl, Oliver ;
Ulbig, Andreas ;
Grohe, Martin .
2024 IEEE PES INNOVATIVE SMART GRID TECHNOLOGIES EUROPE, ISGT EUROPE, 2024,
[32]   Deep Reinforcement Learning for Solving Two-Echelon Capacity Vehicle Routing Problem: An End-to-End Method [J].
Sun, Weice ;
Pei, Zhi .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (06) :6432-6439
[33]   A Survey of Intelligent End-to-End Networking Solutions: Integrating Graph Neural Networks and Deep Reinforcement Learning Approaches [J].
Tam, Prohim ;
Ros, Seyha ;
Song, Inseok ;
Kang, Seungwoo ;
Kim, Seokhoon .
ELECTRONICS, 2024, 13 (05)
[34]   An End-to-End Deep Reinforcement Learning-Based Intelligent Agent Capable of Autonomous Exploration in Unknown Environments [J].
Dooraki, Amir Ramezani ;
Lee, Deok-Jin .
SENSORS, 2018, 18 (10)
[35]   Comparison of end-to-end and hybrid deep reinforcement learning strategies for controlling cable-driven parallel robots [J].
Xiong, Hao ;
Ma, Tianqi ;
Zhang, Lin ;
Diao, Xiumin .
NEUROCOMPUTING, 2020, 377 :73-84
[36]   An end-to-end decentralised scheduling framework based on deep reinforcement learning for dynamic distributed heterogeneous flowshop scheduling [J].
Li, Haoran ;
Gao, Liang ;
Fan, Qingsong ;
Li, Xinyu ;
Han, Baoan .
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2025, 63 (12) :4368-4388
[37]   An End-to-End Reinforcement Learning Method for Automated Guided Vehicle Path Planning [J].
Sun Yu ;
Li Haisheng .
INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2020, 2020, 11574
[38]   Stabilization Approaches for Reinforcement Learning-Based End-to-End Autonomous Driving [J].
Chen, Siyuan ;
Wang, Meiling ;
Song, Wenjie ;
Yang, Yi ;
Li, Yujun ;
Fu, Mengyin .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (05) :4740-4750
[39]   New Results in End-to-end Image and Video Compression by Deep Learning [J].
Ozsoy, Gokberk ;
Yilmaz, Melih ;
Kirmemis, Ogun ;
Tekalp, A. Murat .
2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
[40]   End-to-end deep reinforcement learning and control with multimodal perception for planetary robotic dual peg-in-hole assembly [J].
Li, Boxin ;
Wang, Zhaokui .
ADVANCES IN SPACE RESEARCH, 2024, 74 (11) :5860-5873