A reinforcement learning-based routing for delay tolerant networks

被引:39
作者
Rolla, Vitor G. [1 ]
Curado, Marilia [1 ]
机构
[1] Univ Coimbra, CISUC, Dept Engn Informat, P-3030290 Coimbra, Portugal
关键词
Delay tolerant routing; Multi-agent systems; Reinforcement-learning; Gossip algorithms;
D O I
10.1016/j.engappai.2013.07.017
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Delay Tolerant Reinforcement-Based (DTRB) is a delay tolerant routing solution for IEEE 80231 wireless networks which enables device to device data exchange without the support of any pre-existing network infrastructure. The solution utilizes Multi-Agent Reinforcement Learning techniques to learn about routes in the network and forward/replicate the messages that produce the best reward. The rewarding process is executed by a learning algorithm based on the distances between the nodes, which are calculated as a function of time from the last meetings. DTRB is a flooding-based delay tolerant routing solution. The simulation results show that DTRB can deliver more messages than a traditional delay tolerant routing solution does in densely populated areas, with similar end-to-end delay and lower network overhead. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2243 / 2250
页数:8
相关论文
共 34 条
[1]   Menger's theorem for infinite graphs [J].
Aharoni, Ron ;
Berger, Eli .
INVENTIONES MATHEMATICAE, 2009, 176 (01) :1-62
[2]  
[Anonymous], 2000, TECHNICAL REPORT
[3]  
[Anonymous], NS2 NETWORK SIMULATO
[4]  
Bai F., 2003, IEEE INFOCOM 2003
[5]   Replication Routing in DTNs: A Resource Allocation Approach [J].
Balasubramanian, Aruna ;
Levine, Brian Neil ;
Venkataramani, Arun .
IEEE-ACM TRANSACTIONS ON NETWORKING, 2010, 18 (02) :596-609
[6]   Behavior-based formation control for multirobot teams [J].
Balch, T ;
Arkin, RC .
IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1998, 14 (06) :926-939
[7]   Multiagent learning using a variable learning rate [J].
Bowling, M ;
Veloso, M .
ARTIFICIAL INTELLIGENCE, 2002, 136 (02) :215-250
[8]  
Boyan J. A., 1994, Advances in neural information processing systems
[9]   A comprehensive survey of multiagent reinforcement learning [J].
Busoniu, Lucian ;
Babuska, Robert ;
De Schutter, Bart .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02) :156-172
[10]   GeoDTN plus Nav: Geographic DTN Routing with Navigator Prediction for Urban Vehicular Environments [J].
Cheng, Pei-Chun ;
Lee, Kevin C. ;
Gerla, Mario ;
Haerri, Jerome .
MOBILE NETWORKS & APPLICATIONS, 2010, 15 (01) :61-82