From cognition to docition: The teaching radio paradigm for distributed & autonomous deployments

被引:10
作者
Giupponi, Lorenza [1 ]
Galindo-Serrano, Ana M. [1 ]
Dohler, Mischa [1 ]
机构
[1] PMT, CTTC, Barcelona 08860, Spain
关键词
Cognitive radio; Aggregated interference; Multi-agent system; Decentralized Q-learning; Docitive learning;
D O I
10.1016/j.comcom.2010.07.025
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We advocate for a novel communication paradigm of docition which facilitates distributed and autonomous networking at minimal control overhead and maximal performance. We consider that the nodes in foreseen networks are intelligent radios able to learn and thus self-adapt to prior set performance targets within a given surrounding environment. We briefly review the state-of-the-art of purely distributed learning algorithms, and we identify the most appropriate approaches allowing for self-adaptation to particular system dynamics. In such distributed settings, however, the learning is typically complex. imprecise and slow due to mutually-impacting decisions resulting in non-stationarities. The docitive paradigm proposes a timely solution which encourages more knowledgeable nodes to teach surrounding nodes to speed up the development of their cognitive state. We advocate for different degrees of docition, such as teaching at start-up or run-time, and demonstrate that this improves the convergence speed and precision of known cognitive algorithms. We evaluate the docitive paradigm in the context of a femtocell network modeled as a multi-agent system, where the agents are the femto access points, implementing a realtime multi-agent reinforcement learning technique known as decentralized Q-learning. We propose different docitive algorithms and we show their superiority to the well know paradigm of independent learning (C) 2010 Elsevier B.V. All rights reserved
引用
收藏
页码:2015 / 2020
页数:6
相关论文
共 23 条
[1]   Expertness based cooperative Q-learning [J].
Ahmadabadi, MN ;
Asadpour, M .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2002, 32 (01) :66-76
[2]  
[Anonymous], P COMP LEARN THEOR C
[3]  
[Anonymous], 1998, THEORY LEARNING GAME
[4]  
[Anonymous], 2009, 3GPP TSG RAN WG4 RAD
[5]  
[Anonymous], 2004, P NIPS
[6]  
Bellman R. E., 1957, Dynamic programming. Princeton landmarks in mathematics
[7]  
Brown GW., 1951, Activity analysis of production and allocation, V13
[8]  
CLAUS C, P 15 NAT C ART INT, P746
[9]  
DOHLER M, 2010, IEEE COMMUNICATI JAN
[10]  
FREUND Y, 2003, P 2 EUR COMP LEARN T, P23