Amoeba-inspired algorithm for cognitive medium access

被引：24

作者：

Kima, Song-Ju ^{[1
]}

Aono, Masashi ^{[2
,3
]}

机构：

[1] Natl Inst Mat Sci, WPI Ctr Mat Nanoarchitecton MANA, 1-1 Namiki, Tsukuba, Ibaraki 3050044, Japan

[2] Tokyo Inst Technol, Earth Life Sci Inst, Meguro Ku, Tokyo 1528550, Japan

[3] Japan Sci & Technol Agcy, PRESTO, Saitama 3320012, Japan

来源：

IEICE NONLINEAR THEORY AND ITS APPLICATIONS | 2014年 / 5卷 / 02期

关键词：

cognitive radio; medium access control; multi-armed bandit problem; bio-inspired computing; parallel search; machine learning;

D O I：

10.1587/nolta.5.198

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

The "tug-of-war (TOW) model" is a unique parallel search algorithm for solving the multi-armed bandit problem (BP), which was inspired by the photoavoidance behavior of a single-celled amoeboid organism, the true slime mold Physarum polycephalum [1-4]. "The cognitive medium access (CMA) problem," which refers to multiuser channel allocations of the cognitive radio, can be interpreted as a "competitive multi-armed bandit problem (CBP) [5, 6]." Unlike the normal BP, the CBP considers a competitive situation in which more than one user selects a channel whose reward probability (probability of which channel is free) varies depending on the number and combination of the selecting users as indicated in a payoff matrix. Depending on the payoff matrix, the CBP provides a hard problem instance in which the users should not be attracted to the Nash equilibrium to achieve the "social maximum," which is the most desirable state to obtain the maximum total score (throughput) for all the users. In this study, we propose two variants of the TOW model (solid type and liquid type) for the CBP toward developing a CMA protocol using a distributed control in uncertain environments. Using the minimum CBP cases where both the users choose a channel from the two considered channels, we show that the performance of our solid-type TOW model is better than that of the well-known upper confidence bound 1 (UCB1)-tuned algorithm, particularly for the hard problem instances. The aim of this study is to explore how the users can achieve the social maximum in a decentralized manner. We also show that our liquid-type TOW model, which introduces direct interactions among the users for avoiding mutual collisions, makes it possible to achieve the social maximum for general CBP instances.

引用

页码：198 / 209

页数：12

共 22 条

[1] Agarwal D, 2009, P ICDM2009
[2] Aono M, 2009, LECT NOTES COMPUT SC, V5715, P56, DOI 10.1007/978-3-642-03745-0_13
[3] Amoeba-based Chaotic Neurocomputing: Combinatorial Optimization by Coupled Biological Oscillators
Aono, Masashi
Hirata, Yoshito
Hara, Masahiko
Aihara, Kazuyuki
[J]. NEW GENERATION COMPUTING, 2009, 27 (02) : 129 - 157
[4] Finite-time analysis of the multiarmed bandit problem
Auer, P
Cesa-Bianchi, N
Fischer, P
[J]. MACHINE LEARNING, 2002, 47 (2-3) : 235 - 256
[5] Ant algorithms for discrete optimization
Dorigo, M
Di Caro, G
Gambardella, LM
[J]. ARTIFICIAL LIFE, 1999, 5 (02) : 137 - 172
[6] Gelly S., 2006, RR6062, P1
[7] Karaboga, 2005, TR06 ERC U
[8] Kim S.-J., 2013, CCS2012037 IEICE, P37
[9] Kim S.-J., 2012, P NOLTA2012, P590
[10] Kim S.-J., 2010, P INT S NONL THEOR I, P520

← 1 2 3 →