The single decision maker chooses one of the actions repeatedly. She chooses the action with the highest weighted average of the past payoffs. In the long run either the action with highest expected payoff or the action with highest minimal payoff is chosen depending on how weights evolve. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:303 / 305
页数:3
相关论文
共 7 条
[1]
[Anonymous], MODERN APPROACH PROB
[2]
[Anonymous], 1998, INDIVIDUAL STRATEGY, DOI DOI 10.1515/9780691214252