Regret Bounds for Online Portfolio Selection with a Cardinality Constraint

被引:0
作者
Ito, Shinji [1 ]
Hatano, Daisuke [2 ]
Sumita, Hanna [3 ]
Yabe, Akihiro [1 ]
Fukunaga, Takuro [2 ,4 ]
Kakimura, Naonori [5 ]
Kawarabayashi, Ken-ichi [6 ]
机构
[1] NEC Corp Ltd, Minato, Japan
[2] RIKEN AIP, Tokyo, Japan
[3] Tokyo Metropolitan Univ, Tokyo, Japan
[4] JST PRESTO, Tokyo, Japan
[5] Keio Univ, Tokyo, Japan
[6] Natl Inst Informat, Tokyo, Japan
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018) | 2018年 / 31卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online portfolio selection is a sequential decision-making problem in which a learner repetitively selects a portfolio over a set of assets, aiming to maximize long-term return. In this paper, we study the problem with the cardinality constraint that the number of assets in a portfolio is restricted to be at most k, and consider two scenarios: (i) in the full-feedback setting, the learner can observe price relatives (rates of return to cost) for all assets, and (ii) in the bandit-feedback setting, the learner can observe price relatives only for invested assets. We propose efficient algorithms for these scenarios, which achieve sublinear regrets. We also provide regret (statistical) lower bounds for both scenarios which nearly match the upper bounds when k is a constant. In addition, we give a computational lower bound, which implies that no algorithm maintains both computational efficiency, as well as a small regret upper bound.
引用
收藏
页数:10
相关论文
共 29 条
  • [1] Agarwal A., 2006, P 23 INT C MACHINE L, P9, DOI DOI 10.1145/1143844.1143846
  • [2] Alon Noga, 2004, PROBABILISTIC METHOD
  • [3] [Anonymous], 2002, J MACH LEARN RES
  • [4] [Anonymous], 2013, INT C MACH LEARN
  • [5] [Anonymous], FDN TRENDS OPTIM
  • [6] [Anonymous], 2012, ELEMENTS INFORM THEO
  • [7] [Anonymous], 2012, Theory of Computing
  • [8] [Anonymous], 2003, P 20 INT C MACH ICML
  • [9] Auer P, 2003, SIAM J COMPUT, V32, P48, DOI 10.1137/S0097539701398375
  • [10] Finite-time analysis of the multiarmed bandit problem
    Auer, P
    Cesa-Bianchi, N
    Fischer, P
    [J]. MACHINE LEARNING, 2002, 47 (2-3) : 235 - 256