Resource allocation with stochastic optimal control approach

被引:0
作者
Haleh Valian
Mohsen A. Jafari
Davood Golmohammadi
机构
[1] Biogen Idec,Department of Industrial and Systems Engineering
[2] Rutgers University,College of Management
[3] University of Massachusetts Boston,undefined
来源
Annals of Operations Research | 2016年 / 239卷
关键词
Decision system; Optimization; Resource allocation; Stochastic;
D O I
暂无
中图分类号
学科分类号
摘要
A control-theoretic decision making system is proposed for an agent (decision maker) to “optimally” allocate and deploy his/her resources over time among a dynamically changing list of opportunities (e.g., financial assets), in an uncertain market environment. The solution is a sequence of actions with the objective of optimizing total reward function. This control-theoretic approach is unique in a sense that it solves the problem at distinct time epochs over a finite time horizon and strategies are discovered directly. Rather than basing a decision making system on forecasts or training via a reinforcement learning algorithm using current state data, we train our system via a Q-learning algorithm using Geometric Brownian Motion as an asset price function. While the above problem is quite general, we focus solely on the problem of dynamic financial portfolio management with the objective of maximizing the expected utility for a given risk level. The performance functions that we consider for our system are realized mean return, drawdown and standard deviation. We find that our model achieves a better return and drawdown compared to a known market index as a benchmark.
引用
收藏
页码:625 / 641
页数:16
相关论文
共 15 条
[1]  
Brown D(2011)Dynamic portfolio optimization with transaction costs: Heuristics and dual bounds Management Science 57 1752-1770
[2]  
Smith J(2011)Portfolio selection with imperfect information: A hidden Markov model Applied Stochastic Models in Business and Industry 27 95-114
[3]  
Çanakoğlu E(2006)Neuro-dynamic trading methods European Journal of Operational Research 175 16-174
[4]  
Özekici S(1970)Multiperiod consumption-investment decisions American Economics Review 60 163-39
[5]  
Casqueiroa XP(2007)Portfolio choice in stochastic environments The Review of Financial Studies 20 1-91
[6]  
Rodrigues AJL(1952)Portfolio selection Journal of Finance 7 77-889
[7]  
Fama EF(2001)Learning to trade via direct reinforcement IEEE Transactions on Neural Networks 12 875-233
[8]  
Liu J(1972)Random nature of stock market prices Journal of Economics and Business 6 220-1524
[9]  
Markowitz H(2008)A dynamic stochastic programming model for international portfolio management European Journal of Operational Research 185 1501-undefined
[10]  
Moody J(undefined)undefined undefined undefined undefined-undefined