Uncertainty-Aware Reinforcement Learning for Portfolio Optimization

被引：0

作者：

Enkhsaikhan, Bayaraa ^{[1
]}

Jo, Ohyun ^{[1
]}

机构：

[1] Chungbuk Natl Univ, Dept Comp Sci, Cheongju 361763, South Korea

来源：

IEEE ACCESS | 2024年 / 12卷

基金：

新加坡国家研究基金会;

关键词：

Uncertainty; Portfolios; Optimization; Investment; Deep learning; Bayes methods; Data models; Predictive models; Measurement uncertainty; Decoding; Risk-averse reinforcement learning; portfolio optimization; variational auto encoder;

D O I：

10.1109/ACCESS.2024.3494859

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We explored the use of Reinforcement Learning (RL) combined with risk assessment for optimizing investment portfolios. The dynamic nature of trading, compounded by market frictions, the responses of other market participants, and uncertainties, poses challenges to portfolio optimization. The financial market's intricacies make it difficult to model accurately, compounded by regulatory requirements and internal risk policies mandating risk-averse decisions to avoid catastrophic outcomes. To address this, we proposed risk estimation for investor's risk tolerance threshold. Moreover, modern Deep Learning models are adept at approximating complex relationship between abundant data, however, the main drawback we face now a day is generalization of the relationship to the unseen data. Therefore, the epistemic uncertainty can pose risk to the decision making system. This uncertainty is further addressed using a Variational Autoencoder (VAE) to estimate, and Cost Network to backpropogate riskiness through the model to learn actions with safe results. The actions with stable result or lower reward will be avoided due to reward optimization of RL. Consequently, we successfully managed to reduce the risk and uncertainties in the agent testing process. Our risk-constrained RL algorithm demonstrated zero violation of the constraint in the testing phase. This suggests that adopting a risk-averse RL approach could be beneficial for portfolio optimization, particularly for risk-averse investors.

引用

页码：166553 / 166563

页数：11

共 41 条

[1]

Achelis S. B, 2001, Technical Analysis from A to Z

[2]

An J., 2015, SPECIAL LECT IE, V2, P1, DOI DOI 10.1007/BF00758335

[3] A deep learning framework for financial time series using stacked autoencoders and long-short term memory [J].

Bao, Wei ;

Yue, Jun ;

Rao, Yulei .

PLOS ONE, 2017, 12 (07)

[4]

Choi H, 2019, Arxiv, DOI arXiv:1810.01392

[5]

Chow Y, 2018, J MACH LEARN RES, V18

[6] Portfolio constructions in cryptocurrency market: A CVaR-based deep reinforcement learning approach [J].

Cui, Tianxiang ;

Ding, Shusheng ;

Jin, Huan ;

Zhang, Yongmin .

ECONOMIC MODELLING, 2023, 119

[7] Risk-averse Reinforcement Learning for Portfolio Optimization [J].

Enkhsaikhan, Bayaraa ;

Jo, Ohyun .

ICT EXPRESS, 2024, 10 (04) :857-862

[8]

Eriksson Hannes, 2020, ESANN 2020, P339

[9]

Fauvel T, 2021, Arxiv, DOI arXiv:2110.09361

[10]

Gal Y, 2016, PR MACH LEARN RES, V48

← 1 2 3 4 5 →