A sparse regression and neural network approach for financial factor modeling

被引:9
作者
Anis, Hassan T. [1 ]
Kwon, Roy H. [1 ]
机构
[1] Univ Toronto, Dept Mech & Ind Engn, 5 Kings Coll Rd, Toronto, ONTARIO M5S 3G8, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Finance; Factor model construction; Best subset selection; Neural networks; Interpretability-accuracy trade-off; SELECTION; EQUILIBRIUM; RETURNS;
D O I
10.1016/j.asoc.2021.107983
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Factor models are central to understanding risk-return trade-offs in finance. Since Fama and French (1993), hundreds of factors have been found to have explanatory power for asset pricing. To construct a factor model, two tasks have to be performed: Feature Selection, selecting a small subset given a large number of factors to overcome overfitting in regression, and Feature Engineering, determining the interactions between the factors. In this work, the process of constructing factor models (not the factors themselves) is examined. A unified, two-step process of dimensionality reduction and nonlinear transformation that produces parsimonious, general factor models is proposed. Comparisons between frameworks implementing linear feature selection models as well as non-linear feature reduction techniques are conducted. A second stage generalizes the models by learning nonlinear interactions. The framework attempts to strike a balance between accuracy and interpretability. Results of computational experiments on historical financial data, on three models of varying degrees of nonlinearity and interpretability suggest that mixed-integer-programming-based formulations are suitable for the task of linear financial factor selection and that the second-stage nonlinearity due to neural networks improves accuracy. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 46 条
[1]  
[Anonymous], PyTorch
[2]   A deep learning framework for financial time series using stacked autoencoders and long-short term memory [J].
Bao, Wei ;
Yue, Jun ;
Rao, Yulei .
PLOS ONE, 2017, 12 (07)
[3]   BEST SUBSET SELECTION VIA A MODERN OPTIMIZATION LENS [J].
Bertsimas, Dimitris ;
King, Angela ;
Mazumder, Rahul .
ANNALS OF STATISTICS, 2016, 44 (02) :813-852
[4]  
Bryzgalova, 2015, LSE UNPUB, V1
[5]   On persistence in mutual fund performance [J].
Carhart, MM .
JOURNAL OF FINANCE, 1997, 52 (01) :57-82
[6]   SCTSC: A Semicentralized Traffic Signal Control Mode With Attribute-Based Blockchain in IoVs [J].
Cheng, Lichen ;
Liu, Jiqiang ;
Xu, Guangquan ;
Zhang, Zonghua ;
Wang, Hao ;
Dai, Hong-Ning ;
Wu, Yulei ;
Wang, Wei .
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2019, 6 (06) :1373-1385
[7]  
Da Costa J, 2020, 2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), P1091, DOI 10.1109/SSCI47803.2020.9308232
[8]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499
[9]   COMMON RISK-FACTORS IN THE RETURNS ON STOCKS AND BONDS [J].
FAMA, EF ;
FRENCH, KR .
JOURNAL OF FINANCIAL ECONOMICS, 1993, 33 (01) :3-56
[10]   A five-factor asset pricing model [J].
Fama, Eugene F. ;
French, Kenneth R. .
JOURNAL OF FINANCIAL ECONOMICS, 2015, 116 (01) :1-22