Time Series Data Generation by Linear Response Model An Application of Linear Response Theory to Finance

被引:0
作者
Naritomi Y. [1 ,2 ]
Adachi T. [2 ]
机构
[1] Mitsubishi UFJ Trust Investment Technology Institute Co., Ltd, Tokyo Metropolitan University
关键词
data augmentation; data generation; time series data;
D O I
10.1527/tjsai.39-4_FIN23-J
中图分类号
学科分类号
摘要
In recent years, data generation techniques based on machine learning approaches have made significant progress and various generation models have been proposed in the fields of text and image. In the field of finance, there has also been research on the data generation of time series on stock prices. However, there are many problems in applying machine learning models to financial time series data. For example, the behavior of the machine learning model when given data that is not in the historical data is unclear, or if the data differs significantly from historical data under strong stresses such as the Lehman or the COVID-19 shock, the predictions by the machine learning model will be completely unreliable. This problem may be solvable by using data generation techniques that add virtual stress to the current state. In this study, we propose a new method for generating virtual time series data based on linear response theory (LRT). Using the LRT, we can obtain an approximate representation of the transition from the current equilibrium state to another equilibrium state by adding second-order fluctuation in the current equilibrium state and external forces (stresses). In other words, if the external force can be estimated in advance, we shall be able to obtain virtual time series data under external forces. As an application, we examined whether this method is effective with data augmentation of the stock prediction model using external forces estimated by historical data. As a result, the accuracy of stock price predictions was improved over the case without data augmentation. © 2024, Japanese Society for Artificial Intelligence. All rights reserved.
引用
收藏
相关论文
共 14 条
  • [1] Alcaraz J. M. L., Strodthoff N., Diffusion-Based Time Series Imputation and Forecasting with Structured State Space Models, (2022)
  • [2] Arjovsky M., Chintala S., Bottou L., Wasserstein Gan, (2017)
  • [3] Bilos M., Rasul K., Schneider A., Nevmyvaka Y., Gunnemann S., Modeling Temporal Data as Continuous Functions with Stochastic Process Diffusion, (2022)
  • [4] Goodfellow I. J., Pouget-Abadie J., Mirza M., Xu B., Warde-Farley D., Ozair S., Courville A., Bengio Y., Generative adversarial nets, Advances in Neural Information Processing Systems, 27, pp. 2672-2680, (2014)
  • [5] Ikeguchi M., Ueno J., Sato M., Kidera A., Protein Structural Change Upon Ligand Binding: Linear Response Theory, Phys. Rev. Lett, 94, (2005)
  • [6] Kubo R., Toda M., Hashitsume N., Statistical Physics II: Nonequilibrium Statistical Mechanics, (1991)
  • [7] Li J., Wang X., Lin Y., Sinha A., Wellman M. P., Generating Realistic Stock Market Order Streams, (2020)
  • [8] Li Y., Lu X., Wang Y., Generative Time Series Forecasting with Diffusion, Denoise, and Disentanglement, (2023)
  • [9] Oord A., Dieleman S., Zen H., Simonyan K., Vinyals O., Graves A., Kavukcuoglu K., Wavenet: A generative model for raw audio, (2016)
  • [10] Tashiro Y., Song J., Song Y., Ermon S., CSDI: Conditional Score-Based Diffusion Models for Probabilistic Time Series Imputation, Advances in Neural Information Processing Systems, (2021)