Network Environment and Financial Risk Using Machine Learning and Sentiment Analysis

被引:26
作者
Li, Nan [2 ]
Liang, Xun [2 ]
Li, Xinli [2 ]
Wang, Chao [2 ]
Wu, Desheng Dash [1 ,3 ]
机构
[1] Reykjavik Univ, Sch Sci & Engn, IS-103 Reykjavik, Iceland
[2] Peking Univ, Inst Comp Sci & Technol, Beijing 100871, Peoples R China
[3] Univ Toronto, RiskLab, Toronto, ON, Canada
来源
HUMAN AND ECOLOGICAL RISK ASSESSMENT | 2009年 / 15卷 / 02期
关键词
risk; natural language processing; sentiment analysis; stock market; machine learning;
D O I
10.1080/10807030902761056
中图分类号
X176 [生物多样性保护];
学科分类号
090705 ;
摘要
Under the network environment, the trading volume and asset price of a financial commodity or instrument are affected by various complicated factors. Machine learning and sentiment analysis provide powerful tools to collect a great deal of data from the website and retrieve useful information for effectively forecasting financial risk of associated companies. This article studies trading volume and asset price risk when sentimental financial information data are available using both sentiment analysis and popular machine learning approaches: artificial neural network (ANN) and support vector machine (SVM). Nonlinear GARCH-based mining models are developed by integrating GARCH (generalized autoregressive conditional heteroskedasticity) theory and ANN and SVM. Empirical studies in the U.S. stock market show that the proposed approach achieves favorable forecast performances. GARCH-based SVM outperforms GARCH-based ANN for volatility forecast, whereas GARCH-based ANN achieves a better forecast result for the volatility trend. Results also indicate a strong correlation between information sentiment and both trading volume and asset price volatility.
引用
收藏
页码:227 / 252
页数:26
相关论文
共 26 条
  • [1] Visualising sentiments in financial texts?
    Ahmad, K
    Almas, Y
    [J]. NINTH INTERNATIONAL CONFERENCE ON INFORMATION VISUALISATION, PROCEEDINGS, 2005, : 363 - 368
  • [2] [Anonymous], 2005, INT C SYST SCI 2005
  • [3] [Anonymous], 2001, P 12 EUR C MACH LEAR, DOI DOI 10.1007/3-540-44795-4_42
  • [4] GENERALIZED AUTOREGRESSIVE CONDITIONAL HETEROSKEDASTICITY
    BOLLERSLEV, T
    [J]. JOURNAL OF ECONOMETRICS, 1986, 31 (03) : 307 - 327
  • [5] A tutorial on Support Vector Machines for pattern recognition
    Burges, CJC
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) : 121 - 167
  • [6] Cai CL, 2003, SECOND IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, PROCEEDINGS, P60
  • [7] Monitoring financial market using French written textual data.
    Chuttur, MY
    Bhurtun, C
    [J]. ICCC 2005: IEEE 3rd International Conference on Computational Cybernetics, 2005, : 239 - 242
  • [8] Natural language processing and information extraction: Qualitative analysis of financial news articles.
    Costantino, M
    Morgan, RG
    Collingham, RJ
    Garigliano, R
    [J]. PROCEEDINGS OF THE IEEE/IAFE 1997 COMPUTATIONAL INTELLIGENCE FOR FINANCIAL ENGINEERING (CIFER), 1997, : 116 - 122
  • [9] Cristianini N., 2000, INTRO SUPPORT VECTOR
  • [10] Dong ZD, 2003, 2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, P820