Yahoo! for Amazon: Sentiment extraction from small talk on the web

被引:660
作者
Das, Sanjiv R. [1 ]
Chen, Mike Y.
机构
[1] Santa Clara Univ, Leavey Sch Business, Dept Finance, Santa Clara, CA 95053 USA
[2] Ludic Labs, San Mateo, CA 94401 USA
关键词
text classification; index formation; computers-computer science; artificial intelligence; finance; investment;
D O I
10.1287/mnsc.1070.0704
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Extracting sentiment from text is a hard semantic problem. We develop a methodology for extracting small investor sentiment from stock message boards. The algorithm comprises different classifier algorithms coupled together by a voting scheme. Accuracy levels are similar to widely used Bayes classifiers, but false positives are lower and sentiment accuracy higher. Time series and cross-sectional aggregation of messaged information improves the quality of the resultant sentiment index, particularly in the presence of slang and ambiguity. Empirical applications evidence a relationship with stock values-tech-sector postings are related to stock index levels, and to volumes and volatility. The algorithms may be used to assess the impact on investor opinion of management announcements, press releases, third-party news, and regulatory changes.
引用
收藏
页码:1375 / 1388
页数:14
相关论文
共 32 条
  • [1] ADMATI A, 2000, 1670R STANF U
  • [2] [Anonymous], INTERNET STOCK MESSA
  • [3] [Anonymous], 1998, P 1998 ACM SIGMOD IN
  • [4] [Anonymous], 1996, BOW TOOLKIT STAT LAN
  • [5] [Anonymous], 1998, 98025 U MICH BUS SCH
  • [6] [Anonymous], 2001, J INVESTING
  • [7] [Anonymous], 1997, Machine Learning
  • [8] Is all that talk just noise? The information content of Internet stock message boards
    Antweiler, W
    Frank, MZ
    [J]. JOURNAL OF FINANCE, 2004, 59 (03) : 1259 - 1294
  • [9] ANTWEILER W, 2005, MARKET IMPACT NEWS S
  • [10] Whisper forecasts of quarterly earnings per share
    Bagnoli, M
    Beneish, MD
    Watts, SG
    [J]. JOURNAL OF ACCOUNTING & ECONOMICS, 1999, 28 (01) : 27 - 50