Text Algorithms in Economics

被引:28
作者
Ash, Elliott [1 ]
Hansen, Stephen [2 ,3 ]
机构
[1] Swiss Fed Inst Technol, Ctr Law & Econ, Zurich, Switzerland
[2] UCL, Dept Econ, London, England
[3] Ctr Econ Policy Res, London, England
关键词
text as data; topic models; word embeddings; large language models; transformer models; MODELS;
D O I
10.1146/annurev-economics-082222-074352
中图分类号
F [经济];
学科分类号
02 ;
摘要
This article provides an overview of the methods used for algorithmic text analysis in economics, with a focus on three key contributions. First, we introduce methods for representing documents as high-dimensional count vectors over vocabulary terms, for representing words as vectors, and for representing word sequences as embedding vectors. Second, we define four core empirical tasks that encompass most text-as-data research in economics and enumerate the various approaches that have been taken so far to accomplish these tasks. Finally, we flag limitations in the current literature, with a focus on the challenge of validating algorithmic output.
引用
收藏
页码:659 / 688
页数:30
相关论文
共 133 条
[1]  
Adams-Prassl A., 2020, 13691 IZA
[2]   What We Teach About Race and Gender: Representation in Images and Text of Children's Books [J].
Adukia, Anjali ;
Eble, Alex ;
Harrison, Emileigh ;
Runesha, Hakizumwami Birali ;
Szasz, Teodora .
QUARTERLY JOURNAL OF ECONOMICS, 2023, 138 (04) :2225-2285
[3]  
Advani A, 2021, CEPR Discuss. Pap. 16115
[4]  
Ahrens M, 2023, Natural language processing for monetary economics: a benchmark
[5]   Can we measure inflation expectations using Twitter? [J].
Angelico, Cristina ;
Marcucci, Juri ;
Miccoli, Marcello ;
Quarta, Filippo .
JOURNAL OF ECONOMETRICS, 2022, 228 (02) :259-277
[6]  
Angrist JD, 2009, MOSTLY HARMLESS ECONOMETRICS: AN EMPIRICISTS COMPANION, P1
[7]  
[Anonymous], 2006, AAAI
[8]  
[Anonymous], NIPS
[9]  
Apel M., 2014, REV EC, V65, P53
[10]  
Arora S, 2016, 5 INT C LEARN REPR T