Text Analysis in R

被引:162
作者
Welbers, Kasper [1 ]
Van Atteveldt, Wouter [2 ]
Benoit, Kenneth [3 ]
机构
[1] Univ Leuven, Inst Media Studies, Sint Andriesstr 2,Box 15530, B-2000 Leuven, Belgium
[2] Vrije Univ Amsterdam, Dept Commun Sci, Amsterdam, Netherlands
[3] London Sch Econ & Polit Sci, Dept Methodol, London, England
关键词
TOPIC MODELS; PARTY; PITFALLS; SUPPORT;
D O I
10.1080/19312458.2017.1387238
中图分类号
G2 [信息与知识传播];
学科分类号
05 ; 0503 ;
摘要
Computational text analysis has become an exciting research field with many applications in communication research. It can be a difficult method to apply, however, because it requires knowledge of various techniques, and the software required to perform most of these techniques is not readily available in common statistical software packages. In this teacher's corner, we address these barriers by providing an overview of general steps and operations in a computational text analysis project, and demonstrate how each step can be performed using the R statistical software. As a popular open-source platform, R has an extensive user community that develops and maintains a wide range of text analysis packages. We show that these packages make it easy to perform advanced text analytics.
引用
收藏
页码:245 / 265
页数:21
相关论文
共 62 条
[1]  
[Anonymous], 2017, XML TOOLS PARSING GE
[2]  
[Anonymous], 2014, SNOWBALLC SNOWBALL S
[3]  
[Anonymous], P 14 INT C MACH LEAR
[4]  
[Anonymous], QUANTEDA QUANTITATIV
[5]  
[Anonymous], 2017, TIF TEXT INTERCHANGE
[6]  
[Anonymous], 2001, Snowball: a language for stemming algorithms
[7]  
Arnold T., 2016, coreNLP: Wrappers around Stanford CoreNLP tools
[8]  
Arnold T, 2017, CLEANNLP TIDY DATA M
[9]  
Arnold T., 2017, kerasR: R Interface to the Keras Deep Learning Library
[10]  
Atteveldt W.V., 2008, Semantic network analysis: Techniques for extracting, representing, and querying media content