Computer assisted text analysis in the social sciences

被引:32
作者
Brier, Alan [1 ]
Hopp, Bruno [2 ]
机构
[1] ESRC Natl Ctr Res Methods, Southampton SO17 1QY, Hants, England
[2] GESIS Leibniz Inst Social Sci, D-50869 Cologne, Germany
关键词
Text analysis; Multidimensional scaling; Correspondence analysis; PINDIS; INDSCAL; ESTIMATING POLICY POSITIONS; INDIVIDUAL-DIFFERENCES;
D O I
10.1007/s11135-010-9350-8
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
We use the term "Computer Assisted Text Analysis" in a broad sense to refer to a range of current techniques from quantitative social science and content analysis to 'data mining' and 'text classification', including the analysis of open-ended survey questions, transcribed interviews and speeches, wherever, in fact, the researcher is confronted with data in the form of natural language texts of social scientific interest. These methods are often used in exploratory data analysis, but can also be applied systematically with moderate statistical rigour in the development and testing of hypotheses at various theoretical levels, ranging from the statistics of word usage to changes within or between discourses over time. The general approach is in the tradition of content analysis, by which words which occur together in relatively close proximity in the same context are interpreted as relating to a common theme or concept in the discourse studied. We review a comprehensive set of tools to identify and visualize structures of co-occurrence of words and concepts both within, and in comparing, a number of texts. These produce results not essentially different from those reached by representing word co-occurrences in terms of network analysis or neural network programming using schematic linguistic templates of various kinds. A comparison of the relational data analysis vs. a dictionary-based MDS approach shows that these provide very close if not identical results, despite the fact that the underlying assumptions are frequently represented as different theoretical approaches.
引用
收藏
页码:103 / 128
页数:26
相关论文
共 63 条
[1]  
Abdi H., 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05)Workshops, P42, DOI DOI 10.1109/CVPR.2005.445
[2]  
Alceste, ALC
[3]  
[Anonymous], ANAL COMMUNICATION C
[4]  
[Anonymous], 2005, VISUALISIERUNG KOMPL
[5]  
[Anonymous], 1963, PRINCIPLES NUMERICAL
[6]  
[Anonymous], 2005, P 22 INT C MACH LEAR
[7]   Analysing parliamentary debate with computer assistance [J].
Bara, Judith ;
Weale, Albert ;
Biquelet, Aude .
SWISS POLITICAL SCIENCE REVIEW, 2007, 13 (04) :577-605
[8]  
Bergman M.M., 2011, MIXED METHO IN PRESS
[9]  
Borg I., 1992, Grundlagen und Ergebnisse der Facettentheorie
[10]  
Borg I., 2005, Modern multidimensional scaling: theory and applications