Using Supervised Machine Learning in Automated Content Analysis: An Example Using Relational Uncertainty

被引:28
作者
Pilny, Andrew [1 ]
McAninch, Kelly [1 ]
Slone, Amanda [1 ]
Moore, Kelsey [1 ]
机构
[1] Univ Kentucky, Dept Commun, Lexington, KY USA
关键词
RELATIONSHIP QUALITY; DEPRESSIVE SYMPTOMS; COMMUNICATION; TURBULENCE; SELECTION; PITFALLS; FRAME; TEXT;
D O I
10.1080/19312458.2019.1650166
中图分类号
G2 [信息与知识传播];
学科分类号
05 ; 0503 ;
摘要
The goal of this research is to make progress towards using supervised machine leaming for automated content analysis dealing with complex interpretations of text. For Step 1, two humans coded a sub-sample of online forum posts for relational uncertainty. For Step 2, we evaluated reliability, in which we trained three different classifiers to learn from those subjective human interpretations. Reliability was established when two different metrics of inter-coder reliability could not distinguish whether a human or a machine coded the text on a separate hold-out set. Finally, in Step 3 we assessed validity. To accomplish this, we administered a survey in which participants described their own relational uncertainty/certainty via text and completed a questionnaire. After classifying the text, the machine's classifications of the participants' text positively correlated with the subjects' own self-reported relational uncertainty and relational satisfaction. We discuss our results in line with areas of computational communication science, content analysis, and interpersonal communication.
引用
收藏
页码:287 / 304
页数:18
相关论文
共 65 条
[1]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[2]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[3]   Shared cognition and participation in small groups: Similarity of member prototypes [J].
Bonito, JA .
COMMUNICATION RESEARCH, 2004, 31 (06) :704-730
[4]  
Bonito JA, 2018, CAMB HANDB PSYCHOL, P387
[5]   TAKING STOCK OF THE TOOLKIT An overview of relevant automated content analysis approaches and techniques for digital journalism scholars [J].
Boumans, Jelle W. ;
Trilling, Damian .
DIGITAL JOURNALISM, 2016, 4 (01) :8-23
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]  
Brownlee J., 2016, Master Machine Learning Algorithms - Discover how they work and Implement Them From Scratch, chapter 7
[8]   Teaching the Computer to Code Frames in News: Comparing Two Supervised Machine Learning Approaches to Frame Analysis [J].
Burscher, Bjoern ;
Odijk, Daan ;
Vliegenthart, Rens ;
de Rijke, Maarten ;
de Vreese, Claes H. .
COMMUNICATION METHODS AND MEASURES, 2014, 8 (03) :190-206
[9]   Using Supervised Machine Learning to Code Policy Issues: Can Classifiers Generalize across Contexts? [J].
Burscher, Bjorn ;
Vliegenthart, Rens ;
de Vreese, Claes H. .
ANNALS OF THE AMERICAN ACADEMY OF POLITICAL AND SOCIAL SCIENCE, 2015, 659 (01) :122-131
[10]   A survey on feature selection methods [J].
Chandrashekar, Girish ;
Sahin, Ferat .
COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (01) :16-28