Natural language processing for social science research: A comprehensive review

被引:0
|
作者
Hou, Yuxin [1 ,2 ]
Huang, Junming [3 ]
机构
[1] Peking Univ, Ctr Social Res, Beijing, Peoples R China
[2] Tsinghua Univ, Inst Educ, Beijing, Peoples R China
[3] Princeton Univ, Paul & Marcia Wythes Ctr Contemporary China, Princeton, NJ 08544 USA
关键词
Big data/data science; language/linguistics; quantitative methods; natural language processing; text analysis; neural network; topic model; COMPUTERIZED TEXT ANALYSIS; MEDIA; CULTURE; TWITTER; CLASSIFICATION; COMMUNICATION; SENTIMENT; MICROBLOGS; CAMPAIGNS; FACEBOOK;
D O I
10.1177/2057150X241306780
中图分类号
C91 [社会学];
学科分类号
030301 ; 1204 ;
摘要
Text data has been a longstanding pivotal source for social science research, providing an informative lens across disciplines including sociology, psychology, and political science. Its salient role in research, combined with the difficulty in numerically digesting unstructured data in natural languages, has been inspiring growing demands for natural language processing techniques to extract meaningful insights from vast text data. Breakthrough advances in natural language processing emerge with the recent expansion in data availability and computational resources, calling for an up-to-date comprehensive review for those methodologies and applications in social science research. This article reviews natural language processing techniques, detailing the procedure from representing unstructured text data to distilling semantic information, with expertise-based algorithms and unsupervised/supervised machine-learning methods. We then introduce their typical applications in producing research outcomes for sociology and political science. Keeping in mind challenges in data representativeness, interpretability, and biases, this review encourages utilizing natural language processing technique responsibly and effectively in social science research to improve quantitative understandings of emerging text data.
引用
收藏
页码:121 / 157
页数:37
相关论文
共 50 条
  • [41] Natural language processing for mental health interventions: a systematic review and research framework
    Malgaroli, Matteo
    Hull, Thomas D.
    Zech, James M.
    Althoff, Tim
    TRANSLATIONAL PSYCHIATRY, 2023, 13 (01)
  • [42] A scoping review on the use of natural language processing in research on political polarization: trends and research prospects
    Renáta Németh
    Journal of Computational Social Science, 2023, 6 : 289 - 313
  • [43] A scoping review on the use of natural language processing in research on political polarization: trends and research prospects
    Nemeth, Renata
    JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2023, 6 (01): : 289 - 313
  • [44] A systematic review of natural language processing applied to radiology reports
    Casey, Arlene
    Davidson, Emma
    Poon, Michael
    Dong, Hang
    Duma, Daniel
    Grivas, Andreas
    Grover, Claire
    Suarez-Paniagua, Victor
    Tobin, Richard
    Whiteley, William
    Wu, Honghan
    Alex, Beatrice
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01)
  • [45] A critical review of social media research in sensory-consumer science
    Hutchings, Scott C.
    Dixit, Yash
    Al-Sarayreh, Mahmoud
    Torrico, Damir D.
    Realini, Carolina E.
    Jaeger, Sara R.
    Reis, Marlon M.
    FOOD RESEARCH INTERNATIONAL, 2023, 165
  • [46] Scoping review on natural language processing applications in counselling and psychotherapy
    Laricheva, Maria
    Liu, Yan
    Shi, Edward
    Wu, Amery
    BRITISH JOURNAL OF PSYCHOLOGY, 2024,
  • [47] Deep learning in clinical natural language processing: a methodical review
    Wu, Stephen
    Roberts, Kirk
    Datta, Surabhi
    Du, Jingcheng
    Ji, Zongcheng
    Si, Yuqi
    Soni, Sarvesh
    Wang, Qiong
    Wei, Qiang
    Xiang, Yang
    Zhao, Bo
    Xu, Hua
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (03) : 457 - 470
  • [48] System for Monitoring Natural Disasters using Natural Language Processing in the Social Network Twitter
    Maldonado, Miguel
    Alulema, Darwin
    Morocho, Derlin
    Proano, Mariela
    2016 IEEE INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY (ICCST), 2016, : 79 - 84
  • [49] Applications of Natural Language Processing and Large Language Models for Social Determinants of Health: Protocol for a Systematic Review
    Rajwal, Swati
    Zhang, Ziyuan
    Chen, Yankai
    Rogers, Hannah
    Sarker, Abeed
    Xiao, Yunyu
    JMIR RESEARCH PROTOCOLS, 2025, 14
  • [50] Natural language processing applications in library and information science
    Taskin, Zehra
    Al, Umut
    ONLINE INFORMATION REVIEW, 2019, 43 (04) : 676 - 690