Natural language processing for social science research: A comprehensive review

Authors
Hou, Yuxin [1, 2]
Huang, Junming [3 ]
Affiliations
[1] Peking Univ, Ctr Social Res, Beijing, Peoples R China
[2] Tsinghua Univ, Inst Educ, Beijing, Peoples R China
[3] Princeton Univ, Paul & Marcia Wythes Ctr Contemporary China, Princeton, NJ 08544 USA
Keywords
Big data/data science; language/linguistics; quantitative methods; natural language processing; text analysis; neural network; topic model; COMPUTERIZED TEXT ANALYSIS; MEDIA; CULTURE; TWITTER; CLASSIFICATION; COMMUNICATION; SENTIMENT; MICROBLOGS; CAMPAIGNS; FACEBOOK;
DOI
10.1177/2057150X241306780
Chinese Library Classification number
C91 [Sociology];
Discipline classification code
030301 ; 1204 ;
Abstract
Text data has long been a pivotal source for social science research, providing an informative lens across disciplines including sociology, psychology, and political science. Its salient role in research, combined with the difficulty of numerically digesting unstructured natural-language data, has inspired growing demand for natural language processing techniques that extract meaningful insights from vast text data. Breakthrough advances in natural language processing have emerged with the recent expansion in data availability and computational resources, calling for an up-to-date, comprehensive review of these methodologies and their applications in social science research. This article reviews natural language processing techniques, detailing the procedure from representing unstructured text data to distilling semantic information, with expertise-based algorithms and unsupervised/supervised machine-learning methods. We then introduce their typical applications in producing research outcomes for sociology and political science. Keeping in mind challenges in data representativeness, interpretability, and biases, this review encourages utilizing natural language processing techniques responsibly and effectively in social science research to improve quantitative understanding of emerging text data.
Pages: 121-157
Number of pages: 37