Natural language processing for social science research: A comprehensive review

被引:0
|
作者
Hou, Yuxin [1 ,2 ]
Huang, Junming [3 ]
机构
[1] Peking Univ, Ctr Social Res, Beijing, Peoples R China
[2] Tsinghua Univ, Inst Educ, Beijing, Peoples R China
[3] Princeton Univ, Paul & Marcia Wythes Ctr Contemporary China, Princeton, NJ 08544 USA
关键词
Big data/data science; language/linguistics; quantitative methods; natural language processing; text analysis; neural network; topic model; COMPUTERIZED TEXT ANALYSIS; MEDIA; CULTURE; TWITTER; CLASSIFICATION; COMMUNICATION; SENTIMENT; MICROBLOGS; CAMPAIGNS; FACEBOOK;
D O I
10.1177/2057150X241306780
中图分类号
C91 [社会学];
学科分类号
030301 ; 1204 ;
摘要
Text data has been a longstanding pivotal source for social science research, providing an informative lens across disciplines including sociology, psychology, and political science. Its salient role in research, combined with the difficulty in numerically digesting unstructured data in natural languages, has been inspiring growing demands for natural language processing techniques to extract meaningful insights from vast text data. Breakthrough advances in natural language processing emerge with the recent expansion in data availability and computational resources, calling for an up-to-date comprehensive review for those methodologies and applications in social science research. This article reviews natural language processing techniques, detailing the procedure from representing unstructured text data to distilling semantic information, with expertise-based algorithms and unsupervised/supervised machine-learning methods. We then introduce their typical applications in producing research outcomes for sociology and political science. Keeping in mind challenges in data representativeness, interpretability, and biases, this review encourages utilizing natural language processing technique responsibly and effectively in social science research to improve quantitative understandings of emerging text data.
引用
收藏
页码:121 / 157
页数:37
相关论文
共 50 条
  • [1] Designing a Natural Language Processing System to Support Social Science Research
    Gone, Keshava Pallavi
    Smit, Michael
    PROCEEDINGS OF THE 2023 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING, ASONAM 2023, 2023, : 345 - 347
  • [2] A Natural Language Processing Approach to Social License Management
    Boutilier, Robert G.
    Bahr, Kyle
    SUSTAINABILITY, 2020, 12 (20) : 1 - 12
  • [3] The Utilization of Natural Language Processing for Analyzing Social Media Data in Nursing Research: A Scoping Review
    Wang, Zhenrong
    Ma, Yulin
    Song, Yuanyuan
    Huang, Yao
    Liang, Guopeng
    Zhong, Xi
    JOURNAL OF NURSING MANAGEMENT, 2024, 2024 (01) : 2857497
  • [4] A comprehensive review on resolving ambiguities in natural language processing
    Yadav, Apurwa
    Patel, Aarshil
    Shah, Manan
    AI OPEN, 2021, 2 : 85 - 92
  • [5] A comprehensive review of deep learning for natural language processing
    Bouraoui, Amal
    Jamoussi, Salma
    Ben Hamadou, Abdelmajid
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2022, 14 (02) : 149 - 182
  • [6] Natural language processing (NLP) in management research: A literature review
    Kang, Yue
    Cai, Zhao
    Tan, Chee-Wee
    Huang, Qian
    Liu, Hefu
    JOURNAL OF MANAGEMENT ANALYTICS, 2020, 7 (02) : 139 - 172
  • [7] Transforming epilepsy research: A systematic review on natural language processing applications
    Yew, Arister N. J.
    Schraagen, Marijn
    Otte, Willem M. M.
    van Diessen, Eric
    EPILEPSIA, 2023, 64 (02) : 292 - 305
  • [8] Natural language processing for urban research: A systematic review
    Cai, Meng
    HELIYON, 2021, 7 (03)
  • [9] Advances in natural language processing for healthcare: A comprehensive review of techniques, applications, and future directions
    Alafari, Fatmah
    Driss, Maha
    Cherif, Asma
    COMPUTER SCIENCE REVIEW, 2025, 56
  • [10] Effectiveness of Recent Research Approaches in Natural Language Processing on Data Science-An Insight
    Shruthi, J.
    Swamy, Suma
    COMPUTATIONAL AND STATISTICAL METHODS IN INTELLIGENT SYSTEMS, 2019, 859 : 172 - 182