Natural language processing for social science research: A comprehensive review

被引:0
|
作者
Hou, Yuxin [1 ,2 ]
Huang, Junming [3 ]
机构
[1] Peking Univ, Ctr Social Res, Beijing, Peoples R China
[2] Tsinghua Univ, Inst Educ, Beijing, Peoples R China
[3] Princeton Univ, Paul & Marcia Wythes Ctr Contemporary China, Princeton, NJ 08544 USA
关键词
Big data/data science; language/linguistics; quantitative methods; natural language processing; text analysis; neural network; topic model; COMPUTERIZED TEXT ANALYSIS; MEDIA; CULTURE; TWITTER; CLASSIFICATION; COMMUNICATION; SENTIMENT; MICROBLOGS; CAMPAIGNS; FACEBOOK;
D O I
10.1177/2057150X241306780
中图分类号
C91 [社会学];
学科分类号
030301 ; 1204 ;
摘要
Text data has been a longstanding pivotal source for social science research, providing an informative lens across disciplines including sociology, psychology, and political science. Its salient role in research, combined with the difficulty in numerically digesting unstructured data in natural languages, has been inspiring growing demands for natural language processing techniques to extract meaningful insights from vast text data. Breakthrough advances in natural language processing emerge with the recent expansion in data availability and computational resources, calling for an up-to-date comprehensive review for those methodologies and applications in social science research. This article reviews natural language processing techniques, detailing the procedure from representing unstructured text data to distilling semantic information, with expertise-based algorithms and unsupervised/supervised machine-learning methods. We then introduce their typical applications in producing research outcomes for sociology and political science. Keeping in mind challenges in data representativeness, interpretability, and biases, this review encourages utilizing natural language processing technique responsibly and effectively in social science research to improve quantitative understandings of emerging text data.
引用
收藏
页码:121 / 157
页数:37
相关论文
共 50 条
  • [31] Natural Language Processing in Oncology A Review
    Yim, Wen-wai
    Yetisgen, Meliha
    Harris, William P.
    Kwan, Sharon W.
    JAMA ONCOLOGY, 2016, 2 (06) : 797 - 804
  • [32] Natural language processing with transformers: a review
    Tucudean, Georgiana
    Bucos, Marian
    Dragulescu, Bogdan
    Caleanu, Catalin Daniel
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [33] Review of Natural Language Processing in Radiology
    Luo, Jack W.
    Chong, Jaron J. R.
    NEUROIMAGING CLINICS OF NORTH AMERICA, 2020, 30 (04) : 447 - +
  • [34] A New Approach to Social Engineering with Natural Language Processing: RAKE
    Aydogan, Ahmet Furkan
    An, Min Kyung
    Yilmaz, Mehmet
    9TH INTERNATIONAL SYMPOSIUM ON DIGITAL FORENSICS AND SECURITY (ISDFS'21), 2021,
  • [35] Natural language processing with transformers: a review
    Tucudean, Georgiana
    Bucos, Marian
    Dragulescu, Bogdan
    Caleanu, Catalin Daniel
    PeerJ Computer Science, 2024, 10
  • [36] HugNLP: A Unified and Comprehensive Library for Natural Language Processing
    Wang, Jianing
    Chen, Nuo
    Sun, Qiushi
    Huang, Wenkang
    Wang, Chengyu
    Gao, Ming
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 5111 - 5116
  • [37] From semantics to pragmatics: where IS can lead in Natural Language Processing (NLP) research
    Li, Yan
    Thomas, Manoj A.
    Liu, Dapeng
    EUROPEAN JOURNAL OF INFORMATION SYSTEMS, 2021, 30 (05) : 569 - 590
  • [38] A systematic review on natural language processing systems for eligibility prescreening in clinical research
    Idnay, Betina
    Dreisbach, Caitlin
    Weng, Chunhua
    Schnall, Rebecca
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 29 (01) : 197 - 206
  • [39] Natural Language Processing Adoption in Governments and Future Research Directions: A Systematic Review
    Jiang, Yunqing
    Pang, Patrick Cheong-Iao
    Wong, Dennis
    Kan, Ho Yin
    APPLIED SCIENCES-BASEL, 2023, 13 (22):
  • [40] Natural Language Processing in Internal Auditing - a Structured Literature Review Completed Research
    Schumann, Gerrit
    Gomez, Jorge Marx
    DIGITAL INNOVATION AND ENTREPRENEURSHIP (AMCIS 2021), 2021,