Natural language processing (NLP) aided qualitative method in health research

被引:13
作者
Cheligeer, Cheligeer [1 ,2 ]
Yang, Lin [3 ,4 ,5 ]
Nandi, Tannistha [6 ]
Doktorchik, Chelsea [2 ,5 ]
Quan, Hude [2 ,5 ]
Zeng, Yong [1 ]
Singh, Shaminder [5 ,7 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ, Canada
[2] Univ Calgary, Cumming Sch Med, Ctr Hlth Informat, Calgary, AB, Canada
[3] Alberta Hlth Serv, Dept Canc Epidemiol & Prevent Res, Calgary, AB, Canada
[4] Univ Calgary, Cumming Sch Med, Dept Canc, Calgary, AB, Canada
[5] Univ Calgary, Cumming Sch Med, Dept Community Hlth Sci, Calgary, AB, Canada
[6] Univ Calgary, Dept Informat Technol, Res Comp Serv, Calgary, AB, Canada
[7] Mt Royal Univ, Fac Hlth Community & Educ, Sch Nursing & Midwifery, Calgary, AB, Canada
关键词
Qualitative health research; grounded theory; natural language processing; machine learning; artificial intelligence; text clustering; keyword extraction;
D O I
10.3233/JID-220013
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Qualitative data analysis is produced frequently in healthcare settings, which is a time-consuming and skilled analytic task. The use of qualitative research findings in clinical settings takes years, which is sometimes obsolete knowledge as the health context is dynamic. Artificial Intelligence (AI)-based qualitative data analysis might present with rapid analysis of text-based data in real-time, thereby empowering qualitative researchers to expedite their analysis and facilitate timely use of the research findings. We tested an AI-based method to complement the manual analysis of text-based data from the verbatim transcripts of seven mall managers' interviews. First, we prepared text data into a machine-calculable format and employed BERT model to extract sentence-level features in our case. Second, we implement TF-IDF-based keywords mining techniques to extract the main candidate themes from the interview transcripts to support text-based analysis, including: 1) primary cluster detection algorithm, and 2) keyword extraction algorithm. The extracted core themes provide qualitative researchers with a more comprehensive overview of the qualitative data. Most of the sentences clustered in meaningful short topics or sentences carrying independent and clear information. The extracted topics and clustered sentences reduced qualitative researchers' workload by condensing and identifying meaningful concepts and naming them. This method combining contextualized word embeddings, unsupervised clustering, and keyword extraction techniques can significantly reduce the overall workload and time consumed in qualitative research using conventional methods.
引用
收藏
页码:41 / 58
页数:18
相关论文
共 43 条
[1]  
Alsentzer E, 2019, Arxiv, DOI arXiv:1904.03323
[2]  
Arora S, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P2650
[3]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[4]   Governing artificial intelligence: ethical, legal and technical opportunities and challenges Introduction [J].
Cath, Corinne .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2018, 376 (2133)
[5]  
Chang J., 2009, NIPS, DOI DOI 10.5555/2984093.2984126
[6]   Qualitative research in healthcare: an introduction to grounded theory using thematic analysis [J].
Chapman, A. L. ;
Hadfield, M. ;
Chapman, C. J. .
JOURNAL OF THE ROYAL COLLEGE OF PHYSICIANS OF EDINBURGH, 2015, 45 (03) :201-205
[7]  
Charmaz K., 2006, Constructing grounded theory: A practical guide through qualitative analysis, P1, DOI [10.7748/nr.13.4.84.s4, DOI 10.7748/NR.13.4.84.S4]
[8]   Using Machine Learning to Support Qualitative Coding in Social Science: Shifting the Focus to Ambiguity [J].
Chen, Nan-Chen ;
Drouhard, Margaret ;
Kocielnik, Rafal ;
Suh, Jina ;
Aragon, Cecilia R. .
ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2018, 8 (02)
[9]   Evaluation of BERT and ALBERT Sentence Embedding Performance on Downstream NLP Tasks [J].
Choi, Hyunjin ;
Kim, Judong ;
Joe, Seongho ;
Gwon, Youngjune .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :5482-5487
[10]   Grounded theory research: A design framework for novice researchers [J].
Chun Tie, Ylona ;
Birks, Melanie ;
Francis, Karen .
SAGE OPEN MEDICINE, 2019, 7