Interdisciplinary Approach to Identify and Characterize COVID-19 Misinformation on Twitter: Mixed Methods Study

被引:1
|
作者
Tan, Iris Thiele Isip [1 ]
Cleofas, Jerome [2 ]
Solano, Geoffrey [3 ]
Pillejera, Jeanne Genevive [1 ]
Catapang, Jasper Kyle [4 ,5 ]
机构
[1] Univ Philippines Manila, Coll Med, Med Informat Unit, Manila, Philippines
[2] De La Salle Univ, Behav Sci Dept, Manila, Philippines
[3] Univ Philippines Manila, Math & Comp Sci Unit, Manila, Philippines
[4] Univ Birmingham, English Language & Linguist, Birmingham, England
[5] Univ Birmingham, English Language & Linguist, Birmingham B15 2TT, England
基金
美国国家卫生研究院;
关键词
COVID-19; misinformation; natural language processing; Twitter; biterm topic modeling;
D O I
10.2196/41134
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Studying COVID-19 misinformation on Twitter presents methodological challenges. A computational approach can analyze large data sets, but it is limited when interpreting context. A qualitative approach allows for a deeper analysis of content, but it is labor-intensive and feasible only for smaller data sets. Objective: We aimed to identify and characterize tweets containing COVID-19 misinformation.Methods: Tweets geolocated to the Philippines (January 1 to March 21, 2020) containing the words coronavirus, covid, and ncov were mined using the GetOldTweets3 Python library. This primary corpus (N=12,631) was subjected to biterm topic modeling. Key informant interviews were conducted to elicit examples of COVID-19 misinformation and determine keywords. Using NVivo (QSR International) and a combination of word frequency and text search using key informant interview keywords, subcorpus A (n=5881) was constituted and manually coded to identify misinformation. Constant comparative, iterative, and consensual analyses were used to further characterize these tweets. Tweets containing key informant interview keywords were extracted from the primary corpus and processed to constitute subcorpus B (n=4634), of which 506 tweets were manually labeled as misinformation. This training set was subjected to natural language processing to identify tweets with misinformation in the primary corpus. These tweets were further manually coded to confirm labeling.Results: Biterm topic modeling of the primary corpus revealed the following topics: uncertainty, lawmaker's response, safety measures, testing, loved ones, health standards, panic buying, tragedies other than COVID-19, economy, COVID-19 statistics, precautions, health measures, international issues, adherence to guidelines, and frontliners. These were categorized into 4 major topics: nature of COVID-19, contexts and consequences, people and agents of COVID-19, and COVID-19 prevention and management. Manual coding of subcorpus A identified 398 tweets with misinformation in the following formats: misleading content (n=179), satire and/or parody (n=77), false connection (n=53), conspiracy (n=47), and false context (n=42). The discursive strategies identified were humor (n=109), fear mongering (n=67), anger and disgust (n=59), political commentary (n=59), performing credibility (n=45), overpositivity (n=32), and marketing (n=27). Natural language processing identified 165 tweets with misinformation. However, a manual review showed that 69.7% (115/165) of tweets did not contain misinformation.Conclusions: An interdisciplinary approach was used to identify tweets with COVID-19 misinformation. Natural language processing mislabeled tweets, likely due to tweets written in Filipino or a combination of the Filipino and English languages. Identifying the formats and discursive strategies of tweets with misinformation required iterative, manual, and emergent coding by human coders with experiential and cultural knowledge of Twitter. An interdisciplinary team composed of experts in health, health informatics, social science, and computer science combined computational and qualitative methods to gain a better understanding of COVID-19 misinformation on Twitter.(JMIR Form Res 2023;7:e41134) doi: 10.2196/41134
引用
收藏
页数:15
相关论文
共 50 条
  • [31] ANTi-Vax: a novel Twitter dataset for COVID-19 vaccine misinformation detection
    Hayawi, K.
    Shahriar, S.
    Serhani, M. A.
    Taleb, I
    Mathew, S. S.
    PUBLIC HEALTH, 2022, 203 : 23 - 30
  • [32] An analysis of AstraZeneca COVID-19 vaccine misinformation and fear mongering on Twitter
    Jemielniak, D.
    Krempovych, Y.
    PUBLIC HEALTH, 2021, 200 : 4 - 6
  • [33] Psychological reactance, misinformation, and distrust: A mixed methods analysis of COVID-19 vaccine uptake
    Huang, Lily
    Huschka, Todd R.
    Barwise, Amelia K.
    Allen, Jay-Sheree P.
    Wolfersteig, Wendy
    Hamm, Kathryn
    Cardenas, Lilliana D.
    Phelan, Sean M.
    Allyse, Megan A.
    JOURNAL OF CLINICAL AND TRANSLATIONAL SCIENCE, 2024, 8 (01)
  • [34] Twitter users' coping behaviors during the COVID-19 lockdown: an analysis of tweets using mixed methods
    Mittal, Ruchi
    Ahmed, Wasim
    Mittal, Amit
    Aggarwal, Ishan
    INFORMATION DISCOVERY AND DELIVERY, 2021, 49 (03) : 193 - 202
  • [35] COVID-19 Misinformation on Social Media: A Scoping Review
    Joseph, Andrew M.
    Fernandez, Virginia
    Kritzman, Sophia
    Eaddy, Isabel
    Cook, Olivia M.
    Lambros, Sarah
    Silva, Cesar E. Jara
    Arguelles, Daryl
    Abraham, Christy
    Dorgham, Noelle
    Gilbert, Zachary A.
    Chacko, Lindsey
    Hirpara, Ram J.
    Mayi, Bindu S.
    Jacobs, Robin J.
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2022, 14 (04)
  • [36] The impact of misinformation on the COVID-19 pandemic
    Caceres, Maria Mercedes Ferreira
    Sosa, Juan Pablo
    Lawrence, Jannel A.
    Sestacovschi, Cristina
    Tidd-Johnson, Atiyah
    Rasool, Muhammad Haseeb U., I
    Gadamidi, Vinay Kumar
    Ozair, Saleha
    Pandav, Krunal
    Cuevas-Lou, Claudia
    Parrish, Matthew
    Rodriguez, Ivan
    Fernandez, Javier Perez
    AIMS PUBLIC HEALTH, 2022, 9 (02): : 262 - 277
  • [37] Belief in COVID-19 Misinformation in Nigeria
    Goldstein, Josh A.
    Grossman, Shelby
    Startz, Meredith
    JOURNAL OF POLITICS, 2024, 86 (02) : 810 - 814
  • [38] The COVID-19 Infodemic: Twitter versus Facebook
    Yang, Kai-Cheng
    Pierri, Francesco
    Hui, Pik-Mai
    Axelrod, David
    Torres-Lugo, Christopher
    Bryden, John
    Menczer, Filippo
    BIG DATA & SOCIETY, 2021, 8 (01):
  • [39] Misinformation and COVID-19 vaccine hesitancy
    Zimmerman, Tara
    Shiroma, Kristina
    Fleischmann, Kenneth R.
    Xie, Bo
    Jia, Chenyan
    Verma, Nitin
    Lee, Min Kyung
    VACCINE, 2023, 41 (01) : 136 - 144
  • [40] Dissemination and Acceptance of COVID-19 Misinformation in Iran: A Qualitative Study
    Taghipour, Faezeh
    Ashrafi-rizi, Hasan
    Soleymani, Mohammad Reza
    INTERNATIONAL QUARTERLY OF COMMUNITY HEALTH EDUCATION, 2021,