A Data-Driven Method of Discovering Misspellings of Medication Names on Twitter

被引:3
|
作者
Jiang, Keyuan [1 ]
Chen, Tingyu [1 ]
Huang, Liyuan [1 ]
Calix, Ricardo A. [1 ]
Bernard, Gordon R. [2 ]
机构
[1] Purdue Univ Northwest, Dept Comp Informat Technol & Graph, Hammond, IA USA
[2] Vanderbilt Univ, Dept Med, Nashville, TN USA
来源
BUILDING CONTINENTS OF KNOWLEDGE IN OCEANS OF DATA: THE FUTURE OF CO-CREATED EHEALTH | 2018年 / 247卷
基金
美国国家卫生研究院;
关键词
Information retrieval; Pharmacovigilance; Postmarking surveillance; Relational similarity; Twitter; Distributed word representation; Misspellings;
D O I
10.3233/978-1-61499-852-5-136
中图分类号
R-058 [];
学科分类号
摘要
Twitter, as a microblogging social media platform, has seen increasing applications of its data for pharmacovigilance which is to monitor and promote safe uses of pharmaceutical products. Medication names are typically used as keywords to query social media data. It is known that medication names are misspelled on social media, and finding the misspellings is challenging because there exists no a priori knowledge as to how people would misspell a medication name. We developed a data-driven, relational similarity-based approach to discover misspellings of medication names. Our approach is based upon the assumption of the identical (or similar) association of a medicine with its effects whether the medication is correctly spelled or misspelled. With distributed representations of the words in tweets posted in recent 24 months, we were able to discover a total of 54 misspellings of 6 medicines whose indications containing headache. Our search results also show that Twitter posts with misspellings of codeine and ibuprofen can be more than 10% of all the tweets associated with each of the medicines. Compared with the phonetics-based approach, our method discovered more actual misspellings used on Twitter.
引用
收藏
页码:136 / 140
页数:5
相关论文
共 50 条
  • [21] Data-Driven Prediction of Athletes' Performance Based on Their Social Media Presence
    Dreyer, Frank
    Greif, Jannik
    Guenther, Kolja
    Spiliopoulou, Myra
    Niemann, Uli
    DISCOVERY SCIENCE (DS 2022), 2022, 13601 : 197 - 211
  • [22] Knowledge-Based and Data-Driven Approaches for Georeferencing of Informal Documents
    Ferres, Daniel
    Rodriguez, Horacio
    TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 452 - 460
  • [23] Soft City Sensing: A turn to computational humanities in data-driven urbanism
    Madsen, Anders Koed
    Grundtvig, Anders
    Thorsen, Sofie
    CITIES, 2022, 126
  • [24] A Survey on Data-Driven Evaluation of Competencies and Capabilities Across Multimedia Environments
    Strukova, Sofia
    Ruiperez-Valiente, Jose A.
    Marmol, Felix Gomez
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2023, 8 (04): : 182 - 201
  • [25] Data-Driven Cyber Security in Perspective-Intelligent Traffic Analysis
    Coulter, Rory
    Han, Qing-Long
    Pan, Lei
    Zhang, Jun
    Xiang, Yang
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (07) : 3081 - 3093
  • [26] Understanding the User Behavior of Foursquare: A Data-Driven Study on a Global Scale
    Chen, Yang
    Hu, Jiyao
    Xiao, Yu
    Li, Xiang
    Hui, Pan
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2020, 7 (04) : 1019 - 1032
  • [27] Evaluation of retweet clustering method classification method using retweets on Twitter without text data
    Uchida, K.
    Toriumi, F.
    Sakaki, T.
    2017 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2017), 2017, : 187 - 194
  • [28] Psychological Health and Drugs: Data-Driven Discovery of Causes, Treatments, Effects, and Abuses
    Alswedani, Sarah
    Mehmood, Rashid
    Katib, Iyad
    Altowaijri, Saleh M.
    TOXICS, 2023, 11 (03)
  • [29] A Data-Driven Framework for Coding the Intent and Extent of Political Tweeting, Disinformation, and Extremism
    Hashemi, Mahdi
    INFORMATION, 2021, 12 (04)
  • [30] Data-Driven Content Analysis of Social Media: A Systematic Overview of Automated Methods
    Schwartz, H. Andrew
    Ungar, Lyle H.
    ANNALS OF THE AMERICAN ACADEMY OF POLITICAL AND SOCIAL SCIENCE, 2015, 659 (01): : 78 - 94