A Data-Driven Method of Discovering Misspellings of Medication Names on Twitter

被引:3
|
作者
Jiang, Keyuan [1 ]
Chen, Tingyu [1 ]
Huang, Liyuan [1 ]
Calix, Ricardo A. [1 ]
Bernard, Gordon R. [2 ]
机构
[1] Purdue Univ Northwest, Dept Comp Informat Technol & Graph, Hammond, IA USA
[2] Vanderbilt Univ, Dept Med, Nashville, TN USA
来源
BUILDING CONTINENTS OF KNOWLEDGE IN OCEANS OF DATA: THE FUTURE OF CO-CREATED EHEALTH | 2018年 / 247卷
基金
美国国家卫生研究院;
关键词
Information retrieval; Pharmacovigilance; Postmarking surveillance; Relational similarity; Twitter; Distributed word representation; Misspellings;
D O I
10.3233/978-1-61499-852-5-136
中图分类号
R-058 [];
学科分类号
摘要
Twitter, as a microblogging social media platform, has seen increasing applications of its data for pharmacovigilance which is to monitor and promote safe uses of pharmaceutical products. Medication names are typically used as keywords to query social media data. It is known that medication names are misspelled on social media, and finding the misspellings is challenging because there exists no a priori knowledge as to how people would misspell a medication name. We developed a data-driven, relational similarity-based approach to discover misspellings of medication names. Our approach is based upon the assumption of the identical (or similar) association of a medicine with its effects whether the medication is correctly spelled or misspelled. With distributed representations of the words in tweets posted in recent 24 months, we were able to discover a total of 54 misspellings of 6 medicines whose indications containing headache. Our search results also show that Twitter posts with misspellings of codeine and ibuprofen can be more than 10% of all the tweets associated with each of the medicines. Compared with the phonetics-based approach, our method discovered more actual misspellings used on Twitter.
引用
收藏
页码:136 / 140
页数:5
相关论文
共 50 条
  • [31] Towards data-driven sustainable design: decision support based on knowledge discovery in disparate building data
    Petrova, Ekaterina
    Pauwels, Pieter
    Svidt, Kjeld
    Jensen, Rasmus Lund
    ARCHITECTURAL ENGINEERING AND DESIGN MANAGEMENT, 2019, 15 (05) : 334 - 356
  • [32] Data-driven prediction of adverse drug reactions induced by drug-drug interactions
    Liu, Ruifeng
    AbdulHameed, Mohamed Diwan M.
    Kumar, Kamal
    Yu, Xueping
    Wallqvist, Anders
    Reifman, Jaques
    BMC PHARMACOLOGY & TOXICOLOGY, 2017, 18
  • [33] Diffusion of real versus misinformation during a crisis event: A big data-driven approach
    King, Kelvin K.
    Wang, Bin
    INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2023, 71
  • [34] Data-driven prediction of adverse drug reactions induced by drug-drug interactions
    Ruifeng Liu
    Mohamed Diwan M. AbdulHameed
    Kamal Kumar
    Xueping Yu
    Anders Wallqvist
    Jaques Reifman
    BMC Pharmacology and Toxicology, 18
  • [35] The 2011-2020 Trends of Data-Driven Approaches in Medical Informatics for Active Pharmacovigilance
    Shin, Hyunah
    Cha, Jaehun
    Lee, Chungchun
    Song, Hyejin
    Jeong, Hyuntae
    Kim, Jong-Yeup
    Lee, Suehyun
    APPLIED SCIENCES-BASEL, 2021, 11 (05): : 1 - 13
  • [36] Comparative analysis of TF-IDF and loglikelihood method for keywords extraction of twitter data
    Abid, Muhammad Adeel
    Mushtaq, Muhammad Faheem
    Akram, Urooj
    Abbasi, Mateen Ahmed
    Rustam, Furqan
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2023, 42 (01) : 88 - 94
  • [37] Theory building with big data-driven research - Moving away from the "What" towards the "Why"
    Kar, Arpan Kumar
    Dwivedi, Yogesh K.
    INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2020, 54
  • [38] A Context-Aware Data-Driven Algorithm for Small Cell Site Selection in Cellular Networks
    Bejarano-Luque, Juan L.
    Toril, Matias
    Fernandez-Navarro, Mariano
    Garcia, Antonio J.
    Luna-Ramirez, Salvador
    IEEE ACCESS, 2020, 8 : 105335 - 105350
  • [39] A Machine Learning Based Method for Automatic Identification of Disaster Related Information Using Twitter Data
    Christidou, Athina Ntiana
    Drakaki, Maria
    Linardos, Vasileios
    INTELLIGENT AND FUZZY SYSTEMS: DIGITAL ACCELERATION AND THE NEW NORMAL, INFUS 2022, VOL 2, 2022, 505 : 70 - 76
  • [40] Wiki-LDA: A Mixed-Method Approach for Effective Interest Mining on Twitter Data
    Pu, Xiao
    Chatti, Mohamed Amine
    Thues, Hendrik
    Schroeder, Ulrik
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED EDUCATION, VOL 1 (CSEDU), 2016, : 426 - 433