A Data-Driven Method of Discovering Misspellings of Medication Names on Twitter

Cited by: 3
Authors
Jiang, Keyuan [1]
Chen, Tingyu [1]
Huang, Liyuan [1]
Calix, Ricardo A. [1]
Bernard, Gordon R. [2]
Affiliations
[1] Purdue Univ Northwest, Dept Comp Informat Technol & Graph, Hammond, IN USA
[2] Vanderbilt Univ, Dept Med, Nashville, TN USA
Source
BUILDING CONTINENTS OF KNOWLEDGE IN OCEANS OF DATA: THE FUTURE OF CO-CREATED EHEALTH | 2018 / Vol. 247
Funding
National Institutes of Health (USA)
Keywords
Information retrieval; Pharmacovigilance; Postmarketing surveillance; Relational similarity; Twitter; Distributed word representation; Misspellings
DOI
10.3233/978-1-61499-852-5-136
Chinese Library Classification (CLC)
R-058
Abstract
Twitter, a microblogging social media platform, has seen increasing use of its data for pharmacovigilance, that is, monitoring and promoting the safe use of pharmaceutical products. Medication names are typically used as keywords to query social media data. It is known that medication names are misspelled on social media, and finding the misspellings is challenging because there is no a priori knowledge of how people will misspell a medication name. We developed a data-driven, relational similarity-based approach to discover misspellings of medication names. Our approach assumes that a medicine has an identical (or similar) association with its effects whether the medication name is spelled correctly or misspelled. Using distributed representations of the words in tweets posted over a recent 24-month period, we discovered a total of 54 misspellings of 6 medicines whose indications include headache. Our search results also show that Twitter posts with misspellings of codeine and ibuprofen can account for more than 10% of all the tweets associated with each of these medicines. Compared with the phonetics-based approach, our method discovered more actual misspellings used on Twitter.
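The relational-similarity idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the scoring rule (cosine similarity between "word to effect" offset vectors), the function names, and the three-dimensional toy vectors are all assumptions made up for illustration. In practice the vectors would come from a word-embedding model trained on tweets.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def relation(vectors, word, effect):
    """Offset vector from a word to an effect term: v(effect) - v(word)."""
    return [e - w for w, e in zip(vectors[word], vectors[effect])]

def rank_candidates(vectors, drug, effect, top_k=3):
    """Rank vocabulary words by how closely their relation to `effect`
    matches the relation of the correctly spelled `drug` to `effect`."""
    reference = relation(vectors, drug, effect)
    scored = []
    for word in vectors:
        if word in (drug, effect):
            continue
        scored.append((cosine(reference, relation(vectors, word, effect)), word))
    scored.sort(reverse=True)
    return [(word, score) for score, word in scored[:top_k]]

# Hypothetical toy embeddings, invented for this sketch only.
TOY_VECTORS = {
    "ibuprofen": [0.90, 0.10, 0.20],
    "ibuprofin": [0.88, 0.12, 0.22],  # assumed misspelling, placed near the drug
    "headache":  [0.20, 0.90, 0.10],
    "pizza":     [0.10, 0.20, 0.95],
    "monday":    [0.30, 0.85, 0.30],
}

if __name__ == "__main__":
    for word, score in rank_candidates(TOY_VECTORS, "ibuprofen", "headache"):
        print(f"{word}\t{score:.3f}")
```

With these toy vectors the assumed misspelling ranks first because its offset to "headache" points in nearly the same direction as that of the correct spelling. A real pipeline would likely also filter candidates, for example by string similarity to the drug name, which this sketch omits.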
Pages: 136-140
Page count: 5