Identifying Health-Related Discussions of Cannabis Use on Twitter by Using a Medical Dictionary: Content Analysis of Tweets

被引:6
作者
Allem, Jon-Patrick [1 ]
Majmundar, Anuja [2 ]
Dormanesh, Allison [1 ]
Donaldson, Scott, I [1 ]
机构
[1] Univ Southern Calif, Keck Sch Med, Dept Populat & Publ Hlth Sci, 2001 N Soto St,SSB 312D, Los Angeles, CA 90032 USA
[2] Amer Canc Soc, Dept Surveillance & Hlth Equ Sci, Kennesaw, GA USA
关键词
cannabis; marijuana; Twitter; social media; adverse event; cannabis safety; dictionary; rule-based classifier; medical; health-related; conversation; codebook;
D O I
10.2196/35027
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: The cannabis product and regulatory landscape is changing in the United States. Against the backdrop of these changes, there have been increasing reports on health-related motives for cannabis use and adverse events from its use. The use of social media data in monitoring cannabis-related health conversations may be useful to state- and federal-level regulatory agencies as they grapple with identifying cannabis safety signals in a comprehensive and scalable fashion. Objective: This study attempted to determine the extent to which a medical dictionary-the Unified Medical Language System Consumer Health Vocabulary-could identify cannabis-related motivations for use and health consequences of cannabis use based on Twitter posts in 2020. Methods: Twitter posts containing cannabis-related terms were obtained from January 1 to August 31, 2020. Each post from the sample (N=353,353) was classified into at least 1 of 17 a priori categories of common health-related topics by using a rule-based classifier. Each category was defined by the terms in the medical dictionary. A subsample of posts (n=1092) was then manually annotated to help validate the rule-based classifier and determine if each post pertained to health-related motivations for cannabis use, perceived adverse health effects from its use, or neither. Results: The validation process indicated that the medical dictionary could identify health-related conversations in 31.2% (341/1092) of posts. Specifically, 20.4% (223/1092) of posts were accurately identified as posts related to a health-related motivation for cannabis use, while 10.8% (118/1092) of posts were accurately identified as posts related to a health-related consequence from cannabis use. The health-related conversations about cannabis use included those about issues with the respiratory system, stress to the immune system, and gastrointestinal issues, among others. Conclusions: The mining of social media data may prove helpful in improving the surveillance of cannabis products and their adverse health effects. However, future research needs to develop and validate a dictionary and codebook that capture cannabis use-specific health conversations on Twitter.
引用
收藏
页数:6
相关论文
共 21 条