Building risk prediction models for daily use of marijuana using machine learning techniques

被引:12
作者
Parekh, Tarang [1 ]
Fahim, Farhan [1 ]
机构
[1] George Mason Univ, Dept Hlth Adm & Policy, Fairfax, VA 22030 USA
关键词
Marijuana; Cannabis; Machine learning; Prediction; Prevalence; CANNABIS USE; MEDICAL MARIJUANA; OBESITY; PSYCHOSIS; INDEX;
D O I
10.1016/j.drugalcdep.2021.108789
中图分类号
R194 [卫生标准、卫生检查、医药管理];
学科分类号
摘要
Identifying the characteristics of adults with recent marijuana use is limited by standard statistical methods and requires a unique approach. The objective of this study is to evaluate the efficiency of machine learning models in predicting daily marijuana use and identify factors associated with daily use among adults. The study analyzed pooled data from the 2016-2019 Behavioral Risk Factor Surveillance System (BRFSS) Survey in 2020. Prediction models were developed using four machine learning algorithms, including Logistic Regression, Decision Tree, and Random Forest with Gini function, and Naive Bayes. Respondents were randomly divided into training and testing samples. The performance of all the models was compared using accuracy, AUC, precision, and recall. The study included 253,569 respondents, of whom 10,182 (5.9 %) reported daily marijuana use in the last 30 days. Of daily marijuana use, 53.4 % were young adults (age 18 34 years), 34.3 % female, 56.1 % non-Hispanic White, 15.2 % were college graduates, and 67.3 % were employed. Random Forest was the best performing model with AUC 0.97, followed by a Decision tree (AUC 0.95). The most important factors for daily marijuana use were the current use of e-cigarette and combustible cigarette use, male gender, unmarried, poor mental health, depression, cognitive decline, abnormal sleep pattern, and high-risk behavior. Data mining methods were useful in the discovery of behavior health-risk knowledge and to visualize the significance of predicting modeling from a multidimensional behavioral health survey.
引用
收藏
页数:6
相关论文
共 41 条
[1]   Association Between Cannabis Use and Risk for Diabetic Ketoacidosis in Adults With Type 1 Diabetes [J].
Akturk, Halis K. ;
Taylor, Daniel D. ;
Camsari, Ulas M. ;
Rewers, Amanda ;
Kinney, Gregory L. ;
Shah, Viral N. .
JAMA INTERNAL MEDICINE, 2019, 179 (01) :115-118
[2]  
[Anonymous], Where to Write for Vital Records-Texas
[3]   Factors Associated with E-Cigarette Use in US Young Adult Never Smokers of Conventional Cigarettes: A Machine Learning Approach [J].
Atuegwu, Nkiruka C. ;
Oncken, Cheryl ;
Laubenbacher, Reinhard C. ;
Perez, Mario F. ;
Mortensen, Eric M. .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2020, 17 (19) :1-17
[4]   Dose-related neurocognitive effects of marijuana use [J].
Bolla, KI ;
Brown, K ;
Eldreth, D ;
Tate, K ;
Cadet, JL .
NEUROLOGY, 2002, 59 (09) :1337-1343
[5]   Marijuana-Related Posts on Instagram [J].
Cavazos-Rehg, Patricia A. ;
Krauss, Melissa J. ;
Sowles, Shaina J. ;
Bierut, Laura J. .
PREVENTION SCIENCE, 2016, 17 (06) :710-720
[6]  
CDC, 2019, CTR DIS CONTROL PREV, DOI DOI 10.15620/CDC:82532
[7]  
Centers For Disease Control And Prevention, 2018, Center for Disease Control and Prevention
[8]   Association Between Recreational Marijuana Legalization in the United States and Changes in Marijuana Use and Cannabis Use Disorder From 2008 to 2016 [J].
Cerda, Magdalena ;
Mauro, Christine ;
Hamilton, Ava ;
Levy, Natalie S. ;
Santaella-Tenorio, Julian ;
Hasin, Deborah ;
Wall, Melanie M. ;
Keyes, Katherine M. ;
Martins, Silvia S. .
JAMA PSYCHIATRY, 2020, 77 (02) :165-171
[9]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[10]   Nonmedical Versus Medical Marijuana Use Among Three Age Groups of Adults: Associations With Mental and Physical Health Status [J].
Choi, Namkee G. ;
DiNitto, Diana M. ;
Marti, Carl Nathan .
AMERICAN JOURNAL ON ADDICTIONS, 2017, 26 (07) :697-706