Analysis of substance use and its outcomes by machine learning I. Childhood evaluation of liability to substance use disorder

被引:28
作者
Jing, Yankang [1 ,3 ]
Hu, Ziheng [1 ,3 ]
Fan, Peihao [1 ,3 ]
Xue, Ying [1 ,3 ]
Wang, Lirong [1 ,3 ]
Tarter, Ralph E. [2 ]
Kirisci, Levent [2 ]
Wang, Junmei [1 ,3 ]
Vanyukov, Michael [2 ]
Xie, Xiang-Qun [1 ,3 ]
机构
[1] Univ Pittsburgh, Sch Pharm, Dept Pharmaceut Sci, Computat Chem Genom Screen Ctr, 3501 Terrace St, Pittsburgh, PA 15213 USA
[2] Univ Pittsburgh, Sch Pharm, Dept Pharmaceut Sci, 3501 Terrace St, Pittsburgh, PA 15213 USA
[3] Univ Pittsburgh, Dept Pharmaceut Sci, Sch Pharm, NIDA Natl Ctr Excellence Computat Drug Abuse Res, 3501 Terrace S4, Pittsburgh, PA 15213 USA
基金
美国国家卫生研究院;
关键词
Substance use disorder; Machine learning; Substance abuse prevention; Big data; Screening addiction risk; NEURAL-NETWORKS; DISINHIBITION; COOCCURRENCE; PREDICTION; DECISION; FUTURE; RISK;
D O I
10.1016/j.drugalcdep.2019.107605
中图分类号
R194 [卫生标准、卫生检查、医药管理];
学科分类号
摘要
Background: Substance use disorder (SUD) exacts enormous societal costs in the United States, and it is important to detect high-risk youths for prevention. Machine learning (ML) is the method to find patterns and make prediction from data. We hypothesized that ML identifies the health, psychological, psychiatric, and contextual features to predict SUD, and the identified features predict high-risk individuals to develop SUD. Method: Male (N = 494) and female (N = 206) participants and their informant parents were administered a battery of questionnaires across five waves of assessment conducted at 10-12, 12-14, 16, 19, and 22 years of age. Characteristics most strongly associated with SUD were identified using the random forest (RF)algorithm from approximately 1000 variables measured at each assessment. Next, the complement of features was validated, and the best models were selected for predicting SUD using seven ML algorithms. Lastly, area under the receiver operating characteristic curve (AUROC) evaluated accuracy of detecting individuals who develop SUD + /- up to thirty years of age. Results: Approximately thirty variables strongly predict SUD. The predictors shift from psychological dysregulation and poor health behavior in late childhood to non-normative socialization in mid to late adolescence. In 10-12-year-old youths, the features predict SUD + /- with 74% accuracy, increasing to 86% at 22 years of age. The RF algorithm optimally detects individuals between 10-22 years of age who develop SUD compared to other ML algorithms. Conclusion: These findings inform the items required for inclusion in instruments to accurately identify high risk youths and young adults requiring SUD prevention.
引用
收藏
页数:6
相关论文
共 39 条
  • [1] Empirically based assessment and taxonomy: Applications to clinical research
    Achenbach, TM
    [J]. PSYCHOLOGICAL ASSESSMENT, 1995, 7 (03) : 261 - 274
  • [2] Use of a machine learning framework to predict substance use disorder treatment success
    Acion, Laura
    Kelmansky, Diana
    van der Laan, Mark
    Sahker, Ethan
    Jones, DeShauna
    Arndt, Stephan
    [J]. PLOS ONE, 2017, 12 (04):
  • [3] AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION
    ALTMAN, NS
    [J]. AMERICAN STATISTICIAN, 1992, 46 (03) : 175 - 185
  • [4] [Anonymous], PATTERN RECOGN LETT
  • [5] [Anonymous], 2006, Pattern Recognition and Machine Learning
  • [6] [Anonymous], 2018, NAT SURV DRUG US HLT
  • [7] Nearest neighbor imputation algorithms: a critical evaluation
    Beretta, Lorenzo
    Santaniello, Alessandro
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2016, 16
  • [8] Defining Substance Use Disorders: The Need for Peripheral Biomarkers
    Bough, Kristopher J.
    Pollock, Jonathan D.
    [J]. TRENDS IN MOLECULAR MEDICINE, 2018, 24 (02) : 109 - 120
  • [9] Breiman L., 2001, Mach. Learn., V45, P5
  • [10] Machine Learning and Prediction in Medicine - Beyond the Peak of Inflated Expectations
    Chen, Jonathan H.
    Asch, Steven M.
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2017, 376 (26) : 2507 - 2509