Machine learning algorithms to classify self-harm behaviours in New South Wales Ambulance electronic medical records: A retrospective study

Cited by: 2

Authors:

Burnett, Alexander [1]

Chen, Nicola [3,4]

Zeritis, Stephanie [1]

Ware, Sandra [5]

McGillivray, Lauren [1,2]

Shand, Fiona [1,2]

Torok, Michelle [1,2]

Affiliations:

[1] Black Dog Inst, Hosp Rd, Randwick, NSW 2032, Australia

[2] Univ New South Wales, Kensington, NSW, Australia

[3] Orygen, Parkville, Vic, Australia

[4] Univ Melbourne, Melbourne, Vic, Australia

[5] NSW Ambulance, Rozelle, NSW, Australia

Source:

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS | 2022 / Vol. 161

Keywords:

Suicidal behaviour; Epidemiology; Machine learning; Natural language processing; Population surveillance; SCORE;

DOI:

10.1016/j.ijmedinf.2022.104734

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Background: There is increasing interest in suicide surveillance solutions to identify non-fatal suicidal and self-harming behaviours in the Australian community that are not currently captured through national administrative datasets. Objective: The aim of the present study was to develop machine learning models to classify self-harm related behaviours using unstructured clinical note text from New South Wales (NSW) Ambulance data and compare their performance via traditional methods. Methods: Primary data were derived from NSW Ambulance electronic medical records (eMRs) for potential self-harm related NSW Ambulance attendances for the period 2013-2019. Data included paramedic clinical notes detailing the nature of the attendance, clinical outcome, and narrative information. We assessed sensitivity, specificity, positive predictive value, negative predictive value, F-score, and the Matthews correlation coefficient (MCC) for four algorithms (Support Vector Machine, random forest, decision tree, and logistic regression). Results: The performance of these algorithms was compared using the MCC measure. In a test sample of 3157 ambulance attendances (1349 self-harm related behaviours and 1808 unrelated), the MCC for classification of self-harm related behaviour ranged from +0.681 to +0.730. The Support Vector Machine (sensitivity = 82.7%, specificity = 89.6%, MCC = 0.730) and the logistic regression (sensitivity = 83.1%, specificity = 89.3%, MCC = 0.727) models performed best. Conclusions: This study demonstrates that machine learning models can be applied to paramedic notes within unstructured medical records to classify self-harm related behaviours. The resulting model could be used to complement current manual abstraction of self-harm behaviours and provide more timely approximations to be used for self-harm surveillance.
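
The evaluation measures named in the abstract can all be derived from a binary confusion matrix. The following is a minimal sketch of those standard definitions, not the authors' code: the function name `classification_metrics` and the counts passed to it are hypothetical and are not taken from the study, and the F-score is assumed to be the balanced F1 variant, which the abstract does not specify.

```python
import math


def classification_metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Standard binary classification measures from confusion-matrix counts.

    Assumes every margin (tp+fn, tn+fp, tp+fp, tn+fn) is non-zero.
    """
    sensitivity = tp / (tp + fn)   # true positive rate (recall)
    specificity = tn / (tn + fp)   # true negative rate
    ppv = tp / (tp + fp)           # positive predictive value (precision)
    npv = tn / (tn + fn)           # negative predictive value
    # F1: harmonic mean of precision and recall (assumed variant of "F-score").
    f_score = 2 * ppv * sensitivity / (ppv + sensitivity)
    # Matthews correlation coefficient: ranges from -1 to +1, 0 is chance level.
    mcc_den = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / mcc_den if mcc_den else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": ppv,
        "npv": npv,
        "f_score": f_score,
        "mcc": mcc,
    }


# Hypothetical counts for illustration only (not data from the study).
metrics = classification_metrics(tp=80, fp=10, tn=90, fn=20)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Unlike sensitivity or specificity alone, the MCC uses all four cells of the confusion matrix, which may be why it suits a single-number comparison of models on a test set with unequal class sizes.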


Pages: 8

