Classification models for the prediction of clinicians' information needs

Cited by: 15

Authors:

Del Fiol, Guilherme [1,2]

Haug, Peter J. [1,2]

Affiliations:

[1] Univ Utah, Biomed Informat Dept, Salt Lake City, UT 84120 USA

[2] Intermt Healthcare, Salt Lake City, UT USA

Source:

JOURNAL OF BIOMEDICAL INFORMATICS | 2009 / Volume 42 / Issue 01

Keywords:

Information storage and retrieval; Machine learning; Clinical decision support systems; Infobuttons; Web usage mining; Information needs; RETRIEVAL;

DOI:

10.1016/j.jbi.2008.07.001

Chinese Library Classification:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

Objective: Clinicians face numerous information needs during patient care activities and most of these needs are not met. Infobuttons are information retrieval tools that help clinicians to fulfill their information needs by providing links to on-line health information resources from within an electronic medical record (EMR) system. The aim of this study was to produce classification models based on medication infobutton usage data to predict the medication-related content topics (e.g., dose, adverse effects, drug interactions, patient education) that a clinician is most likely to choose while entering medication orders in a particular clinical context. Design: We prepared a dataset with 3078 infobutton sessions and 26 attributes describing characteristics of the user, the medication, and the patient. In these sessions, users selected one out of eight content topics. Automatic attribute selection methods were then applied to the dataset to eliminate redundant and useless attributes. The reduced dataset was used to produce nine classification models from a set of state-of-the-art machine learning algorithms. Finally, the performance of the models was measured and compared. Measurements: Area under the ROC curve (AUC) and agreement (kappa) between the content topics predicted by the models and those chosen by clinicians in each infobutton session. Results: The performance of the models ranged from 0.49 to 0.56 (kappa). The AUC of the best model ranged from 0.73 to 0.99. The best performance was achieved when predicting choice of the adult dose, pediatric dose, patient education, and pregnancy category content topics. Conclusion: The results suggest that classification models based on infobutton usage data are a promising method for the prediction of content topics that a clinician would choose to answer patient care questions while using an EMR system. (C) 2008 Elsevier Inc. All rights reserved.
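The kappa agreement measure named in the abstract can be illustrated with a minimal sketch. This is not the authors' code: it assumes the statistic is the standard Cohen's kappa, and the function name `cohen_kappa` and the six toy sessions are hypothetical, invented only to show how chance-corrected agreement between predicted and clinician-chosen topics is computed.

```python
from collections import Counter


def cohen_kappa(chosen, predicted):
    """Cohen's kappa: agreement between two label sequences, corrected for chance."""
    if len(chosen) != len(predicted) or not chosen:
        raise ValueError("sequences must be non-empty and of equal length")
    n = len(chosen)
    # Observed agreement: fraction of sessions where both labels match.
    observed = sum(c == p for c, p in zip(chosen, predicted)) / n
    # Expected agreement by chance, from the marginal label frequencies.
    chosen_counts = Counter(chosen)
    predicted_counts = Counter(predicted)
    expected = sum(
        chosen_counts[label] * predicted_counts[label] for label in chosen_counts
    ) / (n * n)
    if expected == 1.0:
        return 1.0  # both sequences consist of one identical label
    return (observed - expected) / (1 - expected)


# Hypothetical toy sessions (not data from the study).
chosen = ["adult dose", "adult dose", "patient education",
          "pediatric dose", "adult dose", "patient education"]
predicted = ["adult dose", "patient education", "patient education",
             "pediatric dose", "adult dose", "adult dose"]
kappa = cohen_kappa(chosen, predicted)
print(round(kappa, 3))
```

In this toy case four of six sessions agree (observed agreement 0.667) while chance alone would give 0.389, so kappa is lower than the raw agreement rate.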

Pages: 82-89

Page count: 8
