A reinforcement learning approach for individualizing erythropoietin dosages in hemodialysis patients

被引：39

作者：

Martin-Guerrero, Jose D. ^{[1
]}

Gomez, Faustino ^{[2
]}

Soria-Olivas, Emilio ^{[1
]}

Schmidhuber, Juergen ^{[2
]}

Climente-Marti, Monica ^{[3
]}

Victor Jimenez-Torres, N. ^{[3
,4
]}

机构：

[1] Univ Valencia, Dept Elect Engn, E-46100 Valencia, Spain

[2] IDSIA, Lugano, Switzerland

[3] Univ Hosp, Pharm Unit, Valencia, Spain

[4] Univ Valencia, Pharm & Pharmaceut Technol Dept, E-46100 Valencia, Spain

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2009年 / 36卷 / 06期

关键词：

Erythropoietin; Anemia; Chronic renal failure; Clinical pharmacokinetics; Reinforcement learning; ANEMIA;

D O I：

10.1016/j.eswa.2009.02.041

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a reinforcement learning (RL) approach for anemia management in patients undergoing chronic renal failure. Erythropoietin (EPO) is the treatment of choice for this kind of anemia but it is an expensive drug and with some dangerous side-effects that should be considered especially for patients who do not respond to the treatment. Therefore, an individualized treatment appears to be necessary. RL is a suitable approach to tackle this problem. Moreover, resulting policies are similar to medical protocols, and hence, they can easily be transferred to daily practice. A cohort of 64 patients are included in the study. An implementation of the Q-learning algorithm based on a state-aggregation table and another implementation using the multi-layer perceptron as a function approximator (Q-MLP) are compared with the protocols followed in the Nephrology Unit. The policy obtained by the Q-MLP approach outperforms the hospital policy in terms of the ratio of patients that are within the targeted range of hemoglobin (11.5-12.5 g/dl) at the end of the analyzed period, since an increase of 25% is observed. It ensures an improvement in patients' quality-of-life and considerable economic savings for the health care system due to both the expensiveness of EPO treatment and the costs incurred by the health care system in order to alleviate problems related to EPO over-dosing. It should be pointed Out that the approach presented here is completely general, and therefore, it can be applied to any problem of drug dosage optimization. (C) 2009 Elsevier Ltd. All rights reserved.

引用

页码：9737 / 9742

页数：6

共 13 条

[1]

BELLAZZI R, 1994, HAEMATOLOGICA, V79, P154

[2]

BELLAZZI R, 1992, COMPUTERS BIOMEDICAL, V26, P274

[3]

Bertsekas Dimitri, 1996, Neuro dynamic programming

[4]

BERTSEKAS DP, 2003, LIDSP2573 MIT

[5]

COHEN H, 2005, ANEMIA CANC CHRONIC

[6] Individualization of pharmacological anemia management using reinforcement learning [J].

Gaweda, AE ;

Muezzinoglu, MK ;

Aronoff, GR ;

Jacobs, AA ;

Zurada, JM ;

Brier, ME .

NEURAL NETWORKS, 2005, 18 (5-6) :826-834

[7] Use of neural networks for dosage individualisation of erythropoietin in patients with secondary anemia to chronic renal failure [J].

Guerrero, JDM ;

Olivas, ES ;

Valls, GC ;

López, AJS ;

Ruixo, JJN ;

Torres, NVJ .

COMPUTERS IN BIOLOGY AND MEDICINE, 2003, 33 (04) :361-373

[8]

Jacobs Alfred A., 2001, Journal of the American Society of Nephrology, V12, p387A

[9] Dosage individualization of erythropoietin using a profile-dependent support vector regression [J].

Martín-Guerrero, JD ;

Camps-Valls, G ;

Soria-Olivas, E ;

Serrano-López, AJ ;

Pérez-Ruixo, JJ ;

Jiménez-Torres, NV .

IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2003, 50 (10) :1136-1142

[10]

Peterson L. Lynne, 2004, TRENDS MED, P1

← 1 2 →