共 35 条
Optimal drug-dosing of cancer dynamics with fuzzy reinforcement learning and discontinuous reward function
被引:1
作者:
Treesatayapun, Chidentree
[1
]
Munoz-Vazquez, Aldo Jonathan
[2
]
机构:
[1] Walailak Univ, Fac Engn, 222 Thaiburi, Nakhon Si Thammarat 80161, Thailand
[2] Texas A&M Univ, Coll Engn, Higher Educ Ctr McAllen, 6200 Tres Lagos Blvd, College Stn, TX 78504 USA
关键词:
Chemotherapy drug administration;
Optimal control;
Fuzzy-rules network;
Reinforcement learning;
Discontinuous reward function;
MODEL-PREDICTIVE CONTROL;
CHEMOTHERAPY;
IDENTIFICATION;
SYSTEMS;
D O I:
10.1016/j.engappai.2023.105851
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
In this paper, a reinforcement learning-based optimal control is developed for the drug administration of biological phenomena in chemotherapy cancer treatment. The treatment is considered as a class of unknown discrete-time systems when the input: drug administration and the output: tumor cells population are only utilized to design the proposed controller. Resulting, a full-state observer is completely neglected. The controller is established by the actor-critic architecture containing two fuzzy-rules emulated networks when IF -THEN rules are imposed by human knowledge according to pharmacokinetic and pharmacodynamic behavior. Furthermore, the discontinuous reward function is proposed to derive the online learning laws that guarantee the robustness and the convergence of adjustable parameters. The validation results are conducted by numerical systems according to the robustness of the group of patients and the closed-loop performance altogether with comparative results.
引用
收藏
页数:11
相关论文