Optimal drug-dosing of cancer dynamics with fuzzy reinforcement learning and discontinuous reward function

Cited by: 1

Authors:

Treesatayapun, Chidentree ^{[1]}

Munoz-Vazquez, Aldo Jonathan ^{[2]}

Affiliations:

[1] Walailak Univ, Fac Engn, 222 Thaiburi, Nakhon Si Thammarat 80161, Thailand

[2] Texas A&M Univ, Coll Engn, Higher Educ Ctr McAllen, 6200 Tres Lagos Blvd, College Stn, TX 78504 USA

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2023 / Vol. 120

Keywords:

Chemotherapy drug administration; Optimal control; Fuzzy-rules network; Reinforcement learning; Discontinuous reward function; MODEL-PREDICTIVE CONTROL; CHEMOTHERAPY; IDENTIFICATION; SYSTEMS;

DOI:

10.1016/j.engappai.2023.105851

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

In this paper, a reinforcement learning-based optimal controller is developed for drug administration in chemotherapy cancer treatment. The treatment is considered as a class of unknown discrete-time systems in which only the input (drug administration) and the output (tumor cell population) are used to design the proposed controller. As a result, no full-state observer is required. The controller is built on an actor-critic architecture containing two fuzzy-rules emulated networks, whose IF-THEN rules are set from human knowledge of pharmacokinetic and pharmacodynamic behavior. Furthermore, a discontinuous reward function is proposed to derive online learning laws that guarantee robustness and the convergence of the adjustable parameters. Validation is carried out on numerical systems, assessing robustness across a group of patients and closed-loop performance, together with comparative results.
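
This record gives no equations, so the following is only an illustrative sketch of the two ingredients the abstract names: a fuzzy-rules network whose output is a membership-weighted sum of adjustable parameters, and a reward that jumps between two levels. The Gaussian membership shape, the rule centers, the width, the tolerance band, and all function names are assumptions made for illustration, not the authors' formulation.

```python
import math

def memberships(x, centers, width):
    """Membership grade of x for each IF-THEN rule (Gaussian shape is an assumption)."""
    return [math.exp(-((x - c) / width) ** 2) for c in centers]

def fuzzy_output(x, centers, width, weights):
    """Network output: membership-weighted sum of the adjustable rule parameters."""
    phi = memberships(x, centers, width)
    return sum(w * p for w, p in zip(weights, phi))

def discontinuous_reward(error, tol):
    """Hypothetical two-level reward: 0 inside the tolerance band, 1 (a cost) outside."""
    return 0.0 if abs(error) <= tol else 1.0

# Example: three rules over a normalized tumor-population error in [-1, 1].
centers = [-1.0, 0.0, 1.0]   # hypothetical rule centers
weights = [0.8, 0.0, -0.8]   # hypothetical adjustable parameters
dose_signal = fuzzy_output(0.0, centers, 1.0, weights)
```

In an actor-critic arrangement, one such network would play the actor (producing the dose signal) and a second the critic (estimating the cost-to-go); the update laws themselves are not given in this record and are not reproduced here.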

Pages: 11

