Optimal drug-dosing of cancer dynamics with fuzzy reinforcement learning and discontinuous reward function

被引:1
作者
Treesatayapun, Chidentree [1 ]
Munoz-Vazquez, Aldo Jonathan [2 ]
机构
[1] Walailak Univ, Fac Engn, 222 Thaiburi, Nakhon Si Thammarat 80161, Thailand
[2] Texas A&M Univ, Coll Engn, Higher Educ Ctr McAllen, 6200 Tres Lagos Blvd, College Stn, TX 78504 USA
关键词
Chemotherapy drug administration; Optimal control; Fuzzy-rules network; Reinforcement learning; Discontinuous reward function; MODEL-PREDICTIVE CONTROL; CHEMOTHERAPY; IDENTIFICATION; SYSTEMS;
D O I
10.1016/j.engappai.2023.105851
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a reinforcement learning-based optimal control is developed for the drug administration of biological phenomena in chemotherapy cancer treatment. The treatment is considered as a class of unknown discrete-time systems when the input: drug administration and the output: tumor cells population are only utilized to design the proposed controller. Resulting, a full-state observer is completely neglected. The controller is established by the actor-critic architecture containing two fuzzy-rules emulated networks when IF -THEN rules are imposed by human knowledge according to pharmacokinetic and pharmacodynamic behavior. Furthermore, the discontinuous reward function is proposed to derive the online learning laws that guarantee the robustness and the convergence of adjustable parameters. The validation results are conducted by numerical systems according to the robustness of the group of patients and the closed-loop performance altogether with comparative results.
引用
收藏
页数:11
相关论文
共 35 条
  • [1] Shape-independent model predictive control for Takagi-Sugeno fuzzy systems
    Arino, Carlos
    Querol, Andres
    Sala, Antonio
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 65 : 493 - 505
  • [2] Personalized drug administration for cancer treatment using Model Reference Adaptive Control
    Babaei, Naser
    Salamci, Metin U.
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2015, 371 : 24 - 44
  • [3] Optimal chemotherapy in cancer treatment: state dependent Riccati equation control and extended Kalman filter
    Batmani, Yazdan
    Khaloozadeh, Hamid
    [J]. OPTIMAL CONTROL APPLICATIONS & METHODS, 2013, 34 (05) : 562 - 577
  • [4] Deep reinforcement learning to study spatial navigation, learning and memory in artificial and biological agents
    Bermudez-Contreras, Edgar
    [J]. BIOLOGICAL CYBERNETICS, 2021, 115 (02) : 131 - 134
  • [5] Fuzzy adaptive dynamic programming-based optimal leader-following consensus for heterogeneous nonlinear multi-agent systems
    Cai, Yuliang
    Zhang, Huaguang
    Zhang, Kun
    Liu, Chong
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (13) : 8763 - 8781
  • [6] A comparative study on classification of magnetoencephalography signals using probabilistic neural network and multilayer neural network
    Cetin, Onursal
    Temurtas, Feyzullah
    [J]. SOFT COMPUTING, 2021, 25 (03) : 2267 - 2275
  • [7] Optimal dosing of cancer chemotherapy using model predictive control and moving horizon state/parameter estimation
    Chen, Tao
    Kirkby, Norman F.
    Jena, Raj
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2012, 108 (03) : 973 - 983
  • [8] Optimal control strategy for cancer remission using combinatorial therapy: A mathematical model-based approach
    Das, Parthasakha
    Das, Samhita
    Das, Pritha
    Rihan, Fathalla A.
    Uzuntarla, Muhammet
    Ghosh, Dibakar
    [J]. CHAOS SOLITONS & FRACTALS, 2021, 145
  • [9] Application of gene expression programming and sensitivity analyses in analyzing effective parameters in gastric cancer tumor size and location
    Dorosti, Shadi
    Ghoushchi, Saeid Jafarzadeh
    Sobhrakhshankhah, Elham
    Ahmadi, Mohsen
    Sharifi, Abbas
    [J]. SOFT COMPUTING, 2020, 24 (13) : 9943 - 9964
  • [10] Neural-network-based model predictive control for consensus of nonlinear systems
    Floriano, Bruno R. O.
    Vargas, Alessandro N.
    Ishihara, Joao Y.
    Ferreira, Henrique C.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 116