Prescribed attractivity region selection for recurrent neural networks based on deep reinforcement learning
被引:1
作者:
Bao, Gang
论文数: 0引用数: 0
h-index: 0
机构:
China Three Gorges Univ, Hubei Key Lab Cascaded Hydropower Stat Operat & C, Yichang 443002, Peoples R ChinaChina Three Gorges Univ, Hubei Key Lab Cascaded Hydropower Stat Operat & C, Yichang 443002, Peoples R China
Bao, Gang
[1
]
Song, Zhenyan
论文数: 0引用数: 0
h-index: 0
机构:
China Three Gorges Univ, Hubei Key Lab Cascaded Hydropower Stat Operat & C, Yichang 443002, Peoples R ChinaChina Three Gorges Univ, Hubei Key Lab Cascaded Hydropower Stat Operat & C, Yichang 443002, Peoples R China
Song, Zhenyan
[1
]
Xu, Rui
论文数: 0引用数: 0
h-index: 0
机构:
China Three Gorges Univ, Hubei Key Lab Cascaded Hydropower Stat Operat & C, Yichang 443002, Peoples R ChinaChina Three Gorges Univ, Hubei Key Lab Cascaded Hydropower Stat Operat & C, Yichang 443002, Peoples R China
Xu, Rui
[1
]
机构:
[1] China Three Gorges Univ, Hubei Key Lab Cascaded Hydropower Stat Operat & C, Yichang 443002, Peoples R China
Recurrent neural networks;
Attractivity region selection;
Deep reinforcement learning;
GLOBAL EXPONENTIAL STABILITY;
TIME-VARYING DELAYS;
DESIGN;
D O I:
10.1007/s00521-023-09191-8
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
Recurrent neural networks' (RNNs') outputs are the same when network states converge to the same saturation region. Strong external inputs can cause the neural network to converge to a prescribed saturation region. Different from previous works, this paper employs deep reinforcement learning to obtain external inputs to make network states converge to the desired saturation region. Firstly, for five-dimensional neural networks, the deep Q learning (DQN) algorithm is used to compute the optimal external inputs that make the network state converge to the specified saturation region. When scaling to n-dimensional RNNs, the problem of dimensional disaster is encountered. Then, it proposes a batch computation of the external inputs to cope with the curse of dimensionality. At last, the proposed method is validated by numerical examples, and compared with existing methods, it shows that less conservative external inputs conditions can be obtained.