50 entries in total
- [48] Risk Aversion Operator for Addressing Maximization Bias in Q-Learning. IEEE Access, 2020, 8: 43098-43110
- [49] UCB Momentum Q-learning: Correcting the bias without forgetting. International Conference on Machine Learning, Vol 139, 2021, 139