- [21] GRAVES A., Generating sequences with recurrent neural networks[DB/OL], (2013)
- [22] KINGMA D P, BA J., Adam: A method for stochastic optimization[DB/OL], (2014)
- [23] LUO L C, LIU Y, et al., Adaptive gradient methods with dynamic bound of learning rate[DB/OL], (2019)
- [24] ZHUANG J, TANG T, DING Y, et al., AdaBelief optimizer: Adapting stepsizes by the belief in observed gradients[J], Advances in Neural Information Processing Systems, 33, pp. 18795-18806, (2020)
- [25] SHAO Z, LIN T., A new adaptive gradient method with gradient decomposition[DB/OL]
- [26] ZHANG H Y, CISSE M, DAUPHIN Y N, et al., mixup: Beyond empirical risk minimization[DB/OL], (2017)
- [27] DUBEY S R, CHAKRABORTY S, ROY S K, et al., diffGrad: An optimization method for convolutional neural networks[J], IEEE Transactions on Neural Networks and Learning Systems, 31, 11, pp. 4500-4511, (2020)
- [28] ELFWING S, UCHIBE E, DOYA K., Sigmoid-weighted linear units for neural network function approximation in reinforcement learning[J], Neural Networks, 107, pp. 3-11, (2018)
- [29] LIN T Y, GOYAL P, et al., Focal loss for dense object detection[J], IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 318-327, (2018)
- [30] EVERINGHAM M, VAN GOOL L, WILLIAMS C K I, et al., The PASCAL visual object classes (VOC) challenge[J], International Journal of Computer Vision, 88, 2, pp. 303-338, (2010)