An enhanced learning algorithm with a particle filter-based gradient descent optimizer method

被引:14
作者
Kamsing, Patcharin [1 ]
Torteeka, Peerapong [2 ]
Yooyen, Soemsak [1 ]
机构
[1] King Mongkuts Inst Technol Ladkrabang, Int Acad Avit Ind, Dept Aeronaut Engn, Bangkok 10520, Thailand
[2] Natl Astron Res Inst Thailand, Chiang Mai 50180, Thailand
关键词
Gradient descent; Optimizer; Particle filter; Neural network; Deep learning;
D O I
10.1007/s00521-020-04726-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This experiment integrates a particle filter concept with a gradient descent optimizer to reduce loss during iteration and obtains a particle filter-based gradient descent (PF-GD) optimizer that can determine the global minimum with excellent performance. Four functions are applied to test optimizer deployment to verify the PF-GD method. Additionally, the Modified National Institute of Standards and Technology (MNIST) database is used to test the PF-GD method by implementing a logistic regression learning algorithm. The experimental results obtained with the four functions illustrate that the PF-GD method performs much better than the conventional gradient descent optimizer, although it has some parameters that must be set before modeling. The results of implementing the MNIST dataset demonstrate that the cross-entropy of the PF-GD method exhibits a smaller decrease than that of the conventional gradient descent optimizer, resulting in higher accuracy of the PF-GD method. The PF-GD method provides the best accuracy for the training model, 97.00%, and the accuracy of evaluating the model with the test dataset is 90.37%, which is higher than the accuracy of 90.08% obtained with the conventional gradient descent optimizer.
引用
收藏
页码:12789 / 12800
页数:12
相关论文
共 37 条
[1]   Radio frequency interference mitigation using deep convolutional neural networks [J].
Akeret, J. ;
Chang, C. ;
Lucchi, A. ;
Refregier, A. .
ASTRONOMY AND COMPUTING, 2017, 18 :35-39
[2]  
[Anonymous], 2019, The mnist database of handwritten digits
[3]  
Bello I, 2017, PR MACH LEARN RES, V70
[4]  
Bittner B, 2004, INT CONF ACOUST SPEE, P709
[5]  
Chernodub A. N., 2014, Opt. Mem. Neural Netw, V23, P96, DOI DOI 10.3103/S1060992X14020088
[6]  
Fu GX, 2018, CHIN CONT DECIS CONF, P3687, DOI 10.1109/CCDC.2018.8407763
[7]  
Gunnemann N., 2017, Proceedings of Machine Learning Research, P92
[8]   Soft computing applications in customer segmentation: State-of-art review and critique [J].
Hiziroglu, Abdulkadir .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (16) :6491-6507
[9]   Improvement of Maximum Variance Weight Partitioning Particle Filter in Urban Computing and Intelligence [J].
Huang, Li ;
Fu, Qiaobo ;
Li, Gongfa ;
Luo, Bowen ;
Chen, Disi ;
Yu, Hui .
IEEE ACCESS, 2019, 7 :106527-106535
[10]   The dynamics of wetland cover change using a state estimation technique applied to time-series remote sensing imagery [J].
Insom, Patcharin ;
Cao, Chunxiang ;
Boonsrimuang, Pisit ;
Torteeka, Peerapong ;
Boonprong, Sornkitja ;
Liu, Di ;
Chen, Wei .
GEOMATICS NATURAL HAZARDS & RISK, 2017, 8 (02) :1662-1677