共 50 条
[21]
A proof of convergence for stochastic gradient descent in the training of artificial neural networks with ReLU activation for constant target functions
[J].
ZEITSCHRIFT FUR ANGEWANDTE MATHEMATIK UND PHYSIK,
2022, 73 (05)
[22]
CONVERGENCE TIME ON THE RS MODEL FOR NEURAL NETWORKS
[J].
INTERNATIONAL JOURNAL OF MODERN PHYSICS C,
1991, 2 (03)
:711-717
[25]
Variable Order Fractional Gradient Descent Method and Its Application in Neural Networks Optimization
[J].
2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC,
2022,
:109-114
[26]
A new spectral conjugate gradient method for unconstrained optimization and its application in neural networks
[J].
JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE-JMCS,
2025, 36 (03)
:326-332
[28]
Weight and Gradient Centralization in Deep Neural Networks
[J].
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV,
2021, 12894
:227-239