[41]
Wei C., Ma T., Data-dependent sample complexity of deep neural networks via Lipschitz augmentation, Advances in Neural Information Processing Systems, 32, (2019)
[42]
Kawaguchi K., Kaelbling L.P., Bengio Y., Generalization in Deep Learning, (2017)
[43]
Hochreiter S., Schmidhuber J., Flat minima, Neural Comput, 9, 1, pp. 1-42, (1997)
[44]
Bahri D., Mobahi H., Tay Y., Sharpness-Aware Minimization Improves Language Model Generalization, (2021)
[45]
Orvieto A., Kersting H., Proske F., Bach F., Lucchi A., Anticorrelated noise injection for improved generalization, International Conference on Machine Learning, pp. 17094-17116, (2022)
[46]
Chatterjee S., Zielinski P., On the generalization mystery in deep learning, (2022)
[47]
Bousquet O., Elisseeff A., Algorithmic stability and generalization performance, Advances in Neural Information Processing Systems, 13, (2000)
[48]
Wolpert D.H., The lack of a priori distinctions between learning algorithms, Neural Comput, 8, 7, pp. 1341-1390, (1996)
[49]
Solomonoff R.J., Algorithmic probability: Theory and applications, Information Theory and Statistical Learning, pp. 1-23, (2009)
[50]
Goldblum M., Finzi M., Rowan K., Wilson A.G., The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning, (2023)