A singular Riemannian geometry approach to Deep Neural Networks I. Theoretical foundations

被引:3
作者
Benfenati, Alessandro [1 ,2 ]
Marta, Alessio [3 ,4 ,5 ]
机构
[1] Univ Milan, Environm Sci & Policy Dept, Via Celoria 2, I-20133 Milan, Italy
[2] INDAM, Grp Nazl Calcolo Sci, Rome, Italy
[3] Univ Pavia, Dipartimento Sci Sistema Nervoso & Comportamento, Via Agostino Bassi 21, I-27100 Pavia, Italy
[4] INDAM, Grp Nazl Fis Matemat, Rome, Italy
[5] Ist Nazl Fis Nucl, INFN, Sez Milano, Rome, Italy
关键词
Deep learning; Riemann geometry; Classification; Degenerate metrics; Neural networks;
D O I
10.1016/j.neunet.2022.11.022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Neural Networks are widely used for solving complex problems in several scientific areas, such as speech recognition, machine translation, image analysis. The strategies employed to investigate their theoretical properties mainly rely on Euclidean geometry, but in the last years new approaches based on Riemannian geometry have been developed. Motivated by some open problems, we study a particular sequence of maps between manifolds, with the last manifold of the sequence equipped with a Riemannian metric. We investigate the structures induced through pullbacks on the other manifolds of the sequence and on some related quotients. In particular, we show that the pullbacks of the final Riemannian metric to any manifolds of the sequence is a degenerate Riemannian metric inducing a structure of pseudometric space. We prove that the Kolmogorov quotient of this pseudometric space yields a smooth manifold, which is the base space of a particular vertical bundle. We investigate the theoretical properties of the maps of such sequence, eventually we focus on the case of maps between manifolds implementing neural networks of practical interest and we present some applications of the geometric framework we introduced in the first part of the paper.(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页码:331 / 343
页数:13
相关论文
共 51 条
[1]   Comparing Fusion Models for DNN-Based Audiovisual Continuous Speech Recognition [J].
Abdelaziz, Ahmed Hussen .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (03) :475-484
[2]  
Aggarwal C.C., 2018, Neural networks and deep learning, V10, P978, DOI [DOI 10.1007/978-3-319-94463-03, 10.1007/978-3-319-94463-0]
[3]  
Ahmed E, 2019, Arxiv, DOI arXiv:1808.01462
[4]  
[Anonymous], 2003, Smooth Manifolds, DOI [DOI 10.1007/978-0-387-21752-91, 10.1007/978-0-387-21752-9_1, DOI 10.1007/978-0-387-21752-9_1, 10.1007/978-0-387-21752-9]
[5]  
Aslam Yasir, 2019, 2019 International Conference on Smart Systems and Inventive Technology (ICSSIT). Proceedings, P709, DOI 10.1109/ICSSIT46314.2019.8987922
[6]   DEGENERATE RIEMANNIAN METRICS [J].
BELKO, IV .
MATHEMATICAL NOTES, 1975, 18 (5-6) :1046-1049
[7]  
Benfenati A., 2022, SINGULAR RIEMANNIAN, DOI [10.1016/j.neunet.2022.11.026, DOI 10.1016/J.NEUNET.2022.11.026]
[8]  
Bishop C., 2006, Pattern Recognition and Machine Learning
[9]  
Bortolotti E., 1930, ATTI ACCAD NAZ LIN, V12, P541
[10]  
Boscaini D., 2016, Advances in neural information processing systems, V29