Deep learning as Ricci flow

被引：0

作者：

Baptista, Anthony ^{[1
,2
,7
]}

Barp, Alessandro ^{[1
,8
]}

Chakraborti, Tapabrata ^{[1
]}

Harbron, Chris ^{[3
]}

Macarthur, Ben D. ^{[1
,4
,5
]}

Banerji, Christopher R. S. ^{[1
,6
,7
]}

机构：

[1] Alan Turing Inst, British Lib, London NW1 2DB, England

[2] Queen Mary Univ London, Sch Math Sci, London E1 4NS, England

[3] Roche Pharmaceut, Welwyn Garden City AL7 1TW, England

[4] Univ Southampton, Sch Math Sci, Southampton SO17 1BJ, England

[5] Univ Southampton, Fac Med, Southampton SO17 1BJ, England

[6] Univ Coll London Hosp, NHS Fdn Trust, London NW1 2BU, England

[7] Kings Coll London, Fac Life Sci & Med, Sch Canc & Pharmaceut Sci, Canc Bioinformat, London, England

[8] UCL, Fac Math & Phys Sci, London, England

来源：

SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期

关键词：

Deep learning; Complex network; Differential geometry; Ricci flow; CURVATURE; NETWORKS;

D O I：

10.1038/s41598-024-74045-9

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Deep neural networks (DNNs) are powerful tools for approximating the distribution of complex data. It is known that data passing through a trained DNN classifier undergoes a series of geometric and topological simplifications. While some progress has been made toward understanding these transformations in neural networks with smooth activation functions, an understanding in the more general setting of non-smooth activation functions, such as the rectified linear unit (ReLU), which tend to perform better, is required. Here we propose that the geometric transformations performed by DNNs during classification tasks have parallels to those expected under Hamilton's Ricci flow-a tool from differential geometry that evolves a manifold by smoothing its curvature, in order to identify its topology. To illustrate this idea, we present a computational framework to quantify the geometric changes that occur as data passes through successive layers of a DNN, and use this framework to motivate a notion of 'global Ricci network flow' that can be used to assess a DNN's ability to disentangle complex data geometries to solve classification problems. By training more than 1500 DNN classifiers of different widths and depths on synthetic and real-world data, we show that the strength of global Ricci network flow-like behaviour correlates with accuracy for well-trained DNNs, independently of depth, width and data set. Our findings motivate the use of tools from differential and discrete geometry to the problem of explainability in deep learning.

引用

页数：11

共 45 条

[1]

Anderson M.T., 2004, Notices of the American Mathematical Society, V51, P184

[2]

Ansuini A, 2019, ADV NEUR IN, V32

[3] Charting cellular differentiation trajectories with Ricci flow [J].