Hyperbolic Deep Learning in Computer Vision: A Survey

被引：8

作者：

Mettes, Pascal ^{[1
]}

Atigh, Mina Ghadimi ^{[1
]}

Keller-Ressel, Martin ^{[2
]}

Gu, Jeffrey ^{[3
]}

Yeung, Serena ^{[3
]}

机构：

[1] Univ Amsterdam, Amsterdam, Netherlands

[2] Tech Univ Dresden, Dresden, Germany

[3] Stanford Univ, Stanford, CA USA

来源：

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2024年 / 132卷 / 09期

关键词：

Hyperbolic deep learning; Computer vision; Representation learning;

D O I：

10.1007/s11263-024-02043-5

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep representation learning is a ubiquitous part of modern computer vision. While Euclidean space has been the de facto standard manifold for learning visual representations, hyperbolic space has recently gained rapid traction for learning in computer vision. Specifically, hyperbolic learning has shown a strong potential to embed hierarchical structures, learn from limited samples, quantify uncertainty, add robustness, limit error severity, and more. In this paper, we provide a categorization and in-depth overview of current literature on hyperbolic learning for computer vision. We research both supervised and unsupervised literature and identify three main research themes in each direction. We outline how hyperbolic learning is performed in all themes and discuss the main research problems that benefit from current advances in hyperbolic learning for computer vision. Moreover, we provide a high-level intuition behind hyperbolic geometry and outline open research questions to further advance research in this direction.

引用

页码：3484 / 3508

页数：25

共 171 条

[1]

Ahmad O, 2022, AAAI CONF ARTIF INTE, P5968

[2]

Allen Carl, 2019, Advances in Neural Information Processing Systems

[3] Deep Semantic Hashing with Structure-Semantic Disagreement Correction via Hyperbolic Metric Learning [J].

Amin, Fazail ;

Mondal, Arijit ;

Mathew, Jimson .

2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,

[4]

[Anonymous], 2015, NATURE

[5]

[Anonymous], 2014, P 30 ANN S COMPUTATI

[6]

Anvekar T., 2023, ARXIV

[7] Multimodal sentiment and emotion recognition in hyperbolic space [J].

Arano, Keith April ;

Orsenigo, Carlotta ;

Soto, Mauricio ;

Vercellis, Carlo .

EXPERT SYSTEMS WITH APPLICATIONS, 2021, 184

[8]

Arjovsky M, 2017, PR MACH LEARN RES, V70

[9]

Asano Yuki Markus, 2019, arXiv

[10] Masked Siamese Networks for Label-Efficient Learning [J].

Assran, Mahmoud ;

Caron, Mathilde ;

Misra, Ishan ;

Bojanowski, Piotr ;

Bordes, Florian ;

Vincent, Pascal ;

Joulin, Armand ;

Rabbat, Mike ;

Ballas, Nicolas .

COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 :456-473

← 1 2 3 4 5 6 7 8 9 10 →