Grassmannian learning mutual subspace method for image set recognition

被引：6

作者：

Souza, Lincon S. ^{[1
]}

Sogi, Naoya ^{[3
]}

Gatto, Bernardo B. ^{[1
]}

Kobayashi, Takumi ^{[1
,2
]}

Fukui, Kazuhiro ^{[2
]}

机构：

[1] Natl Inst Adv Ind Sci & Technol, Dept Informat Technol & Human Factors, Koto Ku, 2-4-7 Aomi, Tokyo 1350064, Japan

[2] Univ Tsukuba, Fac Engn Informat & Syst, 1-1-1 Tennodai, Tsukuba, Ibaraki 3058573, Japan

[3] Univ Tsukuba, Grad Sch Syst & Informat Engn, 1-1-1 Tennodai, Tsukuba, Ibaraki 3058573, Japan

来源：

NEUROCOMPUTING | 2023年 / 517卷

基金：

日本学术振兴会;

关键词：

Grassmannian learning mutual subspace; method; Learning subspace methods; Subspace learning; Image recognition; Deep neural networks; Manifold optimization; APPROXIMATED NEAREST POINTS; MANIFOLD-MANIFOLD DISTANCE; FACE RECOGNITION; OBJECT; GEOMETRY; ROBUST;

D O I：

10.1016/j.neucom.2022.10.040

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper addresses the problem of object recognition given a set of images as input (e.g., multiple cam-era sources and video frames). Convolutional neural network (CNN)-based frameworks do not exploit these sets effectively, processing a pattern as observed, not capturing the underlying feature distribution as it does not consider the variance of images in the set. To address this issue, we propose the Grassmannian learning mutual subspace method (G-LMSM), a NN layer embedded on top of CNNs that can process image sets more effectively and can be trained in an end-to-end manner. The image set is first represented by a low-dimensional input subspace and then this input subspace is matched with dic-tionary subspaces by a similarity of their canonical angles, an interpretable and easy to compute metric. The key idea of G-LMSM is that the dictionary subspaces are learned as points on the Grassmann man-ifold, optimized with Riemannian stochastic gradient descent. This learning is stable, efficient and theo-retically well-grounded. We demonstrate the effectiveness of our proposed method on hand shape recognition, face identification, and facial emotion recognition.(c) 2022 Elsevier B.V. All rights reserved.

引用

页码：20 / 33

页数：14

共 92 条

[1] Riemannian geometry of Grassmann manifolds with a view on algorithmic computation [J].

Absil, PA ;

Mahony, R ;

Sepulchre, R .

ACTA APPLICANDAE MATHEMATICAE, 2004, 80 (02) :199-220

[2]

Afriat S.N., 1957, MATH PROC CAMBRIDGE, V53, P800, DOI DOI 10.1017/S0305004100032916

[3]

Agarwala A, 2020, Arxiv, DOI arXiv:2010.07344

[4]

[Anonymous], 2008, Subspace-based Learning with Grassmann Kernels

[5]

Arandjelovic O, 2005, PROC CVPR IEEE, P581

[6] Lambertian reflectance and linear subspaces [J].

Basri, R ;

Jacobs, DW .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (02) :218-233

[7]

B‚cigneul G, 2019, Arxiv, DOI arXiv:1810.00760

[8] What is the set of images of an object under all possible illumination conditions? [J].

Belhumeur, PN ;

Kriegman, DJ .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 1998, 28 (03) :245-260

[9] Stochastic Gradient Descent on Riemannian Manifolds [J].

Bonnabel, Silvere .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2013, 58 (09) :2217-2229

[10] GRASSMANN MANIFOLDS AND THE GRASSMANN IMAGE OF SUBMANIFOLDS [J].

BORISENKO, AA ;

NIKOLAEVSKII, YA .

RUSSIAN MATHEMATICAL SURVEYS, 1991, 46 (02) :45-94

← 1 2 3 4 5 6 7 8 9 10 →