Learning Dynamic Batch-Graph Representation for Deep Representation Learning

Cited by: 1
Authors
Wang, Xixi [1 ]
Jiang, Bo [1 ,2 ]
Wang, Xiao [1 ]
Luo, Bin [1 ]
Affiliations
[1] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
[2] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei 230601, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Image representation learning; Transformer; Graph learning; Metric learning; Mini-batch;
DOI
10.1007/s11263-024-02175-8
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recently, batch-based image data representation has been demonstrated to be effective for context-enhanced image representation. The core issue in this task is capturing the dependencies among image samples within each mini-batch and conducting message passing between different samples. Existing approaches mainly adopt self-attention or local self-attention models (along the patch dimension) for this task, which fail to fully exploit the intrinsic relationships of samples within a mini-batch and are also sensitive to noise and outliers. To address this issue, in this paper we propose a flexible Dynamic Batch-Graph Representation (DyBGR) model that automatically explores the intrinsic relationships among samples for contextual sample representation. Specifically, DyBGR first represents the mini-batch as a graph (termed a batch-graph) in which nodes represent image samples and edges encode the dependencies between them. This graph is learned dynamically under similarity, sparseness, and semantic-correlation constraints. DyBGR then exchanges sample (node) information on the batch-graph to update each node representation. Note that batch-graph learning and information propagation are optimized jointly to boost each other's performance. Furthermore, in practice, the DyBGR model can be implemented as a simple plug-and-play block (named the DyBGR block), which can thus be integrated into any mini-batch based deep representation learning scheme. Extensive experiments on deep metric learning tasks demonstrate the effectiveness of DyBGR. We will release the code at https://github.com/SissiW/DyBGR.
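The abstract describes a block that builds a sparse graph over the samples of a mini-batch and propagates messages along it. The paper's actual formulation (its constraints and optimization) is not given here, so the following PyTorch snippet is only a minimal illustrative sketch of the general idea: pairwise similarity between batch samples, top-k sparsification of the edges, and one round of message passing with a residual connection. The function name `dybgr_block` and the parameters `k` and `tau` are hypothetical, not from the paper.

```python
import torch
import torch.nn.functional as F

def dybgr_block(x: torch.Tensor, k: int = 4, tau: float = 0.1) -> torch.Tensor:
    """Illustrative batch-graph sketch (hypothetical, not the paper's method).

    x: (B, D) mini-batch of sample features; returns updated (B, D) features.
    """
    # Nodes are samples; edge weights start from pairwise cosine similarity.
    z = F.normalize(x, dim=1)
    sim = z @ z.t()                                   # (B, B) similarity

    # Sparseness: keep only the top-k neighbours of each node.
    _, idx = sim.topk(k, dim=1)
    mask = torch.zeros_like(sim).scatter_(1, idx, 1.0)

    # Row-normalised adjacency: softmax over the retained edges only.
    attn = torch.softmax(sim / tau + (mask - 1.0) * 1e9, dim=1)

    # Message passing: each sample aggregates its neighbours, with a residual.
    return x + attn @ x
```

In the actual model the graph is learned jointly with the feature updates rather than fixed by cosine similarity, and a semantic-correlation constraint is imposed as well; this sketch only shows where such a block would sit in a mini-batch pipeline.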
Pages: 84-105
Page count: 22