Angular Deep Supervised Vector Quantization for Image Retrieval

被引：8

作者：

Zhou, Chang ^{[1
]}

Po, Lai Man ^{[1
]}

Ou, Weifeng ^{[1
]}

机构：

[1] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2022年 / 33卷 / 04期

关键词：

Quantization (signal); Neural networks; Image retrieval; Task analysis; Optimization; Deep learning; Training; image retrieval; nearest neighbor search; neural networks; vector quantization (VQ);

D O I：

10.1109/TNNLS.2020.3043103

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Most of the deep quantization methods adopt unsupervised approaches, and the quantization process usually occurs in the Euclidean space on top of the deep feature and its approximate value. When this approach is applied to the retrieval tasks, since the internal product space of the retrieval process is different from the Euclidean space of quantization, minimizing the quantization error (QE) does not necessarily lead to a good performance on the maximum inner product search (MIPS). To solve these problems, we treat Softmax classification as vector quantization (VQ) with angular decision boundaries and propose angular deep supervised VQ (ADSVQ) for image retrieval. Our approach can simultaneously learn the discriminative feature representation and the updatable codebook, both lying on a hypersphere. To reduce the QE between centroids and deep features, two regularization terms are proposed as supervision signals to encourage the intra-class compactness and inter-class balance, respectively. ADSVQ explicitly reformulates the asymmetric distance computation in MIPS to transform the image retrieval process into a two-stage classification process. Moreover, we discuss the extension of multiple-label cases from the perspective of quantization with binary classification. Extensive experiments demonstrate that the proposed ADSVQ has excellent performance on four well-known image data sets when compared with the state-of-the-art hashing methods.

引用

页码：1638 / 1649

页数：12

共 43 条

[1]

Calefati A., 2018, ARXIV180708512

[2] Deep Visual-Semantic Quantization for Efficient Image Retrieval [J].

Cao, Yue ;

Long, Mingsheng ;

Wang, Jianmin ;

Liu, Shichen .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :916-925

[3]

Cao Y, 2016, AAAI CONF ARTIF INTE, P3457

[4] HashNet: Deep Learning to Hash by Continuation [J].

Cao, Zhangjie ;

Long, Mingsheng ;

Wang, Jianmin ;

Yu, Philip S. .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5609-5618

[5] Beyond triplet loss: a deep quadruplet network for person re-identification [J].

Chen, Weihua ;

Chen, Xiaotang ;

Zhang, Jianguo ;

Huang, Kaiqi .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1320-1329

[6] ArcFace: Additive Angular Margin Loss for Deep Face Recognition [J].

Deng, Jiankang ;

Guo, Jia ;

Xue, Niannan ;

Zafeiriou, Stefanos .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4685-4694

[7] Deep Spherical Quantization for Image Search [J].

Eghbali, Sepehr ;

Tahvildari, Ladan .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11682-11691

[8]

Gao L., 2019, ARXIV190606698

[9]

Gersho A., 2012, Vector quantization and signal compression

[10]

Gionis A, 1999, PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, P518

← 1 2 3 4 5 →