DEEP-BASED FISHER VECTOR FOR MOBILE VISUAL SEARCH

Cited by: 0

Authors:

Huang, Chen [1]

Zhang, Shengchuan [1]

Lin, Xianming [1]

Liu, Xiangrong [1]

Ji, Rongrong [1]

Affiliations:

[1] Xiamen Univ, Sch Informat Sci & Engn, Xiamen 361005, Peoples R China

Source:

2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2017

Keywords:

CDVS; mobile visual search; Fisher Vector; autoencoder; Fisher layer

DOI:

Not available

Chinese Library Classification:

TB8 [Photographic technology]

Discipline classification code:

0804

Abstract:

We tackle the problem of mobile visual search. The Moving Picture Experts Group (MPEG) has completed a standard named Compact Descriptors for Visual Search (CDVS) to provide a standardized syntax for image retrieval applications. CDVS applies principal component analysis (PCA) to reduce the dimension of the local feature descriptors that are fed into the global descriptor pipeline, and uses the traditional Fisher Vector (FV) as the aggregation algorithm for local feature descriptors. However, the descriptor components of SIFT and FV have highly non-Gaussian statistics, and applying a single PCA transform can in fact hurt compression performance at high rates. We develop a network-based architecture that combines neural networks with an FV layer to obtain the Fisher vector. Our architecture has two advantages over the CDVS global descriptor pipeline. First, we employ autoencoder networks to reduce the dimensionality of the data. Second, we use a trainable system to learn the parameters after the FV codebook is obtained. Experiments demonstrate a clear advantage of the proposed architecture on the CDVS retrieval task.
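To make the aggregation step mentioned in the abstract concrete, the following is a minimal NumPy sketch of a first-order Fisher vector computed from dimension-reduced local descriptors and a diagonal-covariance GMM codebook. It is an illustration only, not the authors' network or the CDVS reference implementation: the function name `fisher_vector_means`, the restriction to gradients with respect to the GMM means, and the power and L2 normalization steps are all assumptions made for this example.

```python
import numpy as np

def fisher_vector_means(x, weights, means, variances):
    """First-order Fisher vector (gradient w.r.t. GMM means); hypothetical helper.

    x: (N, D) local descriptors, assumed already dimension-reduced.
    weights: (K,) mixture weights; means, variances: (K, D) diagonal GMM.
    """
    n = x.shape[0]
    diff = x[:, None, :] - means[None, :, :]                 # (N, K, D)
    # Log-density of every descriptor under every diagonal Gaussian.
    log_prob = -0.5 * (np.sum(diff**2 / variances, axis=2)
                       + np.sum(np.log(2.0 * np.pi * variances), axis=1))
    log_post = np.log(weights) + log_prob                     # (N, K)
    log_post -= log_post.max(axis=1, keepdims=True)           # numerical stability
    post = np.exp(log_post)
    post /= post.sum(axis=1, keepdims=True)                   # soft assignments
    # Accumulate posterior-weighted, variance-normalized deviations per Gaussian.
    grad = np.einsum("nk,nkd->kd", post, diff / np.sqrt(variances))
    grad /= n * np.sqrt(weights)[:, None]
    fv = grad.ravel()                                         # length K * D
    fv = np.sign(fv) * np.sqrt(np.abs(fv))                    # power normalization (assumed)
    norm = np.linalg.norm(fv)
    return fv / norm if norm > 0 else fv                      # L2 normalization (assumed)
```

In the pipeline the abstract describes, the descriptors `x` would come from a PCA projection (CDVS) or from an autoencoder's bottleneck (the proposed architecture), and the GMM parameters would be the FV codebook; how the paper makes those parameters trainable is not specified in this record.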

Pages: 3430-3434

Number of pages: 5

50 items in total

[21] CROSS-MODALITY MATCHING BASED ON FISHER VECTOR WITH NEURAL WORD EMBEDDINGS AND DEEP IMAGE FEATURES [J].
Han, Liang; Wang, Wenmin; Fan, Mengdi; Wang, Ronggang.
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, :2921-2925

[22] Deep Fisher-Vector Descriptors for Image Retrieval and Scene Recognition [J].
Husain, Syed Sameed; Ong, Eng-Jon; Silva, Lisa; Thanveer, Mohamed Faheem; Bober, Miroslaw.
PROCEEDINGS OF 2024 ACM ICMR WORKSHOP ON MULTIMODAL VIDEO RETRIEVAL, ICMR-MVR 2024, 2024, :20-26

[23] Visual Recognition of Ancient Inscriptions Using Convolutional Neural Network and Fisher Vector [J].
Amato, Giuseppe; Falchi, Fabrizio; Vadicamo, Lucia.
ACM JOURNAL ON COMPUTING AND CULTURAL HERITAGE, 2016, 9 (04)

[24] Defect classification on semiconductor wafers using Fisher vector and visual vocabularies coding [J].
Gomez-Sirvent, Jose L.; Lopez de la Rosa, Francisco; Sanchez-Reolid, Roberto; Morales, Rafael; Fernandez-Caballero, Antonio.
MEASUREMENT, 2022, 202

[25] Context Awareness-Based Mobile Visual Search Model for Narrative Mural Scenes [J].
Sun, Shouqiang; Li, Qingqing; Xiao, Shuyue; Zeng, Ziming.
Data Analysis and Knowledge Discovery, 2024, 8 (8-9) :52-62

[26] PKUBENCH: A CONTEXT RICH MOBILE VISUAL SEARCH BENCHMARK [J].
Ji, Rongrong; Duan, Ling-Yu; Chen, Jie; Yang, Shuang; Huang, Tiejun; Yao, Hongxun; Gao, Wen.
2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,

[27] TapTell: Interactive visual search for mobile task recommendation [J].
Zhang, Ning; Mei, Tao; Hua, Xian-Sheng; Guan, Ling; Li, Shipeng.
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 29 :114-124

[28] Mobile Visual Search Using Image and Text Features [J].
Tsai, Sam S.; Chen, Huizhong; Chen, David; Vedantham, Ramakrishna; Grzeszczuk, Radek; Girod, Bernd.
2011 CONFERENCE RECORD OF THE FORTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS (ASILOMAR), 2011, :845-849

[29] COMPARISON OF LOCAL FEATURE DESCRIPTORS FOR MOBILE VISUAL SEARCH [J].
Chandrasekhar, Vijay; Chen, David M.; Lin, Andy; Takacs, Gabriel; Tsai, Sam S.; Cheung, Ngai-Man; Reznik, Yuriy; Grzeszczuk, Radek; Girod, Bernd.
2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, :3885-3888

[30] Mobile Visual Search from Dynamic Image Databases [J].
Chen, Xi; Koskela, Markus.
IMAGE ANALYSIS: 17TH SCANDINAVIAN CONFERENCE, SCIA 2011, 2011, 6688 :196-205
