A Joint Compression Scheme of Video Feature Descriptors and Visual Content

被引：36

作者：

Zhang, Xiang ^{[1
]}

Ma, Siwei ^{[1
]}

Wang, Shiqi ^{[2
]}

Zhang, Xinfeng ^{[2
]}

Sun, Huifang ^{[3
]}

Gao, Wen ^{[1
]}

机构：

[1] Peking Univ, Inst Digital Media, Sch Elect Engn & Comp Sci, Beijing 100871, Peoples R China

[2] Nanyang Technol Univ, Rapid Rich Object Search Lab, Singapore 639798, Singapore

[3] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2017年 / 26卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Video feature descriptor; visual retrieval; video compression; LOSSY COMPRESSION; REPRESENTATION; FRAMEWORK; SEARCH;

D O I：

10.1109/TIP.2016.2629447

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

High-efficiency compression of visual feature descriptors has recently emerged as an active topic due to the rapidly increasing demand in mobile visual retrieval over bandwidth-limited networks. However, transmitting only those feature descriptors may largely restrict its application scale due to the lack of necessary visual content. To facilitate the wide spread of feature descriptors, a hybrid framework of jointly compressing the feature descriptors and visual content is highly desirable. In this paper, such a content-plus-feature coding scheme is investigated, aiming to shape the next generation of video compression system toward visual retrieval, where the high-efficiency coding of both feature descriptors and visual content can be achieved by exploiting the interactions between each other. On the one hand, visual feature descriptors can achieve compact and efficient representation by taking advantages of the structure and motion information in the compressed video stream. To optimize the retrieval performance, a novel rate-accuracy optimization technique is proposed to accurately estimate the retrieval performance degradation in feature coding. On the other hand, the already compressed feature data can be utilized to further improve the video coding efficiency by applying feature matching-based affine motion compensation. Extensive simulations have shown that the proposed joint compression framework can offer significant bitrate reduction in representing both feature descriptors and video frames, while simultaneously maintaining the state-of-the-art visual retrieval performance.

引用

页码：633 / 647

页数：15

共 51 条

[11] Chen J, 2011, INT CONF ACOUST SPEE, P1029
[12] Overview of the MPEG CDVS standard
Duan, Ling-Yu
Huang, Tiejun
Gao, Wen
[J]. 2015 DATA COMPRESSION CONFERENCE (DCC), 2015, : 323 - 332
[13] Overview of the MPEG-CDVS Standard
Duan, Ling-Yu
Chandrasekhar, Vijay
Chen, Jie
Lin, Jie
Wang, Zhe
Huang, Tiejun
Girod, Bernd
Gao, Wen
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (01) : 179 - 194
[14] RANDOM SAMPLE CONSENSUS - A PARADIGM FOR MODEL-FITTING WITH APPLICATIONS TO IMAGE-ANALYSIS AND AUTOMATED CARTOGRAPHY
FISCHLER, MA
BOLLES, RC
[J]. COMMUNICATIONS OF THE ACM, 1981, 24 (06) : 381 - 395
[15] Selection of local features for visual search
Francini, Gianluca
Lepsoy, Skjalg
Balestri, Massimo
[J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2013, 28 (04) : 311 - 322
[16] Mobile Visual Search: Architectures, Technologies, and the Emerging MPEG Standard
Girod, Bernd
Chandrasekhar, Vijay
Grzeszczuk, Radek
Reznik, Yuriy A.
[J]. IEEE MULTIMEDIA, 2011, 18 (03) : 86 - 94
[17] GreenEyes, ROM LANDM DAT
[18] Control-Point Representation and Differential Coding Affine-Motion Compensation
Huang, Han
Woods, John W.
Zhao, Yao
Bai, Huihui
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (10) : 1651 - 1660
[19] Triangulation embedding and democratic aggregation for image search
Jegou, Herve
Zisserman, Andrew
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3310 - 3317
[20] Jégou H, 2010, PROC CVPR IEEE, P3304, DOI 10.1109/CVPR.2010.5540039

← 1 2 3 4 5 6 →