A Joint Compression Scheme of Video Feature Descriptors and Visual Content

被引：36

作者：

Zhang, Xiang ^{[1
]}

Ma, Siwei ^{[1
]}

Wang, Shiqi ^{[2
]}

Zhang, Xinfeng ^{[2
]}

Sun, Huifang ^{[3
]}

Gao, Wen ^{[1
]}

机构：

[1] Peking Univ, Inst Digital Media, Sch Elect Engn & Comp Sci, Beijing 100871, Peoples R China

[2] Nanyang Technol Univ, Rapid Rich Object Search Lab, Singapore 639798, Singapore

[3] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2017年 / 26卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Video feature descriptor; visual retrieval; video compression; LOSSY COMPRESSION; REPRESENTATION; FRAMEWORK; SEARCH;

D O I：

10.1109/TIP.2016.2629447

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

High-efficiency compression of visual feature descriptors has recently emerged as an active topic due to the rapidly increasing demand in mobile visual retrieval over bandwidth-limited networks. However, transmitting only those feature descriptors may largely restrict its application scale due to the lack of necessary visual content. To facilitate the wide spread of feature descriptors, a hybrid framework of jointly compressing the feature descriptors and visual content is highly desirable. In this paper, such a content-plus-feature coding scheme is investigated, aiming to shape the next generation of video compression system toward visual retrieval, where the high-efficiency coding of both feature descriptors and visual content can be achieved by exploiting the interactions between each other. On the one hand, visual feature descriptors can achieve compact and efficient representation by taking advantages of the structure and motion information in the compressed video stream. To optimize the retrieval performance, a novel rate-accuracy optimization technique is proposed to accurately estimate the retrieval performance degradation in feature coding. On the other hand, the already compressed feature data can be utilized to further improve the video coding efficiency by applying feature matching-based affine motion compensation. Extensive simulations have shown that the proposed joint compression framework can offer significant bitrate reduction in representing both feature descriptors and video frames, while simultaneously maintaining the state-of-the-art visual retrieval performance.

引用

页码：633 / 647

页数：15

共 51 条

[1] Baroffio L, 2015, IEEE IMAGE PROC, P2530, DOI 10.1109/ICIP.2015.7351258
[2] Baroffio L, 2014, IEEE IMAGE PROC, P2794, DOI 10.1109/ICIP.2014.7025565
[3] Coding Visual Features Extracted From Video Sequences
Baroffio, Luca
Cesana, Matteo
Redondi, Alessandro
Tagliasacchi, Marco
Tubaro, Stefano
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (05) : 2262 - 2276
[4] SURF: Speeded up robust features
Bay, Herbert
Tuytelaars, Tinne
Van Gool, Luc
[J]. COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 : 404 - 417
[5] Bjontegaard G., 2001, VCEGM33 ITUT SG16Q6V
[6] Chandrasekhar Vijay, 2009, Proceedings of the SPIE - The International Society for Optical Engineering, V7257, DOI 10.1117/12.805982
[7] A Novel Rate Control Framework for SIFT/SURF Feature Preservation in H.264/AVC Video Compression
Chao, Jianshu
Huitl, Robert
Steinbach, Eckehard
Schroeder, Damien
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (06) : 958 - 972
[8] Keypoint Encoding for Improved Feature Extraction From Compressed Video at Low Bitrates
Chao, Jianshu
Steinbach, Eckehard
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (01) : 25 - 39
[9] Chao JS, 2011, IEEE IMAGE PROC, P301, DOI 10.1109/ICIP.2011.6116299
[10] Interframe Coding of Global Image Signatures for Mobile Augmented Reality
Chen, David M.
Makar, Mina
Araujo, Andre F.
Girod, Bernd
[J]. 2014 DATA COMPRESSION CONFERENCE (DCC 2014), 2014, : 33 - 42

← 1 2 3 4 5 6 →