A Joint Compression Scheme of Video Feature Descriptors and Visual Content

被引:36
作者
Zhang, Xiang [1 ]
Ma, Siwei [1 ]
Wang, Shiqi [2 ]
Zhang, Xinfeng [2 ]
Sun, Huifang [3 ]
Gao, Wen [1 ]
机构
[1] Peking Univ, Inst Digital Media, Sch Elect Engn & Comp Sci, Beijing 100871, Peoples R China
[2] Nanyang Technol Univ, Rapid Rich Object Search Lab, Singapore 639798, Singapore
[3] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA
基金
中国国家自然科学基金;
关键词
Video feature descriptor; visual retrieval; video compression; LOSSY COMPRESSION; REPRESENTATION; FRAMEWORK; SEARCH;
D O I
10.1109/TIP.2016.2629447
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-efficiency compression of visual feature descriptors has recently emerged as an active topic due to the rapidly increasing demand in mobile visual retrieval over bandwidth-limited networks. However, transmitting only those feature descriptors may largely restrict its application scale due to the lack of necessary visual content. To facilitate the wide spread of feature descriptors, a hybrid framework of jointly compressing the feature descriptors and visual content is highly desirable. In this paper, such a content-plus-feature coding scheme is investigated, aiming to shape the next generation of video compression system toward visual retrieval, where the high-efficiency coding of both feature descriptors and visual content can be achieved by exploiting the interactions between each other. On the one hand, visual feature descriptors can achieve compact and efficient representation by taking advantages of the structure and motion information in the compressed video stream. To optimize the retrieval performance, a novel rate-accuracy optimization technique is proposed to accurately estimate the retrieval performance degradation in feature coding. On the other hand, the already compressed feature data can be utilized to further improve the video coding efficiency by applying feature matching-based affine motion compensation. Extensive simulations have shown that the proposed joint compression framework can offer significant bitrate reduction in representing both feature descriptors and video frames, while simultaneously maintaining the state-of-the-art visual retrieval performance.
引用
收藏
页码:633 / 647
页数:15
相关论文
共 51 条
  • [1] Baroffio L, 2015, IEEE IMAGE PROC, P2530, DOI 10.1109/ICIP.2015.7351258
  • [2] Baroffio L, 2014, IEEE IMAGE PROC, P2794, DOI 10.1109/ICIP.2014.7025565
  • [3] Coding Visual Features Extracted From Video Sequences
    Baroffio, Luca
    Cesana, Matteo
    Redondi, Alessandro
    Tagliasacchi, Marco
    Tubaro, Stefano
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (05) : 2262 - 2276
  • [4] SURF: Speeded up robust features
    Bay, Herbert
    Tuytelaars, Tinne
    Van Gool, Luc
    [J]. COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 : 404 - 417
  • [5] Bjontegaard G., 2001, VCEGM33 ITUT SG16Q6V
  • [6] Chandrasekhar Vijay, 2009, Proceedings of the SPIE - The International Society for Optical Engineering, V7257, DOI 10.1117/12.805982
  • [7] A Novel Rate Control Framework for SIFT/SURF Feature Preservation in H.264/AVC Video Compression
    Chao, Jianshu
    Huitl, Robert
    Steinbach, Eckehard
    Schroeder, Damien
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (06) : 958 - 972
  • [8] Keypoint Encoding for Improved Feature Extraction From Compressed Video at Low Bitrates
    Chao, Jianshu
    Steinbach, Eckehard
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (01) : 25 - 39
  • [9] Chao JS, 2011, IEEE IMAGE PROC, P301, DOI 10.1109/ICIP.2011.6116299
  • [10] Interframe Coding of Global Image Signatures for Mobile Augmented Reality
    Chen, David M.
    Makar, Mina
    Araujo, Andre F.
    Girod, Bernd
    [J]. 2014 DATA COMPRESSION CONFERENCE (DCC 2014), 2014, : 33 - 42