A Joint Compression Scheme of Video Feature Descriptors and Visual Content

被引:36
作者
Zhang, Xiang [1 ]
Ma, Siwei [1 ]
Wang, Shiqi [2 ]
Zhang, Xinfeng [2 ]
Sun, Huifang [3 ]
Gao, Wen [1 ]
机构
[1] Peking Univ, Inst Digital Media, Sch Elect Engn & Comp Sci, Beijing 100871, Peoples R China
[2] Nanyang Technol Univ, Rapid Rich Object Search Lab, Singapore 639798, Singapore
[3] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA
基金
中国国家自然科学基金;
关键词
Video feature descriptor; visual retrieval; video compression; LOSSY COMPRESSION; REPRESENTATION; FRAMEWORK; SEARCH;
D O I
10.1109/TIP.2016.2629447
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-efficiency compression of visual feature descriptors has recently emerged as an active topic due to the rapidly increasing demand in mobile visual retrieval over bandwidth-limited networks. However, transmitting only those feature descriptors may largely restrict its application scale due to the lack of necessary visual content. To facilitate the wide spread of feature descriptors, a hybrid framework of jointly compressing the feature descriptors and visual content is highly desirable. In this paper, such a content-plus-feature coding scheme is investigated, aiming to shape the next generation of video compression system toward visual retrieval, where the high-efficiency coding of both feature descriptors and visual content can be achieved by exploiting the interactions between each other. On the one hand, visual feature descriptors can achieve compact and efficient representation by taking advantages of the structure and motion information in the compressed video stream. To optimize the retrieval performance, a novel rate-accuracy optimization technique is proposed to accurately estimate the retrieval performance degradation in feature coding. On the other hand, the already compressed feature data can be utilized to further improve the video coding efficiency by applying feature matching-based affine motion compensation. Extensive simulations have shown that the proposed joint compression framework can offer significant bitrate reduction in representing both feature descriptors and video frames, while simultaneously maintaining the state-of-the-art visual retrieval performance.
引用
收藏
页码:633 / 647
页数:15
相关论文
共 51 条
  • [11] Chen J, 2011, INT CONF ACOUST SPEE, P1029
  • [12] Overview of the MPEG CDVS standard
    Duan, Ling-Yu
    Huang, Tiejun
    Gao, Wen
    [J]. 2015 DATA COMPRESSION CONFERENCE (DCC), 2015, : 323 - 332
  • [13] Overview of the MPEG-CDVS Standard
    Duan, Ling-Yu
    Chandrasekhar, Vijay
    Chen, Jie
    Lin, Jie
    Wang, Zhe
    Huang, Tiejun
    Girod, Bernd
    Gao, Wen
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (01) : 179 - 194
  • [14] RANDOM SAMPLE CONSENSUS - A PARADIGM FOR MODEL-FITTING WITH APPLICATIONS TO IMAGE-ANALYSIS AND AUTOMATED CARTOGRAPHY
    FISCHLER, MA
    BOLLES, RC
    [J]. COMMUNICATIONS OF THE ACM, 1981, 24 (06) : 381 - 395
  • [15] Selection of local features for visual search
    Francini, Gianluca
    Lepsoy, Skjalg
    Balestri, Massimo
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2013, 28 (04) : 311 - 322
  • [16] Mobile Visual Search: Architectures, Technologies, and the Emerging MPEG Standard
    Girod, Bernd
    Chandrasekhar, Vijay
    Grzeszczuk, Radek
    Reznik, Yuriy A.
    [J]. IEEE MULTIMEDIA, 2011, 18 (03) : 86 - 94
  • [17] GreenEyes, ROM LANDM DAT
  • [18] Control-Point Representation and Differential Coding Affine-Motion Compensation
    Huang, Han
    Woods, John W.
    Zhao, Yao
    Bai, Huihui
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (10) : 1651 - 1660
  • [19] Triangulation embedding and democratic aggregation for image search
    Jegou, Herve
    Zisserman, Andrew
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3310 - 3317
  • [20] Jégou H, 2010, PROC CVPR IEEE, P3304, DOI 10.1109/CVPR.2010.5540039