Subgraphs Matching-Based Side Information Generation for Distributed Multiview Video Coding

Cited by: 9
Authors
Xiong, Hongkai [1, 2]
Lv, Hui [1 ]
Zhang, Yongsheng [1 ]
Song, Li [1 ]
He, Zhihai [3 ]
Chen, Tsuhan [2 ]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA
[3] Univ Missouri, Dept Elect & Comp Engn, Columbia, MO 65211 USA
Source
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING | 2009
Funding
National High Technology Research and Development Program of China (863 Program);
Keywords
Image coding - Video signal processing - Electric distortion - Image segmentation - Mathematical transformations - Graphic methods - Signal distortion - Feature extraction;
DOI
10.1155/2009/386795
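Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification code
0808; 0809;
Abstract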
We adopt constrained relaxation for distributed multiview video coding (DMVC). The novel framework integrates graph-based segmentation and matching to generate inter-view correlated side information without knowledge of the camera parameters, inspired by subgraph semantics and sparse decomposition of high-dimensional scale-invariant feature data. The sparse data serve as a good hypothesis space for optimizing the matching of inter-view side information with compact syndromes from the inferred relaxed coset. Plausible filling-in from a priori feature constraints between neighboring views reinforces the compensation for inter-view side-information generation in joint multiview decoding. Graph-based representations of the multiview images are adopted as the constrained relaxation, which assists inter-view correlation matching for subgraph semantics of the original Wyner-Ziv image by means of graph-based image segmentation, the scale-invariant feature detector MSER (maximally stable extremal regions), and the descriptor SIFT (scale-invariant feature transform). To obtain distinctive feature matching with a more stable approximation, linear (PCA-SIFT) and nonlinear (locally linear embedding) projections are adopted to reduce the dimension of the SIFT descriptors, and a TPS (thin plate spline) warping model is used to capture a more accurate inter-view motion model. The experimental results validate the high estimation precision and the rate-distortion improvements. Copyright (c) 2009 Hongkai Xiong et al.
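The dimension-reduction and matching step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it applies plain PCA to descriptor vectors and matches them with a nearest-neighbor ratio test, and it omits the segmentation, MSER detection, locally linear embedding, and TPS warping stages. The function names `pca_project` and `ratio_match`, the 20-dimensional target, the 0.8 ratio threshold, and the random 128-dimensional vectors standing in for SIFT descriptors are all assumptions made for illustration.

```python
import numpy as np


def pca_project(descriptors, n_components):
    """Fit PCA on row-wise descriptors; return (mean, basis, projected data)."""
    mean = descriptors.mean(axis=0)
    centered = descriptors - mean
    # Rows of vt are principal directions, sorted by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    basis = vt[:n_components].T
    return mean, basis, centered @ basis


def ratio_match(a, b, ratio=0.8):
    """Match each row of a to its nearest row of b if it passes the ratio test."""
    matches = []
    for i, d in enumerate(a):
        dist = np.linalg.norm(b - d, axis=1)
        order = np.argsort(dist)
        best, second = order[0], order[1]
        # Keep the match only when the best candidate is clearly closer
        # than the runner-up (hypothetical threshold, see lead-in).
        if dist[best] < ratio * dist[second]:
            matches.append((i, int(best)))
    return matches


# Synthetic stand-in data (assumption): 50 random 128-D "descriptors" for one
# view, and a shuffled, slightly noisy copy playing the role of a second view.
rng = np.random.default_rng(0)
view_a = rng.random((50, 128))
perm = rng.permutation(50)
view_b = view_a[perm] + 0.01 * rng.standard_normal((50, 128))

# Learn the projection on one view and apply the same projection to the other.
mean, basis, a_low = pca_project(view_a, 20)
b_low = (view_b - mean) @ basis

matches = ratio_match(a_low, b_low)
correct = sum(1 for i, j in matches if perm[j] == i)
print(f"{len(matches)} matches, {correct} correct")
```

In a real pipeline the matched point pairs would then feed a geometric model such as the TPS warp named in the abstract; that stage is deliberately left out here.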
Pages: 17