Subgraphs Matching-Based Side Information Generation for Distributed Multiview Video Coding

Cited by: 9

Authors:

Xiong, Hongkai [1,2]

Lv, Hui [1]

Zhang, Yongsheng [1]

Song, Li [1]

He, Zhihai [3]

Chen, Tsuhan [2]

Affiliations:

[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China

[2] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA

[3] Univ Missouri, Dept Elect & Comp Engn, Columbia, MO 65211 USA

Source:

EURASIP Journal on Advances in Signal Processing | 2009

Funding:

National High Technology Research and Development Program of China (863 Program);

Keywords:

Image coding - Video signal processing - Electric distortion - Image segmentation - Mathematical transformations - Graphic methods - Signal distortion - Feature extraction;

DOI:

10.1155/2009/386795

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

We adopt constrained relaxation for distributed multiview video coding (DMVC). Inspired by subgraph semantics and the sparse decomposition of high-dimensional scale-invariant feature data, the proposed framework integrates graph-based segmentation and matching to generate inter-view correlated side information without knowledge of the camera parameters. The sparse data serve as a good hypothesis space for optimizing the matching of inter-view side information with compact syndromes from the inferred relaxed coset. Plausible filling-in from a priori feature constraints between neighboring views reinforces the compensation of inter-view side-information generation for joint multiview decoding. Graph-based representations of the multiview images are adopted as the constrained relaxation; they assist inter-view correlation matching for the subgraph semantics of the original Wyner-Ziv image through graph-based image segmentation together with the scale-invariant feature detector MSER (maximally stable extremal regions) and the descriptor SIFT (scale-invariant feature transform). To obtain distinctive feature matching with a more stable approximation, linear (PCA-SIFT) and nonlinear (locally linear embedding) projections are used to reduce the dimension of the SIFT descriptors, and a TPS (thin plate spline) warping model is used to capture a more accurate inter-view motion model. The experimental results validate the high estimation precision and the rate-distortion improvements. Copyright (c) 2009 Hongkai Xiong et al.

References

Pages: 17

34 entries in total

