Blind Configuration of Multi-View Video Coder Prediction Structure

被引:2
作者
Hussein, Hany S. [1 ]
El-Khamy, Mostafa [2 ,3 ]
El-Sharkawy, Mohamed [4 ]
机构
[1] Egypt Japan Univ Sci & Technol E JUST, Elect & Commun Engn ECE Dept, New Borg El Arab City 21934, Alexandria, Egypt
[2] Univ Alexandria, Dept Elect Engn, Alexandria 21544, Egypt
[3] Egypt Japan Univ Sci & Technol E JUST, Alexandria, Egypt
[4] Purdue Sch Engn & Technol, IUPUI, Indianapolis, IN USA
关键词
3D video; blind source separation (BSS); joint multi-view video model (JMVM); multi-view video coding (MVC); GOP prediction structure; 3DTV;
D O I
10.1109/TCE.2013.6490259
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Efficient coding of 3D multi-view video depends on the group of pictures (GOP) prediction structure and the video stream encoding order. Optimizing the GOP prediction structure and the stream coding order will reduce the coding bit rate, improve the peak signal to noise ratio (PSNR) and reduce the coding complexity. To date, conventional coders are manually configured based on prior knowledge of the geometric arrangement of the video cameras and the properties of the video streams. In this paper, a blind self-configurable multi-view video coder (BC-MVC) algorithm is introduced. The proposed BC-MVC blindly estimates a GOP prediction structure without prior knowledge of the cameras' geometric arrangement. The BC-MVC decomposes the key video frames into independent bases and a projection (mixing) matrix using blind source separation. Based on the mixing matrix, an algorithm is developed to estimate the cameras' geometric arrangement and consequently an optimum GOP prediction structure. The experimental results show that the proposed blind multi-view video coder has better coding efficiency than conventional 3D multi-view video coders with predefined coding structures. It also shows that BC-MVC is robust to camera failures and severe channel errors. Moreover, the numerical complexity analysis shows that the proposed BC-MVC algorithm has lower computational complexity than existing multi-view video prediction schemes.(1)
引用
收藏
页码:191 / 199
页数:9
相关论文
共 17 条
  • [1] Jacobi angles for simultaneous diagonalization
    Cardoso, JF
    Souloumiac, A
    [J]. SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 1996, 17 (01) : 161 - 164
  • [2] Choi S., 2005, NEURAL INFORM PROCES, V6, P1
  • [3] El-Shafai W., 2011, 18 IEEE INT C IM PRO, P2233
  • [4] Feng Lu, 2010, 2010 International Conference on Audio, Language and Image Processing (ICALIP), P1227, DOI 10.1109/ICALIP.2010.5685139
  • [5] Hyvarinen A., 2001, INDEPENDENT COMPONEN, V26
  • [6] Fast disparity and motion estimation for mufti-view video coding
    Kim, Yongtae
    Kim, Jiyoung
    Sohn, Kwanghoon
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2007, 53 (02) : 712 - 719
  • [7] Multiview imaging and 3DTV - Special issue overview and introduction
    Kubota, Akira
    Smolic, Aljoscha
    Magnor, Marcus
    Tanirnoto, Masayuki
    Chen, Tsuhan
    Zhang, Cha
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2007, 24 (06) : 10 - 21
  • [8] Efficient prediction structures for multiview video coding
    Merkle, Philipp
    Smolic, Aljoscha
    Mueller, Karsten
    Wiegand, Thomas
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2007, 17 (11) : 1461 - 1473
  • [9] System architecture for free-viewpoint video and 3D-TV
    Morvan, Yannick
    Farin, Dirk
    de With, Peter H. N.
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (02) : 925 - 932
  • [10] Universal View Synthesis Unit for Glassless 3DTV
    Park, Jungsik
    Choi, Ji-Youn
    Ryu, In
    Park, Jong-Il
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (02) : 706 - 711