Motion estimation and compensation based on almost shift-invariant wavelet transform for image sequence coding

被引:0
|
作者
Al-Mohimeed, MA
Li, CC [1 ]
机构
[1] Univ Pittsburgh, Dept Elect Engn, Pittsburgh, PA 15261 USA
[2] King Saud Univ, Dept Comp Engn, Riyadh 11543, Saudi Arabia
关键词
D O I
10.1002/(SICI)1098-1098(1998)9:4<214::AID-IMA4>3.0.CO;2-D
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
There has been rapid progress in the application of wavelet transforms to image and image sequence compression. The standard discrete wavelet transform lacks translation invariance in image decomposition which will affect the accuracy of motion estimation from the decomposed subimages in video coding. In this article, we present a study of applying an almost shift-invariant wavelet transform with "oversampled frames" to image sequence compression. With minimal oversampling and biorthogonal spline wavelets in the almost shift-invariant wavelet transform, motion vectors can be more accurately estimated, contributing toward fewer prediction errors in comparison to those obtained with the standard discrete wavelet transform. Thus, an improved compression ratio can be obtained. We present two new algorithms, the full-motion oversampling algorithm (FMOS) and the reduced search multiresolution motion estimation algorithm (MRME), for estimating motion fields at different scales and in different subimages. In the latter, motion vectors at a higher resolution are approximated by the motion vector estimates at a lower resolution through proper scaling. Experiments were performed on three video sequences with a variety of motions including slow, fast, and zooming. Our results have shown that both algorithms, FMOS and MRME, using the almost shift-invariant oversampled frame wavelet transform have reduced prediction errors and enhanced the compression performance in terms of peak-signal-to-noise ratio (PSNR) for the same bit rate when compared to the existing full motion standard algorithm. (C) 1998 John Wiley & Sons, Inc. Int J Imaging Syst Technol, 9, 214-229, 1998.
引用
收藏
页码:214 / 229
页数:16
相关论文
共 50 条
  • [1] Image sequence fusion using a shift-invariant wavelet transform
    Rockinger, O
    INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL III, 1997, : 288 - 291
  • [2] Application of shift-invariant wavelet transform to video coding
    AlMohimeed, MA
    Li, CC
    VIDEO TECHNIQUES AND SOFTWARE FOR FULL-SERVICE NETWORKS, 1997, 2915 : 64 - 75
  • [3] Wavelet-based moving-picture coding using shift-invariant motion estimation in wavelet domain
    Kim, HS
    Park, HW
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2001, 16 (07) : 669 - 679
  • [4] Circular shift-invariant wavelet transform
    Mahmoud, W.A.
    Wafi, N.K.
    Al-Qudsi, W.R.
    Advances in Modelling and Analysis B, 2003, 46 (7-8): : 25 - 34
  • [5] A shift-invariant discrete wavelet transform
    SariSarraf, H
    Brzakovic, D
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1997, 45 (10) : 2621 - 2626
  • [6] Ovocyte texture analysis through almost shift-invariant decimated wavelet transform
    Nicolier, F
    Legrand, AC
    Laligant, O
    Kohler, S
    Truchetet, F
    KES'2000: FOURTH INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED INTELLIGENT ENGINEERING SYSTEMS & ALLIED TECHNOLOGIES, VOLS 1 AND 2, PROCEEDINGS, 2000, : 385 - 388
  • [7] Shift-invariant discrete wavelet transform analysis for retinal image classification
    Khademi, April
    Krishnan, Sridhar
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2007, 45 (12) : 1211 - 1222
  • [8] Shift-invariant discrete wavelet transform analysis for retinal image classification
    April Khademi
    Sridhar Krishnan
    Medical & Biological Engineering & Computing, 2007, 45 : 1211 - 1222
  • [9] Exposure Fusion Based on Shift-Invariant Discrete Wavelet Transform
    Wang, Jinhua
    Xu, De
    Lang, Congyan
    Li, Bing
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2011, 27 (01) : 197 - 211
  • [10] Wavelet-domain shift invariant motion estimation and compensation
    Cai, WT
    Adjouadi, M
    PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2004, : 49 - 52