Video coding using 3-D dual-tree discrete wavelet transforms

Beibei Wang, Yao Wang, Ivan Selesnick, Anthony Vetro

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper explores the use of a recently introduced 3-D dual-tree discrete wavelet transform (DDWT) for video coding. The 3-D DDWT is an attractive video representation because it isolates motion along different directions in separate subbands. However, it is an overcomplete transform with 8:1 or 4:1 redundancy. Based on the effectiveness of the iterative projection-based noise shaping scheme proposed by Kingsbury on reducing the number of coefficients, and our prior investigation about the correlation between subbands at the same spatial/temporal location, both in the significance map and in actual coefficient values, a new video coding scheme using 3D DDWT is proposed. The proposed video codec does not require motion compensation and provides better performance than the 3D SPIHT codec, both objectively and subjectively, despite the fact that the raw number of coefficients resulting from the 3-D DDWT is much more than that of the conventional 3-D DWT. The proposed coder allows full scalability in spatial, temporal and quality dimensions.

Original languageEnglish (US)
Title of host publication2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing
PagesII61-II64
DOIs
StatePublished - Dec 1 2005
Event2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Philadelphia, PA, United States
Duration: Mar 18 2005Mar 23 2005

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
VolumeII
ISSN (Print)1520-6149

Other

Other2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05
CountryUnited States
CityPhiladelphia, PA
Period3/18/053/23/05

    Fingerprint

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Cite this

Wang, B., Wang, Y., Selesnick, I., & Vetro, A. (2005). Video coding using 3-D dual-tree discrete wavelet transforms. In 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing (pp. II61-II64). [1415341] (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; Vol. II). https://doi.org/10.1109/ICASSP.2005.1415341