Video coding using 3D dual-tree wavelet transform

Beibei Wang, Yao Wang, Ivan Selesnick, Anthony Vetro

Research output: Contribution to journal > Article

Abstract

This work investigates the use of the 3D dual-tree discrete wavelet transform (DDWT) for video coding. The 3D DDWT is an attractive video representation because it isolates image patterns with different spatial orientations and motion directions and speeds in separate subbands. However, it is an overcomplete transform with 4:1 redundancy when only real parts are used. We apply the noise-shaping algorithm proposed by Kingsbury to reduce the number of coefficients. To code the remaining significant coefficients, we propose two video codecs. The first one applies separate 3D set partitioning in hierarchical trees (SPIHT) on each subset of the DDWT coefficients (each forming a standard isotropic tree). The second codec exploits the correlation between redundant subbands, and codes the subbands jointly. Neither codec requires motion compensation, and both provide better performance than the 3D SPIHT codec using the standard DWT, both objectively and subjectively. Furthermore, both codecs provide full scalability in spatial, temporal, and quality dimensions. Besides the standard isotropic decomposition, we propose an anisotropic DDWT, which extends the superiority of the normal DDWT with more directional subbands without adding to the redundancy. This anisotropic structure requires significantly fewer coefficients to represent a video after noise shaping. Finally, we also explore the benefits of combining the 3D DDWT with the standard DWT to capture a wider set of orientations.

Original language: English (US)
Article number: 42761
Journal: Eurasip Journal on Image and Video Processing
Volume: 2007
DOI: 10.1155/2007/42761
State: Published - 2007

Fingerprint

Discrete wavelet transforms
Image coding
Wavelet transforms
Redundancy
Motion compensation
Scalability
Decomposition

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Signal Processing
  • Information Systems

Cite this

Video coding using 3D dual-tree wavelet transform. / Wang, Beibei; Wang, Yao; Selesnick, Ivan; Vetro, Anthony.

In: Eurasip Journal on Image and Video Processing, Vol. 2007, 42761, 2007.

@article{5e83c1e0a8084a329d35d406817f5ba2,
title = "Video coding using 3D dual-tree wavelet transform",
abstract = "This work investigates the use of the 3D dual-tree discrete wavelet transform (DDWT) for video coding. The 3D DDWT is an attractive video representation because it isolates image patterns with different spatial orientations and motion directions and speeds in separate subbands. However, it is an overcomplete transform with 4 : 1 redundancy when only real parts are used. We apply the noise-shaping algorithm proposed by Kingsbury to reduce the number of coefficients. To code the remaining significant coefficients, we propose two video codecs. The first one applies separate 3D set partitioning in hierarchical trees (SPIHT) on each subset of the DDWT coefficients (each forming a standard isotropic tree). The second codec exploits the correlation between redundant subbands, and codes the subbands jointly. Both codecs do not require motion compensation and provide better performance than the 3D SPIHT codec using the standard DWT, both objectively and subjectively. Furthermore, both codecs provide full scalability in spatial, temporal, and quality dimensions. Besides the standard isotropic decomposition, we propose an anisotropic DDWT, which extends the superiority of the normal DDWT with more directional subbands without adding to the redundancy. This anisotropic structure requires significantly fewer coefficients to represent a video after noise shaping. Finally, we also explore the benefits of combining the 3D DDWT with the standard DWT to capture a wider set of orientations.",
author = "Beibei Wang and Yao Wang and Ivan Selesnick and Anthony Vetro",
year = "2007",
doi = "10.1155/2007/42761",
language = "English (US)",
volume = "2007",
journal = "Eurasip Journal on Image and Video Processing",
issn = "1687-5176",
publisher = "Springer Publishing Company",
}

TY - JOUR

T1 - Video coding using 3D dual-tree wavelet transform

AU - Wang, Beibei

AU - Wang, Yao

AU - Selesnick, Ivan

AU - Vetro, Anthony

PY - 2007

Y1 - 2007

AB - This work investigates the use of the 3D dual-tree discrete wavelet transform (DDWT) for video coding. The 3D DDWT is an attractive video representation because it isolates image patterns with different spatial orientations and motion directions and speeds in separate subbands. However, it is an overcomplete transform with 4 : 1 redundancy when only real parts are used. We apply the noise-shaping algorithm proposed by Kingsbury to reduce the number of coefficients. To code the remaining significant coefficients, we propose two video codecs. The first one applies separate 3D set partitioning in hierarchical trees (SPIHT) on each subset of the DDWT coefficients (each forming a standard isotropic tree). The second codec exploits the correlation between redundant subbands, and codes the subbands jointly. Both codecs do not require motion compensation and provide better performance than the 3D SPIHT codec using the standard DWT, both objectively and subjectively. Furthermore, both codecs provide full scalability in spatial, temporal, and quality dimensions. Besides the standard isotropic decomposition, we propose an anisotropic DDWT, which extends the superiority of the normal DDWT with more directional subbands without adding to the redundancy. This anisotropic structure requires significantly fewer coefficients to represent a video after noise shaping. Finally, we also explore the benefits of combining the 3D DDWT with the standard DWT to capture a wider set of orientations.

UR - http://www.scopus.com/inward/record.url?scp=34247220373&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34247220373&partnerID=8YFLogxK

U2 - 10.1155/2007/42761

DO - 10.1155/2007/42761

M3 - Article

AN - SCOPUS:34247220373

VL - 2007

JO - Eurasip Journal on Image and Video Processing

JF - Eurasip Journal on Image and Video Processing

SN - 1687-5176

M1 - 42761

ER -