A Novel Video Coding Framework using Self-adaptive Dictionary

Yuanyi Xue, Yao Wang

Research output: Contribution to journalArticle

Abstract

In this paper, we propose to use a self-adaptive redundant dictionary, consisting of all possible inter and intra prediction candidates, to directly represent the frame blocks in a video sequence. The self-adaptive dictionary generalizes the conventional predictive coding approach by allowing adaptive linear combinations of prediction candidates, which is solved by an rate-distortion aware L0-norm minimization problem using orthogonal least squares (OLS). To overcome the inefficiency in quantizing and coding coefficients corresponding to correlated chosen atoms, we orthonormalize the chosen atoms recursively as part of OLS process. We further propose a two-stage video coding framework, in which a second stage codes the residual from the chosen atoms using a modified DCT dictionary that is adaptively orthonormalized with respect to the subspace spanned by the first stage atoms. To determine the transition from the first stage to the second stage, we propose a rate-distortion (RD) aware adaptive switching algorithm. The proposed framework is further extended to accommodate variable block sizes (16 × 16, 8 × 8, and 4×4), and the partition mode is derived by a fast partition mode decision algorithm. A context-adaptive binary arithmetic entropy coder is designed to code the symbols of the proposed coding framework. The proposed coder shows competitive, and in some cases better rate-distortion performance, compared to the HEVC video coding standard for P-frames.

Original languageEnglish (US)
JournalIEEE Transactions on Circuits and Systems for Video Technology
DOIs
StateAccepted/In press - Oct 4 2017

Fingerprint

Glossaries
Image coding
Atoms
Entropy

Keywords

  • L0-norm minimization
  • Rate-distortion optimization
  • Sparse coding
  • Video coding

ASJC Scopus subject areas

  • Media Technology
  • Electrical and Electronic Engineering

Cite this

@article{8a60f5c75634450d92f82587fa4e1d69,
title = "A Novel Video Coding Framework using Self-adaptive Dictionary",
abstract = "In this paper, we propose to use a self-adaptive redundant dictionary, consisting of all possible inter and intra prediction candidates, to directly represent the frame blocks in a video sequence. The self-adaptive dictionary generalizes the conventional predictive coding approach by allowing adaptive linear combinations of prediction candidates, which is solved by an rate-distortion aware L0-norm minimization problem using orthogonal least squares (OLS). To overcome the inefficiency in quantizing and coding coefficients corresponding to correlated chosen atoms, we orthonormalize the chosen atoms recursively as part of OLS process. We further propose a two-stage video coding framework, in which a second stage codes the residual from the chosen atoms using a modified DCT dictionary that is adaptively orthonormalized with respect to the subspace spanned by the first stage atoms. To determine the transition from the first stage to the second stage, we propose a rate-distortion (RD) aware adaptive switching algorithm. The proposed framework is further extended to accommodate variable block sizes (16 × 16, 8 × 8, and 4×4), and the partition mode is derived by a fast partition mode decision algorithm. A context-adaptive binary arithmetic entropy coder is designed to code the symbols of the proposed coding framework. The proposed coder shows competitive, and in some cases better rate-distortion performance, compared to the HEVC video coding standard for P-frames.",
keywords = "L0-norm minimization, Rate-distortion optimization, Sparse coding, Video coding",
author = "Yuanyi Xue and Yao Wang",
year = "2017",
month = "10",
day = "4",
doi = "10.1109/TCSVT.2017.2760143",
language = "English (US)",
journal = "IEEE Transactions on Circuits and Systems for Video Technology",
issn = "1051-8215",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - A Novel Video Coding Framework using Self-adaptive Dictionary

AU - Xue, Yuanyi

AU - Wang, Yao

PY - 2017/10/4

Y1 - 2017/10/4

N2 - In this paper, we propose to use a self-adaptive redundant dictionary, consisting of all possible inter and intra prediction candidates, to directly represent the frame blocks in a video sequence. The self-adaptive dictionary generalizes the conventional predictive coding approach by allowing adaptive linear combinations of prediction candidates, which is solved by an rate-distortion aware L0-norm minimization problem using orthogonal least squares (OLS). To overcome the inefficiency in quantizing and coding coefficients corresponding to correlated chosen atoms, we orthonormalize the chosen atoms recursively as part of OLS process. We further propose a two-stage video coding framework, in which a second stage codes the residual from the chosen atoms using a modified DCT dictionary that is adaptively orthonormalized with respect to the subspace spanned by the first stage atoms. To determine the transition from the first stage to the second stage, we propose a rate-distortion (RD) aware adaptive switching algorithm. The proposed framework is further extended to accommodate variable block sizes (16 × 16, 8 × 8, and 4×4), and the partition mode is derived by a fast partition mode decision algorithm. A context-adaptive binary arithmetic entropy coder is designed to code the symbols of the proposed coding framework. The proposed coder shows competitive, and in some cases better rate-distortion performance, compared to the HEVC video coding standard for P-frames.

AB - In this paper, we propose to use a self-adaptive redundant dictionary, consisting of all possible inter and intra prediction candidates, to directly represent the frame blocks in a video sequence. The self-adaptive dictionary generalizes the conventional predictive coding approach by allowing adaptive linear combinations of prediction candidates, which is solved by an rate-distortion aware L0-norm minimization problem using orthogonal least squares (OLS). To overcome the inefficiency in quantizing and coding coefficients corresponding to correlated chosen atoms, we orthonormalize the chosen atoms recursively as part of OLS process. We further propose a two-stage video coding framework, in which a second stage codes the residual from the chosen atoms using a modified DCT dictionary that is adaptively orthonormalized with respect to the subspace spanned by the first stage atoms. To determine the transition from the first stage to the second stage, we propose a rate-distortion (RD) aware adaptive switching algorithm. The proposed framework is further extended to accommodate variable block sizes (16 × 16, 8 × 8, and 4×4), and the partition mode is derived by a fast partition mode decision algorithm. A context-adaptive binary arithmetic entropy coder is designed to code the symbols of the proposed coding framework. The proposed coder shows competitive, and in some cases better rate-distortion performance, compared to the HEVC video coding standard for P-frames.

KW - L0-norm minimization

KW - Rate-distortion optimization

KW - Sparse coding

KW - Video coding

UR - http://www.scopus.com/inward/record.url?scp=85030775519&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85030775519&partnerID=8YFLogxK

U2 - 10.1109/TCSVT.2017.2760143

DO - 10.1109/TCSVT.2017.2760143

M3 - Article

JO - IEEE Transactions on Circuits and Systems for Video Technology

JF - IEEE Transactions on Circuits and Systems for Video Technology

SN - 1051-8215

ER -