A two-stage video coding framework with both self-adaptive redundant dictionary and adaptively orthonormalized DCT basis

Yuanyi Xue, Yi Zhou, Yao Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this work, we propose a two-stage video coding framework, as an extension of our previous one-stage framework in [1]. The two-stage frameworks consists two different dictionaries. Specifically, the first stage directly finds the sparse representation of a block with a self-adaptive dictionary consisting of all possible inter-prediction candidates by solving an L0-norm minimization problem using orthogonal least squares (OLS), and the second stage codes the residual using altered DCT dictionary orthonormalized to the subspace spanned by the first stage atoms. The transition of the first stage and the second stage is adaptively determined based on the estimated residual reduction per bit. We further propose a complete context adaptive entropy coder to efficiently code the locations and the coefficients of chosen first stage atoms. Simulation results show that the proposed coder significantly improves the RD performance over our previous one-stage coder. More importantly, the two-stage coder, using a fixed block size and inter-prediction only, outperforms the H.264 coder (x264) and is competitive with the HEVC reference coder (HM) over a large rate range.

Original languageEnglish (US)
Title of host publication2015 IEEE International Conference on Image Processing, ICIP 2015 - Proceedings
PublisherIEEE Computer Society
Pages1697-1701
Number of pages5
Volume2015-December
ISBN (Print)9781479983391
DOIs
StatePublished - Dec 9 2015
EventIEEE International Conference on Image Processing, ICIP 2015 - Quebec City, Canada
Duration: Sep 27 2015Sep 30 2015

Other

OtherIEEE International Conference on Image Processing, ICIP 2015
CountryCanada
CityQuebec City
Period9/27/159/30/15

Fingerprint

Glossaries
Image coding
Atoms
Entropy

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition
  • Signal Processing

Cite this

Xue, Y., Zhou, Y., & Wang, Y. (2015). A two-stage video coding framework with both self-adaptive redundant dictionary and adaptively orthonormalized DCT basis. In 2015 IEEE International Conference on Image Processing, ICIP 2015 - Proceedings (Vol. 2015-December, pp. 1697-1701). [7351090] IEEE Computer Society. https://doi.org/10.1109/ICIP.2015.7351090

A two-stage video coding framework with both self-adaptive redundant dictionary and adaptively orthonormalized DCT basis. / Xue, Yuanyi; Zhou, Yi; Wang, Yao.

2015 IEEE International Conference on Image Processing, ICIP 2015 - Proceedings. Vol. 2015-December IEEE Computer Society, 2015. p. 1697-1701 7351090.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Xue, Y, Zhou, Y & Wang, Y 2015, A two-stage video coding framework with both self-adaptive redundant dictionary and adaptively orthonormalized DCT basis. in 2015 IEEE International Conference on Image Processing, ICIP 2015 - Proceedings. vol. 2015-December, 7351090, IEEE Computer Society, pp. 1697-1701, IEEE International Conference on Image Processing, ICIP 2015, Quebec City, Canada, 9/27/15. https://doi.org/10.1109/ICIP.2015.7351090
Xue Y, Zhou Y, Wang Y. A two-stage video coding framework with both self-adaptive redundant dictionary and adaptively orthonormalized DCT basis. In 2015 IEEE International Conference on Image Processing, ICIP 2015 - Proceedings. Vol. 2015-December. IEEE Computer Society. 2015. p. 1697-1701. 7351090 https://doi.org/10.1109/ICIP.2015.7351090
Xue, Yuanyi ; Zhou, Yi ; Wang, Yao. / A two-stage video coding framework with both self-adaptive redundant dictionary and adaptively orthonormalized DCT basis. 2015 IEEE International Conference on Image Processing, ICIP 2015 - Proceedings. Vol. 2015-December IEEE Computer Society, 2015. pp. 1697-1701
@inproceedings{9b8fe6c9260a4397838494d7147c2b33,
title = "A two-stage video coding framework with both self-adaptive redundant dictionary and adaptively orthonormalized DCT basis",
abstract = "In this work, we propose a two-stage video coding framework, as an extension of our previous one-stage framework in [1]. The two-stage frameworks consists two different dictionaries. Specifically, the first stage directly finds the sparse representation of a block with a self-adaptive dictionary consisting of all possible inter-prediction candidates by solving an L0-norm minimization problem using orthogonal least squares (OLS), and the second stage codes the residual using altered DCT dictionary orthonormalized to the subspace spanned by the first stage atoms. The transition of the first stage and the second stage is adaptively determined based on the estimated residual reduction per bit. We further propose a complete context adaptive entropy coder to efficiently code the locations and the coefficients of chosen first stage atoms. Simulation results show that the proposed coder significantly improves the RD performance over our previous one-stage coder. More importantly, the two-stage coder, using a fixed block size and inter-prediction only, outperforms the H.264 coder (x264) and is competitive with the HEVC reference coder (HM) over a large rate range.",
author = "Yuanyi Xue and Yi Zhou and Yao Wang",
year = "2015",
month = "12",
day = "9",
doi = "10.1109/ICIP.2015.7351090",
language = "English (US)",
isbn = "9781479983391",
volume = "2015-December",
pages = "1697--1701",
booktitle = "2015 IEEE International Conference on Image Processing, ICIP 2015 - Proceedings",
publisher = "IEEE Computer Society",

}

TY - GEN

T1 - A two-stage video coding framework with both self-adaptive redundant dictionary and adaptively orthonormalized DCT basis

AU - Xue, Yuanyi

AU - Zhou, Yi

AU - Wang, Yao

PY - 2015/12/9

Y1 - 2015/12/9

N2 - In this work, we propose a two-stage video coding framework, as an extension of our previous one-stage framework in [1]. The two-stage frameworks consists two different dictionaries. Specifically, the first stage directly finds the sparse representation of a block with a self-adaptive dictionary consisting of all possible inter-prediction candidates by solving an L0-norm minimization problem using orthogonal least squares (OLS), and the second stage codes the residual using altered DCT dictionary orthonormalized to the subspace spanned by the first stage atoms. The transition of the first stage and the second stage is adaptively determined based on the estimated residual reduction per bit. We further propose a complete context adaptive entropy coder to efficiently code the locations and the coefficients of chosen first stage atoms. Simulation results show that the proposed coder significantly improves the RD performance over our previous one-stage coder. More importantly, the two-stage coder, using a fixed block size and inter-prediction only, outperforms the H.264 coder (x264) and is competitive with the HEVC reference coder (HM) over a large rate range.

AB - In this work, we propose a two-stage video coding framework, as an extension of our previous one-stage framework in [1]. The two-stage frameworks consists two different dictionaries. Specifically, the first stage directly finds the sparse representation of a block with a self-adaptive dictionary consisting of all possible inter-prediction candidates by solving an L0-norm minimization problem using orthogonal least squares (OLS), and the second stage codes the residual using altered DCT dictionary orthonormalized to the subspace spanned by the first stage atoms. The transition of the first stage and the second stage is adaptively determined based on the estimated residual reduction per bit. We further propose a complete context adaptive entropy coder to efficiently code the locations and the coefficients of chosen first stage atoms. Simulation results show that the proposed coder significantly improves the RD performance over our previous one-stage coder. More importantly, the two-stage coder, using a fixed block size and inter-prediction only, outperforms the H.264 coder (x264) and is competitive with the HEVC reference coder (HM) over a large rate range.

UR - http://www.scopus.com/inward/record.url?scp=84956684180&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84956684180&partnerID=8YFLogxK

U2 - 10.1109/ICIP.2015.7351090

DO - 10.1109/ICIP.2015.7351090

M3 - Conference contribution

SN - 9781479983391

VL - 2015-December

SP - 1697

EP - 1701

BT - 2015 IEEE International Conference on Image Processing, ICIP 2015 - Proceedings

PB - IEEE Computer Society

ER -