One-pass mode decision for low-complexity and high-efficiency encoding of quality scalable video

Meng Xu, Yao Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper presents a novel one-pass mode decision algorithm for encoding multiple quality layers, following the coarse grain scalability coding structure in the H.264/SVC standard. In our solution, motion estimation (ME) is carried out only once at the base layer using the reconstructed picture from the highest enhancement layer. The same motion vectors are used for all layers to not only avoid multiple ME processes at different layers, but also save the overhead bits. In addition, early SKIP/DIRECT mode decision is introduced to further boost the encoding speed. The encoder produces fully compliant SVC bit streams. Although the method is applicable to both coarse grain scalability (CGS) and medium grain scalability (MGS), we have examined its performance over CGS only. We demonstrate that more than 2x speedup for three-layer coding against the conventional H.264/SVC encoding using the reference software over 7 test sequences. Significantly, this complexity saving is achieved simultaneously with increase in the coding efficiency! Although the base layer requires slightly higher bit rate (2.5% in terms of the BD-Rate), the enhancement layers enjoy lower rates (5.7% and 2.2% reduction for the total of two and three layers, respectively), on average of 7 test sequences.

Original languageEnglish (US)
Title of host publicationElectronic Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2013
DOIs
StatePublished - 2013
Event2013 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2013 - San Jose, CA, United States
Duration: Jul 15 2013Jul 19 2013

Other

Other2013 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2013
CountryUnited States
CitySan Jose, CA
Period7/15/137/19/13

Fingerprint

Scalability
Motion estimation

Keywords

  • CGS
  • H.264/SVC
  • low complexity encoder
  • mode decision

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Computer Vision and Pattern Recognition

Cite this

Xu, M., & Wang, Y. (2013). One-pass mode decision for low-complexity and high-efficiency encoding of quality scalable video. In Electronic Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2013 [6618322] https://doi.org/10.1109/ICMEW.2013.6618322

One-pass mode decision for low-complexity and high-efficiency encoding of quality scalable video. / Xu, Meng; Wang, Yao.

Electronic Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2013. 2013. 6618322.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Xu, M & Wang, Y 2013, One-pass mode decision for low-complexity and high-efficiency encoding of quality scalable video. in Electronic Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2013., 6618322, 2013 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2013, San Jose, CA, United States, 7/15/13. https://doi.org/10.1109/ICMEW.2013.6618322
Xu M, Wang Y. One-pass mode decision for low-complexity and high-efficiency encoding of quality scalable video. In Electronic Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2013. 2013. 6618322 https://doi.org/10.1109/ICMEW.2013.6618322
Xu, Meng ; Wang, Yao. / One-pass mode decision for low-complexity and high-efficiency encoding of quality scalable video. Electronic Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2013. 2013.
@inproceedings{faae4856c202473d8b74e7c3b9f54b7f,
title = "One-pass mode decision for low-complexity and high-efficiency encoding of quality scalable video",
abstract = "This paper presents a novel one-pass mode decision algorithm for encoding multiple quality layers, following the coarse grain scalability coding structure in the H.264/SVC standard. In our solution, motion estimation (ME) is carried out only once at the base layer using the reconstructed picture from the highest enhancement layer. The same motion vectors are used for all layers to not only avoid multiple ME processes at different layers, but also save the overhead bits. In addition, early SKIP/DIRECT mode decision is introduced to further boost the encoding speed. The encoder produces fully compliant SVC bit streams. Although the method is applicable to both coarse grain scalability (CGS) and medium grain scalability (MGS), we have examined its performance over CGS only. We demonstrate that more than 2x speedup for three-layer coding against the conventional H.264/SVC encoding using the reference software over 7 test sequences. Significantly, this complexity saving is achieved simultaneously with increase in the coding efficiency! Although the base layer requires slightly higher bit rate (2.5{\%} in terms of the BD-Rate), the enhancement layers enjoy lower rates (5.7{\%} and 2.2{\%} reduction for the total of two and three layers, respectively), on average of 7 test sequences.",
keywords = "CGS, H.264/SVC, low complexity encoder, mode decision",
author = "Meng Xu and Yao Wang",
year = "2013",
doi = "10.1109/ICMEW.2013.6618322",
language = "English (US)",
isbn = "9781479916047",
booktitle = "Electronic Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2013",

}

TY - GEN

T1 - One-pass mode decision for low-complexity and high-efficiency encoding of quality scalable video

AU - Xu, Meng

AU - Wang, Yao

PY - 2013

Y1 - 2013

N2 - This paper presents a novel one-pass mode decision algorithm for encoding multiple quality layers, following the coarse grain scalability coding structure in the H.264/SVC standard. In our solution, motion estimation (ME) is carried out only once at the base layer using the reconstructed picture from the highest enhancement layer. The same motion vectors are used for all layers to not only avoid multiple ME processes at different layers, but also save the overhead bits. In addition, early SKIP/DIRECT mode decision is introduced to further boost the encoding speed. The encoder produces fully compliant SVC bit streams. Although the method is applicable to both coarse grain scalability (CGS) and medium grain scalability (MGS), we have examined its performance over CGS only. We demonstrate that more than 2x speedup for three-layer coding against the conventional H.264/SVC encoding using the reference software over 7 test sequences. Significantly, this complexity saving is achieved simultaneously with increase in the coding efficiency! Although the base layer requires slightly higher bit rate (2.5% in terms of the BD-Rate), the enhancement layers enjoy lower rates (5.7% and 2.2% reduction for the total of two and three layers, respectively), on average of 7 test sequences.

AB - This paper presents a novel one-pass mode decision algorithm for encoding multiple quality layers, following the coarse grain scalability coding structure in the H.264/SVC standard. In our solution, motion estimation (ME) is carried out only once at the base layer using the reconstructed picture from the highest enhancement layer. The same motion vectors are used for all layers to not only avoid multiple ME processes at different layers, but also save the overhead bits. In addition, early SKIP/DIRECT mode decision is introduced to further boost the encoding speed. The encoder produces fully compliant SVC bit streams. Although the method is applicable to both coarse grain scalability (CGS) and medium grain scalability (MGS), we have examined its performance over CGS only. We demonstrate that more than 2x speedup for three-layer coding against the conventional H.264/SVC encoding using the reference software over 7 test sequences. Significantly, this complexity saving is achieved simultaneously with increase in the coding efficiency! Although the base layer requires slightly higher bit rate (2.5% in terms of the BD-Rate), the enhancement layers enjoy lower rates (5.7% and 2.2% reduction for the total of two and three layers, respectively), on average of 7 test sequences.

KW - CGS

KW - H.264/SVC

KW - low complexity encoder

KW - mode decision

UR - http://www.scopus.com/inward/record.url?scp=84888236346&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84888236346&partnerID=8YFLogxK

U2 - 10.1109/ICMEW.2013.6618322

DO - 10.1109/ICMEW.2013.6618322

M3 - Conference contribution

SN - 9781479916047

BT - Electronic Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2013

ER -