One-Pass Mode and Motion Decision for Multilayer Quality Scalable Video Coding

Meng Xu, Zhan Ma, Yao Wang

Research output: Contribution to journal › Article

Abstract

This paper presents a novel low-complexity motion estimation and mode decision algorithm for encoding multiple quality layers following the H.264/scalable video coding standard, considering both coarse grain scalability (CGS) and medium grain scalability (MGS). The proposed algorithm conducts motion estimation and mode decision only at the base layer (BL) and forces the higher layers to inherit the motion and mode decisions of the BL. For the decision made at the BL to be nearly optimal for all layers, we use the highest-layer reconstructed frame as the reference frame for motion estimation and set the Lagrangian multipliers according to the quantization parameters of the current and higher layers. We also propose a simple early skip/direct decision to further boost the encoding speed. Mode decision and motion estimation are conducted at a higher layer only if the layer below it uses the skip/direct mode for a block. Significant complexity reduction can be achieved because mode decision and motion estimation are performed at most once for each macroblock. Because the mode and motion information needs to be transmitted only once, we also achieve a slightly better rate-distortion (R-D) performance for typical videos. Experiments have shown more than 2× (up to 5×) speedup for a three-layer encoder against the conventional R-D optimized reference software JSVM on both CIF and HD sequences, and for both CGS and MGS, with the coding-efficiency tradeoff measured by the Bjontegaard delta rate.
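
The control flow described in the abstract can be illustrated with a minimal Python sketch. It is only one reading of the abstract, not the authors' implementation: the names `Decision`, `combined_lambda`, `decide_macroblock`, `early_skip` and `full_search` are hypothetical, the averaging rule in `combined_lambda` is a placeholder because the abstract does not give the exact rule, and the per-QP multiplier 0.85 · 2^((QP − 12)/3) is the formula commonly used in H.264 reference encoders, assumed here rather than taken from the paper. The use of the highest-layer reconstructed frame as the motion-estimation reference is not modeled.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple

SKIP_DIRECT = "SKIP/DIRECT"


@dataclass(frozen=True)
class Decision:
    """Mode and motion vector chosen for one macroblock (hypothetical type)."""
    mode: str
    mv: Tuple[int, int]


def lagrange_multiplier(qp: int) -> float:
    # Mode-decision multiplier commonly used in H.264 reference encoders.
    # Assumption: the paper's exact per-layer formula may differ.
    return 0.85 * 2 ** ((qp - 12) / 3)


def combined_lambda(layer_qps: List[int], layer: int) -> float:
    # Placeholder rule: average the multipliers of the current and all
    # higher layers. The abstract only says the multiplier depends on
    # these QPs; the averaging is an assumption made for illustration.
    qps = layer_qps[layer:]
    return sum(lagrange_multiplier(q) for q in qps) / len(qps)


def decide_macroblock(
    layer_qps: List[int],
    early_skip: Callable[[int], bool],
    full_search: Callable[[int, float], Decision],
) -> Tuple[List[Decision], int]:
    """Return one decision per layer and the number of full searches run.

    early_skip(layer) stands in for the cheap early skip/direct test;
    full_search(layer, lam) stands in for full motion estimation plus
    mode decision. Both are supplied by the caller.
    """
    decisions: List[Decision] = []
    searches = 0
    inherited: Optional[Decision] = None
    for layer in range(len(layer_qps)):
        if inherited is not None:
            # A lower layer already ran the full search: inherit its result.
            decisions.append(inherited)
        elif early_skip(layer):
            # Layer is coded as skip/direct; the next layer may still search.
            decisions.append(Decision(SKIP_DIRECT, (0, 0)))
        else:
            # First non-skip layer: run the full search exactly once.
            inherited = full_search(layer, combined_lambda(layer_qps, layer))
            searches += 1
            decisions.append(inherited)
    return decisions, searches


if __name__ == "__main__":
    # Three quality layers with decreasing QP; the base layer is skipped.
    result, count = decide_macroblock(
        [36, 30, 24],
        early_skip=lambda layer: layer == 0,
        full_search=lambda layer, lam: Decision("INTER_16x16", (1, -2)),
    )
    print(count, [d.mode for d in result])
```

In this sketch the full search runs at most once per macroblock, at the first layer that is not skip/direct, which mirrors the source of the complexity reduction stated above.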

Original language: English (US)
Article number: 7172494
Pages (from-to): 4250-4262
Number of pages: 13
Journal: IEEE Transactions on Image Processing
Volume: 24
Issue number: 11
DOI: 10.1109/TIP.2015.2462747
State: Published - Nov 1 2015

Fingerprint

Scalable video coding
Motion estimation
Multilayers
Scalability
Image coding
Experiments

Keywords

  • CGS
  • Fast motion and mode decision
  • H.264/SVC
  • MGS

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Software

Cite this

One-Pass Mode and Motion Decision for Multilayer Quality Scalable Video Coding. / Xu, Meng; Ma, Zhan; Wang, Yao.

In: IEEE Transactions on Image Processing, Vol. 24, No. 11, 7172494, 01.11.2015, p. 4250-4262.


@article{54ab84f157bc41d8a998b100c6ef0852,
title = "One-Pass Mode and Motion Decision for Multilayer Quality Scalable Video Coding",
abstract = "This paper presents a novel low-complexity motion estimation and mode decision algorithm for encoding multiple quality layers following the H.264/scalable video coding standard, considering both coarse grain scalability (CGS) and medium grain scalability (MGS). The proposed algorithm conducts motion estimation and mode decision only at the base layer (BL) and forces the higher layers to inherit the motion and mode decisions of the BL. For the decision made at the BL to be nearly optimal for all layers, we use the highest-layer reconstructed frame as the reference frame for motion estimation and set the Lagrangian multipliers according to the quantization parameters of the current and higher layers. We also propose a simple early skip/direct decision to further boost the encoding speed. Mode decision and motion estimation are conducted at a higher layer only if the layer below it uses the skip/direct mode for a block. Significant complexity reduction can be achieved because mode decision and motion estimation are performed at most once for each macroblock. Because the mode and motion information needs to be transmitted only once, we also achieve a slightly better rate-distortion (R-D) performance for typical videos. Experiments have shown more than 2× (up to 5×) speedup for a three-layer encoder against the conventional R-D optimized reference software JSVM on both CIF and HD sequences, and for both CGS and MGS, with the coding-efficiency tradeoff measured by the Bjontegaard delta rate.",
keywords = "CGS, Fast motion and mode decision, H.264/SVC, MGS",
author = "Meng Xu and Zhan Ma and Yao Wang",
year = "2015",
month = "11",
day = "1",
doi = "10.1109/TIP.2015.2462747",
language = "English (US)",
volume = "24",
pages = "4250--4262",
journal = "IEEE Transactions on Image Processing",
issn = "1057-7149",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "11",

}

TY - JOUR

T1 - One-Pass Mode and Motion Decision for Multilayer Quality Scalable Video Coding

AU - Xu, Meng

AU - Ma, Zhan

AU - Wang, Yao

PY - 2015/11/1

Y1 - 2015/11/1

AB - This paper presents a novel low-complexity motion estimation and mode decision algorithm for encoding multiple quality layers following the H.264/scalable video coding standard, considering both coarse grain scalability (CGS) and medium grain scalability (MGS). The proposed algorithm conducts motion estimation and mode decision only at the base layer (BL) and forces the higher layers to inherit the motion and mode decisions of the BL. For the decision made at the BL to be nearly optimal for all layers, we use the highest-layer reconstructed frame as the reference frame for motion estimation and set the Lagrangian multipliers according to the quantization parameters of the current and higher layers. We also propose a simple early skip/direct decision to further boost the encoding speed. Mode decision and motion estimation are conducted at a higher layer only if the layer below it uses the skip/direct mode for a block. Significant complexity reduction can be achieved because mode decision and motion estimation are performed at most once for each macroblock. Because the mode and motion information needs to be transmitted only once, we also achieve a slightly better rate-distortion (R-D) performance for typical videos. Experiments have shown more than 2× (up to 5×) speedup for a three-layer encoder against the conventional R-D optimized reference software JSVM on both CIF and HD sequences, and for both CGS and MGS, with the coding-efficiency tradeoff measured by the Bjontegaard delta rate.

KW - CGS

KW - Fast motion and mode decision

KW - H.264/SVC

KW - MGS

UR - http://www.scopus.com/inward/record.url?scp=84939833303&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84939833303&partnerID=8YFLogxK

U2 - 10.1109/TIP.2015.2462747

DO - 10.1109/TIP.2015.2462747

M3 - Article

AN - SCOPUS:84939833303

VL - 24

SP - 4250

EP - 4262

JO - IEEE Transactions on Image Processing

JF - IEEE Transactions on Image Processing

SN - 1057-7149

IS - 11

M1 - 7172494

ER -