JPEG-matched MRC compression of compound documents

D. Mukherjee, N. Memon, A. Said

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Mixed Raster Content (MRC) is an ITU document compression standard (T.44) that specifies both a model for the multilayer representation of a compound document and a set of allowable standardized coders for the individual layers. The model requires decomposing a document into two image layers and a binary mask layer, but the standard does not recommend any procedure for this task. For best compression results, the decomposition method should be optimized for the layer encoders. This paper presents a high-performance MRC compound document codec in which the layer decomposition scheme is matched to the JPEG encoder, with arithmetic coding, used for the foreground and background image layers. JBIG is used to code the mask layer. Integrated noise removal routines enable handling of scanned documents as well as electronic ones. Resolution-scalable decoding features are also implemented. The page segmenter yields a segmentation mask, which serves to separate text and other features.
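The three-layer model described in the abstract can be illustrated with a minimal sketch. This is not the paper's codec: the threshold-based `decompose` function, its `threshold` parameter, and the constant fill values are hypothetical choices made only for illustration, and the convention that a mask value of 1 selects the foreground is assumed here. Only `reconstruct` reflects the general idea that the binary mask chooses, pixel by pixel, between the two image layers.

```python
# Illustrative sketch of a three-layer MRC-style model (not the paper's method).
# Images are lists of rows of grayscale values in 0..255.

def decompose(page, threshold=128):
    """Naive decomposition: pixels darker than `threshold` go to the foreground.

    The threshold rule and the fill values are assumptions for illustration.
    """
    mask = [[1 if p < threshold else 0 for p in row] for row in page]
    # Pixels not selected by the mask are "don't care" values in each layer.
    # They are filled with constants here; a decomposition matched to the
    # layer coder would instead choose fills that compress well.
    foreground = [[p if m else 0 for p, m in zip(row, mrow)]
                  for row, mrow in zip(page, mask)]
    background = [[p if not m else 255 for p, m in zip(row, mrow)]
                  for row, mrow in zip(page, mask)]
    return foreground, background, mask


def reconstruct(foreground, background, mask):
    """The mask selects the foreground pixel (1) or the background pixel (0)."""
    return [[f if m else b for f, b, m in zip(frow, brow, mrow)]
            for frow, brow, mrow in zip(foreground, background, mask)]


page = [
    [250, 245, 20, 240],
    [248, 15, 10, 251],
]
fg, bg, mask = decompose(page)
restored = reconstruct(fg, bg, mask)
```

In this lossless toy setting `restored` equals `page`. In a real codec each layer would pass through its own lossy or lossless coder, which is why the choice of don't-care fill values affects compression.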

Original language: English (US)
Title of host publication: IEEE International Conference on Image Processing
Pages: 434-437
Number of pages: 4
Volume: 3
State: Published - 2001
Event: IEEE International Conference on Image Processing (ICIP) - Thessaloniki, Greece
Duration: Oct 7 2001 to Oct 10 2001

Fingerprint

Masks
Decomposition
Decoding
Multilayers

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Hardware and Architecture
  • Electrical and Electronic Engineering

Cite this

Mukherjee, D., Memon, N., & Said, A. (2001). JPEG-matched MRC compression of compound documents. In IEEE International Conference on Image Processing (Vol. 3, pp. 434-437).

@inproceedings{5fed10eb27174d07ae0ae2a0b6cbc3f7,
title = "JPEG-matched MRC compression of compound documents",
author = "D. Mukherjee and N. Memon and A. Said",
year = "2001",
language = "English (US)",
volume = "3",
pages = "434--437",
booktitle = "IEEE International Conference on Image Processing",

}

TY - GEN

T1 - JPEG-matched MRC compression of compound documents

AU - Mukherjee, D.

AU - Memon, N.

AU - Said, A.

PY - 2001

Y1 - 2001

AB - Mixed Raster Content (MRC) is an ITU document compression standard (T.44) specifying both a model for multilayer representation of a compound document, and a set of allowable standardized coders for the individual layers. The model requires decomposition of a document into two image layers and a binary mask layer, but the standard does not recommend any procedure for this task. For best compression results, the decomposition method should be optimized for the layer encoders. In this paper, a high performance MRC compound document codec is presented, where the layer decomposition scheme is matched to the JPEG encoder with arithmetic coding for the foreground and background image layers. JBIG is used to code the mask layer. Integrated noise removal routines enable handling of scanned documents along with electronic ones. Resolution scalable decoding features are also implemented. The page segmenter yields a segmentation mask, which serves to separate text and other features.

UR - http://www.scopus.com/inward/record.url?scp=0035168598&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0035168598&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0035168598

VL - 3

SP - 434

EP - 437

BT - IEEE International Conference on Image Processing

ER -