JPEG-matched MRC compression of compound documents

D. Mukherjee, N. Memon, A. Said

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    Mixed Raster Content (MRC) is an ITU document compression standard (T.44) specifying both a model for multilayer representation of a compound document, and a set of allowable standardized coders for the individual layers. The model requires decomposition of a document into two image layers and a binary mask layer, but the standard does not recommend any procedure for this task. For best compression results, the decomposition method should be optimized for the layer encoders. In this paper, a high performance MRC compound document codec is presented, where the layer decomposition scheme is matched to the JPEG encoder with arithmetic coding for the foreground and background image layers. JBIG is used to code the mask layer. Integrated noise removal routines enable handling of scanned documents along with electronic ones. Resolution scalable decoding features are also implemented. The page segmenter yields a segmentation mask, which serves to separate text and other features.

    Original languageEnglish (US)
    Title of host publicationIEEE International Conference on Image Processing
    Pages434-437
    Number of pages4
    Volume3
    StatePublished - 2001
    EventIEEE International Conference on Image Processing (ICIP) - Thessaloniki, Greece
    Duration: Oct 7 2001Oct 10 2001

    Other

    OtherIEEE International Conference on Image Processing (ICIP)
    CountryGreece
    CityThessaloniki
    Period10/7/0110/10/01

    Fingerprint

    Masks
    Decomposition
    Decoding
    Multilayers

    ASJC Scopus subject areas

    • Computer Vision and Pattern Recognition
    • Hardware and Architecture
    • Electrical and Electronic Engineering

    Cite this

    Mukherjee, D., Memon, N., & Said, A. (2001). JPEG-matched MRC compression of compound documents. In IEEE International Conference on Image Processing (Vol. 3, pp. 434-437)

    JPEG-matched MRC compression of compound documents. / Mukherjee, D.; Memon, N.; Said, A.

    IEEE International Conference on Image Processing. Vol. 3 2001. p. 434-437.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Mukherjee, D, Memon, N & Said, A 2001, JPEG-matched MRC compression of compound documents. in IEEE International Conference on Image Processing. vol. 3, pp. 434-437, IEEE International Conference on Image Processing (ICIP), Thessaloniki, Greece, 10/7/01.
    Mukherjee D, Memon N, Said A. JPEG-matched MRC compression of compound documents. In IEEE International Conference on Image Processing. Vol. 3. 2001. p. 434-437
    Mukherjee, D. ; Memon, N. ; Said, A. / JPEG-matched MRC compression of compound documents. IEEE International Conference on Image Processing. Vol. 3 2001. pp. 434-437
    @inproceedings{5fed10eb27174d07ae0ae2a0b6cbc3f7,
    title = "JPEG-matched MRC compression of compound documents",
    abstract = "Mixed Raster Content (MRC) is an ITU document compression standard (T.44) specifying both a model for multilayer representation of a compound document, and a set of allowable standardized coders for the individual layers. The model requires decomposition of a document into two image layers and a binary mask layer, but the standard does not recommend any procedure for this task. For best compression results, the decomposition method should be optimized for the layer encoders. In this paper, a high performance MRC compound document codec is presented, where the layer decomposition scheme is matched to the JPEG encoder with arithmetic coding for the foreground and background image layers. JBIG is used to code the mask layer. Integrated noise removal routines enable handling of scanned documents along with electronic ones. Resolution scalable decoding features are also implemented. The page segmenter yields a segmentation mask, which serves to separate text and other features.",
    author = "D. Mukherjee and N. Memon and A. Said",
    year = "2001",
    language = "English (US)",
    volume = "3",
    pages = "434--437",
    booktitle = "IEEE International Conference on Image Processing",

    }

    TY - GEN

    T1 - JPEG-matched MRC compression of compound documents

    AU - Mukherjee, D.

    AU - Memon, N.

    AU - Said, A.

    PY - 2001

    Y1 - 2001

    N2 - Mixed Raster Content (MRC) is an ITU document compression standard (T.44) specifying both a model for multilayer representation of a compound document, and a set of allowable standardized coders for the individual layers. The model requires decomposition of a document into two image layers and a binary mask layer, but the standard does not recommend any procedure for this task. For best compression results, the decomposition method should be optimized for the layer encoders. In this paper, a high performance MRC compound document codec is presented, where the layer decomposition scheme is matched to the JPEG encoder with arithmetic coding for the foreground and background image layers. JBIG is used to code the mask layer. Integrated noise removal routines enable handling of scanned documents along with electronic ones. Resolution scalable decoding features are also implemented. The page segmenter yields a segmentation mask, which serves to separate text and other features.

    AB - Mixed Raster Content (MRC) is an ITU document compression standard (T.44) specifying both a model for multilayer representation of a compound document, and a set of allowable standardized coders for the individual layers. The model requires decomposition of a document into two image layers and a binary mask layer, but the standard does not recommend any procedure for this task. For best compression results, the decomposition method should be optimized for the layer encoders. In this paper, a high performance MRC compound document codec is presented, where the layer decomposition scheme is matched to the JPEG encoder with arithmetic coding for the foreground and background image layers. JBIG is used to code the mask layer. Integrated noise removal routines enable handling of scanned documents along with electronic ones. Resolution scalable decoding features are also implemented. The page segmenter yields a segmentation mask, which serves to separate text and other features.

    UR - http://www.scopus.com/inward/record.url?scp=0035168598&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=0035168598&partnerID=8YFLogxK

    M3 - Conference contribution

    VL - 3

    SP - 434

    EP - 437

    BT - IEEE International Conference on Image Processing

    ER -