DjVu document browsing with on-demand loading and rendering of image components

Yann LeCun, Léon Bottou, Andrei Erofeev, Patrick Haffner, Bill Riemers

Research output: Chapter in Book/Report/Conference proceeding (Conference contribution)

Abstract

Image-based digital documents are composed of multiple pages, each of which may be composed of multiple components such as text, pictures, background, and annotations. We describe the image structure and software architecture that allow the DjVu system to load and render the required components on demand while minimizing both bandwidth requirements and memory requirements in the client. DjVu document files are merely a list of enriched URLs that point to individual files (or file elements) that contain image components. Image components include text images, background images, shape dictionaries shared by multiple pages, OCRed text, and several types of annotations. A multithreaded software architecture with smart caching allows individual components to be loaded, pre-decoded, and rendered on demand. Pages are pre-fetched or loaded on demand, allowing users to randomly access pages without downloading the entire document and without the help of a byte server. Components that are shared across pages (e.g., shape dictionaries or background layers) are loaded as required and cached, which greatly reduces the overall bandwidth requirements. Shared dictionaries typically reduce file size by 40% for scanned bitonal documents at 300 dpi. Compression ratios on scanned US patents at 300 dpi are 5.2 to 10.2 times higher than Group IV with shared dictionaries and 3.6 to 8.5 times higher than Group IV without shared dictionaries.
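
The loading scheme described in the abstract can be illustrated with a minimal sketch. It is not the DjVu implementation: the index layout, the component file names, and the helpers `ComponentCache`, `load_page`, `prefetch`, and `fake_fetch` are all hypothetical, and a stand-in function replaces real network access. It only shows the idea that a document is an index of component references, that a page pulls in just the components it needs, and that shared components are fetched once and then served from a cache.

```python
from concurrent.futures import ThreadPoolExecutor
from threading import Lock

# Hypothetical document index: each page lists the component files it needs.
# These names are illustrative only, not real DjVu component identifiers.
DOCUMENT_INDEX = [
    {"page": 1, "components": ["dict_shared.bin", "p1_text.bin", "p1_bg.bin"]},
    {"page": 2, "components": ["dict_shared.bin", "p2_text.bin", "p2_bg.bin"]},
    {"page": 3, "components": ["dict_shared.bin", "p3_text.bin"]},
]


class ComponentCache:
    """Fetch each component at most once and share it across pages."""

    def __init__(self, fetch):
        self._fetch = fetch      # callable: url -> bytes
        self._store = {}
        self._lock = Lock()
        self.fetch_count = 0     # number of real fetches performed

    def get(self, url):
        # Holding the lock during the fetch serializes downloads.
        # That keeps the sketch simple; it is not tuned for throughput.
        with self._lock:
            if url not in self._store:
                self._store[url] = self._fetch(url)
                self.fetch_count += 1
            return self._store[url]


def load_page(index, cache, page_number):
    """Load only the components that the requested page references."""
    entry = next(e for e in index if e["page"] == page_number)
    return {url: cache.get(url) for url in entry["components"]}


def prefetch(index, cache, page_numbers):
    """Warm the cache for several pages using worker threads."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        list(pool.map(lambda n: load_page(index, cache, n), page_numbers))


def fake_fetch(url):
    """Stand-in for a network request (assumption: no real I/O here)."""
    return url.encode("utf-8")


if __name__ == "__main__":
    cache = ComponentCache(fake_fetch)
    load_page(DOCUMENT_INDEX, cache, 3)      # random access to page 3
    print("fetches after page 3:", cache.fetch_count)
    prefetch(DOCUMENT_INDEX, cache, [1, 2])  # shared dictionary is reused
    print("fetches after prefetch:", cache.fetch_count)
```

In this sketch, jumping straight to page 3 fetches only that page's two components, and the later prefetch of pages 1 and 2 reuses the cached shared dictionary rather than downloading it again.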

Original language: English (US)
Title of host publication: Proceedings of SPIE - The International Society for Optical Engineering
Editors: G.B. Beretta, R. Schettini
Pages: 7-16
Number of pages: 10
Volume: 4311
DOI: https://doi.org/10.1117/12.411887
State: Published - 2001
Event: Internet Imaging II - San Jose, CA, United States
Duration: Jan 24 2001 to Jan 26 2001

Other

Other: Internet Imaging II
Country: United States
City: San Jose, CA
Period: 1/24/01 to 1/26/01

Keywords

  • Digital libraries
  • DjVu
  • Document imaging
  • Scanning
  • Segmentation
  • Wavelets

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Condensed Matter Physics

Cite this

LeCun, Y., Bottou, L., Erofeev, A., Haffner, P., & Riemers, B. (2001). DjVu document browsing with on-demand loading and rendering of image components. In G. B. Beretta & R. Schettini (Eds.), Proceedings of SPIE - The International Society for Optical Engineering (Vol. 4311, pp. 7-16). https://doi.org/10.1117/12.411887
