IMAX: Incremental maintenance of schema-based XML statistics

Maya Ramanath, Lingzhi Zhang, Juliana Freire, Jayant R. Haritsa

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Current approaches for estimating the cardinality of XML queries are applicable to a static scenario wherein the underlying XML data does not change subsequent to the collection of statistics on the repository. However, in practice, many XML-based applications are dynamic and involve frequent updates to the data. In this paper, we investigate efficient strategies for incrementally maintaining statistical summaries as and when updates are applied to the data. Specifically, we propose algorithms that handle both the addition of new documents as well as random insertions in the existing document trees. We also show, through a detailed performance evaluation, that our incremental techniques are significantly faster than the naive recomputation approach; and that estimation accuracy can be maintained even with a fixed memory budget.

Original language: English (US)
Title of host publication: Proceedings - 21st International Conference on Data Engineering, ICDE 2005
Pages: 273-284
Number of pages: 12
DOI: https://doi.org/10.1109/ICDE.2005.75
State: Published - 2005
Event: 21st International Conference on Data Engineering, ICDE 2005 - Tokyo, Japan
Duration: Apr 5 2005 to Apr 8 2005

Other

Other: 21st International Conference on Data Engineering, ICDE 2005
Country: Japan
City: Tokyo
Period: 4/5/05 to 4/8/05

Fingerprint

XML
Statistics
Data storage equipment

ASJC Scopus subject areas

  • Software
  • Engineering(all)
  • Engineering (miscellaneous)

Cite this

Ramanath, M., Zhang, L., Freire, J., & Haritsa, J. R. (2005). IMAX: Incremental maintenance of schema-based XML statistics. In Proceedings - 21st International Conference on Data Engineering, ICDE 2005 (pp. 273-284) https://doi.org/10.1109/ICDE.2005.75

@inproceedings{60447f23b9da4c28a9ea1569a5f2b5e7,
title = "IMAX: Incremental maintenance of schema-based XML statistics",
abstract = "Current approaches for estimating the cardinality of XML queries are applicable to a static scenario wherein the underlying XML data does not change subsequent to the collection of statistics on the repository. However, in practice, many XML-based applications are dynamic and involve frequent updates to the data. In this paper, we investigate efficient strategies for incrementally maintaining statistical summaries as and when updates are applied to the data. Specifically, we propose algorithms that handle both the addition of new documents as well as random insertions in the existing document trees. We also show, through a detailed performance evaluation, that our incremental techniques are significantly faster than the naive recomputation approach; and that estimation accuracy can be maintained even with a fixed memory budget.",
author = "Maya Ramanath and Lingzhi Zhang and Juliana Freire and Haritsa, {Jayant R.}",
year = "2005",
doi = "10.1109/ICDE.2005.75",
language = "English (US)",
isbn = "0769522858",
pages = "273--284",
booktitle = "Proceedings - 21st International Conference on Data Engineering, ICDE 2005",

}

TY - GEN
T1 - IMAX
T2 - Incremental maintenance of schema-based XML statistics
AU - Ramanath, Maya
AU - Zhang, Lingzhi
AU - Freire, Juliana
AU - Haritsa, Jayant R.
PY - 2005
Y1 - 2005
AB - Current approaches for estimating the cardinality of XML queries are applicable to a static scenario wherein the underlying XML data does not change subsequent to the collection of statistics on the repository. However, in practice, many XML-based applications are dynamic and involve frequent updates to the data. In this paper, we investigate efficient strategies for incrementally maintaining statistical summaries as and when updates are applied to the data. Specifically, we propose algorithms that handle both the addition of new documents as well as random insertions in the existing document trees. We also show, through a detailed performance evaluation, that our incremental techniques are significantly faster than the naive recomputation approach; and that estimation accuracy can be maintained even with a fixed memory budget.
UR - http://www.scopus.com/inward/record.url?scp=28444460689&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=28444460689&partnerID=8YFLogxK
U2 - 10.1109/ICDE.2005.75
DO - 10.1109/ICDE.2005.75
M3 - Conference contribution
SN - 0769522858
SP - 273
EP - 284
BT - Proceedings - 21st International Conference on Data Engineering, ICDE 2005
ER -