A flexible infrastructure for gathering XML statistics and estimating query cardinality

Juliana Freire, Maya Ramanath, Lingzhi Zhang

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper discusses the process of collecting XML statistics and estimating the cardinality of user queries. Cardinality estimates are needed in a variety of tasks, including query optimization and cost-based storage design, and they can also give users early feedback about the expected outcome of their queries. The result estimator, StatiX, uses specialized data structures and estimation algorithms. It uses histograms to uniformly capture the structural and value skew present in documents, and it also leverages schema information to produce concise, high-quality statistical summaries.
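
To make the role of histograms in cardinality estimation concrete, here is a minimal Python sketch of a generic equi-width histogram that estimates how many values satisfy a range predicate. It is a textbook technique shown under stated assumptions, not the StatiX data structures or algorithms, which the abstract does not describe in detail. The names `EquiWidthHistogram` and `build_histogram` and the sample values are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class EquiWidthHistogram:
    """Hypothetical summary: per-bucket counts over the value range [lo, hi]."""
    lo: float
    hi: float
    counts: List[int]

    @property
    def width(self) -> float:
        return (self.hi - self.lo) / len(self.counts)

    def estimate_range(self, a: float, b: float) -> float:
        """Estimate how many values fall in [a, b).

        Assumes values are spread uniformly inside each bucket, so a bucket
        contributes its count scaled by the fraction of it that [a, b) covers.
        """
        total = 0.0
        for i, count in enumerate(self.counts):
            bucket_lo = self.lo + i * self.width
            bucket_hi = bucket_lo + self.width
            overlap = max(0.0, min(b, bucket_hi) - max(a, bucket_lo))
            total += count * overlap / self.width
        return total


def build_histogram(values: Sequence[float], lo: float, hi: float,
                    buckets: int) -> EquiWidthHistogram:
    """Count values per equal-width bucket; values outside [lo, hi] are skipped."""
    counts = [0] * buckets
    width = (hi - lo) / buckets
    for v in values:
        if lo <= v <= hi:
            # Clamp so that v == hi lands in the last bucket.
            idx = min(int((v - lo) / width), buckets - 1)
            counts[idx] += 1
    return EquiWidthHistogram(lo, hi, counts)


# Hypothetical skewed values, e.g. the contents of <price> elements.
prices = [5, 7, 8, 9, 12, 40, 95]
hist = build_histogram(prices, lo=0, hi=100, buckets=4)
estimate = hist.estimate_range(0, 50)  # estimated matches for price < 50
```

Because most of the sample values fall in the first bucket, the per-bucket counts record the skew that a single overall average would hide. The cost is the uniformity assumption inside each bucket, which is where such an estimate can diverge from the true count.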

Original language: English (US)
Title of host publication: Proceedings - 20th International Conference on Data Engineering - ICDE 2004
Pages: 857
Number of pages: 1
Volume: 20
DOI: https://doi.org/10.1109/ICDE.2004.1320085
State: Published - 2004
Event: Proceedings - 20th International Conference on Data Engineering - ICDE 2004 - Boston, MA, United States
Duration: Mar 30 2004 to Apr 2 2004

Other

Other: Proceedings - 20th International Conference on Data Engineering - ICDE 2004
Country: United States
City: Boston, MA
Period: 3/30/04 to 4/2/04

Fingerprint

XML
Statistics
Data structures
Feedback
Costs

ASJC Scopus subject areas

  • Software
  • Engineering (all)
  • Engineering (miscellaneous)

Cite this

Freire, J., Ramanath, M., & Zhang, L. (2004). A flexible infrastructure for gathering XML statistics and estimating query cardinality. In Proceedings - 20th International Conference on Data Engineering - ICDE 2004 (Vol. 20, p. 857). https://doi.org/10.1109/ICDE.2004.1320085
