Querying structured information sources on the Web

Sergio Mergen, Juliana Freire, Carlos A. Heuser

Research output: Contribution to journal > Article

Abstract

To provide access to heterogeneous data distributed over the Web, we propose a solution that merges the expressiveness of information integration systems with the flexibility found in dataspace-aware search engines. Our approach requires neither a mediated schema nor source mappings. In the absence of a mediated schema, the user formulates structured queries based on what she expects to find. We demonstrate the feasibility of this approach by providing a query interface for integrating hundreds of (real) structured Web information sources. We also discuss experimental results which indicate that our query rewriting algorithm is both effective and scalable.

Original language: English (US)
Pages (from-to): 208-221
Number of pages: 14
Journal: International Journal of Metadata, Semantics and Ontologies
Volume: 5
Issue number: 3
DOI: 10.1504/IJMSO.2010.034045
State: Published - Jul 2010

Keywords

  • Dataspaces
  • Information integration
  • Query rewriting
  • Search engines

ASJC Scopus subject areas

  • Computer Science Applications
  • Library and Information Sciences
  • Information Systems

Cite this

Querying structured information sources on the Web. / Mergen, Sergio; Freire, Juliana; Heuser, Carlos A.

In: International Journal of Metadata, Semantics and Ontologies, Vol. 5, No. 3, 07.2010, p. 208-221.

@article{8d0e87b76a4d405285313c4850baf1a8,
title = "Querying structured information sources on the Web",
abstract = "To provide access to heterogeneous data distributed over the Web, we propose a solution that merges the expressiveness of information integration systems with the flexibility found in dataspace-aware search engines. Our approach requires neither a mediated schema nor source mappings. In the absence of a mediated schema, the user formulates structured queries based on what she expects to find. We demonstrate the feasibility of this approach by providing a query interface for integrating hundreds of (real) structured Web information sources. We also discuss experimental results which indicate that our query rewriting algorithm is both effective and scalable.",
keywords = "Dataspaces, Information integration, Query rewriting, Search engines",
author = "Mergen, Sergio and Freire, Juliana and Heuser, {Carlos A.}",
year = "2010",
month = "7",
doi = "10.1504/IJMSO.2010.034045",
language = "English (US)",
volume = "5",
pages = "208--221",
journal = "International Journal of Metadata, Semantics and Ontologies",
issn = "1744-2621",
publisher = "Inderscience Enterprises Ltd",
number = "3",
}

TY - JOUR
T1 - Querying structured information sources on the Web
AU - Mergen, Sergio
AU - Freire, Juliana
AU - Heuser, Carlos A.
PY - 2010/7
Y1 - 2010/7
N2 - To provide access to heterogeneous data distributed over the Web, we propose a solution that merges the expressiveness of information integration systems with the flexibility found in dataspace-aware search engines. Our approach requires neither a mediated schema nor source mappings. In the absence of a mediated schema, the user formulates structured queries based on what she expects to find. We demonstrate the feasibility of this approach by providing a query interface for integrating hundreds of (real) structured Web information sources. We also discuss experimental results which indicate that our query rewriting algorithm is both effective and scalable.
AB - To provide access to heterogeneous data distributed over the Web, we propose a solution that merges the expressiveness of information integration systems with the flexibility found in dataspace-aware search engines. Our approach requires neither a mediated schema nor source mappings. In the absence of a mediated schema, the user formulates structured queries based on what she expects to find. We demonstrate the feasibility of this approach by providing a query interface for integrating hundreds of (real) structured Web information sources. We also discuss experimental results which indicate that our query rewriting algorithm is both effective and scalable.
KW - Dataspaces
KW - Information integration
KW - Query rewriting
KW - Search engines
UR - http://www.scopus.com/inward/record.url?scp=77954429218&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77954429218&partnerID=8YFLogxK
U2 - 10.1504/IJMSO.2010.034045
DO - 10.1504/IJMSO.2010.034045
M3 - Article
VL - 5
SP - 208
EP - 221
JO - International Journal of Metadata, Semantics and Ontologies
JF - International Journal of Metadata, Semantics and Ontologies
SN - 1744-2621
IS - 3
ER -