An urban data profiler

Daniel Castellani Ribeiro, Huy T. Vo, Juliana Freire, Cláudio T. Silva

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Large volumes of urban data are being made available through a variety of open portals. Besides promoting transparency, these data can bring benefits to government, science, citizens and industry. It is no longer a fantasy to ask "if you could know anything about a city, what do you want to know" and to ponder what could be done with that information. However, the great number and variety of datasets creates a new challenge: how to find relevant datasets. While existing portals provide search interfaces, these are often limited to keyword searches over the limited metadata associated each dataset, for example, attribute names and textual description. In this paper, we present a new tool, UrbanProfiler, that automatically extracts detailed information from datasets. This information includes attribute types, value distributions, and geographical information, which can be used to support complex search queries as well as visualizations that help users explore and obtain insight into the contents of a data collection. Besides describing the tool and its implementation, we present case studies that illustrate how the tool was used to explore a large open urban data repository.

Original languageEnglish (US)
Title of host publicationWWW 2015 Companion - Proceedings of the 24th International Conference on World Wide Web
PublisherAssociation for Computing Machinery, Inc
Pages1389-1394
Number of pages6
ISBN (Electronic)9781450334730
DOIs
StatePublished - May 18 2015
Event24th International Conference on World Wide Web, WWW 2015 - Florence, Italy
Duration: May 18 2015May 22 2015

Publication series

NameWWW 2015 Companion - Proceedings of the 24th International Conference on World Wide Web

Other

Other24th International Conference on World Wide Web, WWW 2015
CountryItaly
CityFlorence
Period5/18/155/22/15

    Fingerprint

Keywords

  • Automatic Type Detection
  • Dataset Analysis
  • Metadata Extractionl

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Software

Cite this

Ribeiro, D. C., Vo, H. T., Freire, J., & Silva, C. T. (2015). An urban data profiler. In WWW 2015 Companion - Proceedings of the 24th International Conference on World Wide Web (pp. 1389-1394). (WWW 2015 Companion - Proceedings of the 24th International Conference on World Wide Web). Association for Computing Machinery, Inc. https://doi.org/10.1145/2740908.2742135