Structured Open Urban Data: Understanding the Landscape

Luciano Barbosa, Kien Pham, Claudio Silva, Marcos R. Vieira, Juliana Freire

Research output: Contribution to journalReview article

Abstract

A growing number of cities are now making urban data freely available to the public. Besides promoting transparency, these data can have a transformative effect in social science research as well as in how citizens participate in governance. These initiatives, however, are fairly recent and the landscape of open urban data is not well known. In this study, we try to shed some light on this through a detailed study of over 9,000 open data sets from 20 cities in North America. We start by presenting general statistics about the content, size, nature, and popularity of the different data sets, and then examine in more detail structured data sets that contain tabular data. Since a key benefit of having a large number of data sets available is the ability to fuse information, we investigate opportunities for data integration. We also study data quality issues and time-related aspects, namely, recency and change frequency. Our findings are encouraging in that most of the data are structured and published in standard formats that are easy to parse; there is ample opportunity to integrate different data sets; and the volume of data is increasing steadily. But they also uncovered a number of challenges that need to be addressed to enable these data to be fully leveraged. We discuss both our findings and issues involved in using open urban data.

Original languageEnglish (US)
Pages (from-to)144-154
Number of pages11
JournalBig Data
Volume2
Issue number3
DOIs
StatePublished - Sep 1 2014

Fingerprint

Social sciences
Data integration
Electric fuses
Transparency
Statistics

ASJC Scopus subject areas

  • Computer Science Applications
  • Information Systems
  • Information Systems and Management

Cite this

Structured Open Urban Data : Understanding the Landscape. / Barbosa, Luciano; Pham, Kien; Silva, Claudio; Vieira, Marcos R.; Freire, Juliana.

In: Big Data, Vol. 2, No. 3, 01.09.2014, p. 144-154.

Research output: Contribution to journalReview article

Barbosa, Luciano ; Pham, Kien ; Silva, Claudio ; Vieira, Marcos R. ; Freire, Juliana. / Structured Open Urban Data : Understanding the Landscape. In: Big Data. 2014 ; Vol. 2, No. 3. pp. 144-154.
@article{55a470f182ae490e851fa7ae818ebe58,
title = "Structured Open Urban Data: Understanding the Landscape",
abstract = "A growing number of cities are now making urban data freely available to the public. Besides promoting transparency, these data can have a transformative effect in social science research as well as in how citizens participate in governance. These initiatives, however, are fairly recent and the landscape of open urban data is not well known. In this study, we try to shed some light on this through a detailed study of over 9,000 open data sets from 20 cities in North America. We start by presenting general statistics about the content, size, nature, and popularity of the different data sets, and then examine in more detail structured data sets that contain tabular data. Since a key benefit of having a large number of data sets available is the ability to fuse information, we investigate opportunities for data integration. We also study data quality issues and time-related aspects, namely, recency and change frequency. Our findings are encouraging in that most of the data are structured and published in standard formats that are easy to parse; there is ample opportunity to integrate different data sets; and the volume of data is increasing steadily. But they also uncovered a number of challenges that need to be addressed to enable these data to be fully leveraged. We discuss both our findings and issues involved in using open urban data.",
author = "Luciano Barbosa and Kien Pham and Claudio Silva and Vieira, {Marcos R.} and Juliana Freire",
year = "2014",
month = "9",
day = "1",
doi = "10.1089/big.2014.0020",
language = "English (US)",
volume = "2",
pages = "144--154",
journal = "Big Data",
issn = "2167-6461",
publisher = "Mary Ann Liebert Inc.",
number = "3",

}

TY - JOUR

T1 - Structured Open Urban Data

T2 - Understanding the Landscape

AU - Barbosa, Luciano

AU - Pham, Kien

AU - Silva, Claudio

AU - Vieira, Marcos R.

AU - Freire, Juliana

PY - 2014/9/1

Y1 - 2014/9/1

N2 - A growing number of cities are now making urban data freely available to the public. Besides promoting transparency, these data can have a transformative effect in social science research as well as in how citizens participate in governance. These initiatives, however, are fairly recent and the landscape of open urban data is not well known. In this study, we try to shed some light on this through a detailed study of over 9,000 open data sets from 20 cities in North America. We start by presenting general statistics about the content, size, nature, and popularity of the different data sets, and then examine in more detail structured data sets that contain tabular data. Since a key benefit of having a large number of data sets available is the ability to fuse information, we investigate opportunities for data integration. We also study data quality issues and time-related aspects, namely, recency and change frequency. Our findings are encouraging in that most of the data are structured and published in standard formats that are easy to parse; there is ample opportunity to integrate different data sets; and the volume of data is increasing steadily. But they also uncovered a number of challenges that need to be addressed to enable these data to be fully leveraged. We discuss both our findings and issues involved in using open urban data.

AB - A growing number of cities are now making urban data freely available to the public. Besides promoting transparency, these data can have a transformative effect in social science research as well as in how citizens participate in governance. These initiatives, however, are fairly recent and the landscape of open urban data is not well known. In this study, we try to shed some light on this through a detailed study of over 9,000 open data sets from 20 cities in North America. We start by presenting general statistics about the content, size, nature, and popularity of the different data sets, and then examine in more detail structured data sets that contain tabular data. Since a key benefit of having a large number of data sets available is the ability to fuse information, we investigate opportunities for data integration. We also study data quality issues and time-related aspects, namely, recency and change frequency. Our findings are encouraging in that most of the data are structured and published in standard formats that are easy to parse; there is ample opportunity to integrate different data sets; and the volume of data is increasing steadily. But they also uncovered a number of challenges that need to be addressed to enable these data to be fully leveraged. We discuss both our findings and issues involved in using open urban data.

UR - http://www.scopus.com/inward/record.url?scp=84991810061&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84991810061&partnerID=8YFLogxK

U2 - 10.1089/big.2014.0020

DO - 10.1089/big.2014.0020

M3 - Review article

AN - SCOPUS:84991810061

VL - 2

SP - 144

EP - 154

JO - Big Data

JF - Big Data

SN - 2167-6461

IS - 3

ER -