Data Polygamy: The many-many relationships among urban spatio-temporal data sets

Fernando Chirigati, Harish Doraiswamy, Theodoros Damoulas, Juliana Freire

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The increasing ability to collect data from urban environments, coupled with a push towards openness by governments, has resulted in the availability of numerous spatio-temporal data sets covering diverse aspects of a city. Discovering relationships between these data sets can produce new insights by enabling domain experts to not only test but also generate hypotheses. However, discovering these relationships is difficult. First, a relationship between two data sets may occur only at certain locations and/or time periods. Second, the sheer number and size of the data sets, coupled with the diverse spatial and temporal scales at which the data is available, presents computational challenges on all fronts, from indexing and querying to analyzing them. Finally, it is nontrivial to differentiate between meaningful and spurious relationships. To address these challenges, we propose Data Polygamy, a scalable topology-based framework that allows users to query for statistically significant relationships between spatio-temporal data sets. We have performed an experimental evaluation using over 300 spatial-temporal urban data sets which shows that our approach is scalable and effective at identifying interesting relationships.

Original languageEnglish (US)
Title of host publicationSIGMOD 2016 - Proceedings of the 2016 International Conference on Management of Data
PublisherAssociation for Computing Machinery
Pages1011-1025
Number of pages15
Volume26-June-2016
ISBN (Electronic)9781450335317
DOIs
StatePublished - Jun 26 2016
Event2016 ACM SIGMOD International Conference on Management of Data, SIGMOD 2016 - San Francisco, United States
Duration: Jun 26 2016Jul 1 2016

Other

Other2016 ACM SIGMOD International Conference on Management of Data, SIGMOD 2016
CountryUnited States
CitySan Francisco
Period6/26/167/1/16

Fingerprint

Topology
Availability

ASJC Scopus subject areas

  • Software
  • Information Systems

Cite this

Chirigati, F., Doraiswamy, H., Damoulas, T., & Freire, J. (2016). Data Polygamy: The many-many relationships among urban spatio-temporal data sets. In SIGMOD 2016 - Proceedings of the 2016 International Conference on Management of Data (Vol. 26-June-2016, pp. 1011-1025). Association for Computing Machinery. https://doi.org/10.1145/2882903.2915245

Data Polygamy : The many-many relationships among urban spatio-temporal data sets. / Chirigati, Fernando; Doraiswamy, Harish; Damoulas, Theodoros; Freire, Juliana.

SIGMOD 2016 - Proceedings of the 2016 International Conference on Management of Data. Vol. 26-June-2016 Association for Computing Machinery, 2016. p. 1011-1025.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Chirigati, F, Doraiswamy, H, Damoulas, T & Freire, J 2016, Data Polygamy: The many-many relationships among urban spatio-temporal data sets. in SIGMOD 2016 - Proceedings of the 2016 International Conference on Management of Data. vol. 26-June-2016, Association for Computing Machinery, pp. 1011-1025, 2016 ACM SIGMOD International Conference on Management of Data, SIGMOD 2016, San Francisco, United States, 6/26/16. https://doi.org/10.1145/2882903.2915245
Chirigati F, Doraiswamy H, Damoulas T, Freire J. Data Polygamy: The many-many relationships among urban spatio-temporal data sets. In SIGMOD 2016 - Proceedings of the 2016 International Conference on Management of Data. Vol. 26-June-2016. Association for Computing Machinery. 2016. p. 1011-1025 https://doi.org/10.1145/2882903.2915245
Chirigati, Fernando ; Doraiswamy, Harish ; Damoulas, Theodoros ; Freire, Juliana. / Data Polygamy : The many-many relationships among urban spatio-temporal data sets. SIGMOD 2016 - Proceedings of the 2016 International Conference on Management of Data. Vol. 26-June-2016 Association for Computing Machinery, 2016. pp. 1011-1025
@inproceedings{c9aa8956c9fb414aa3e5416f7dd9faec,
title = "Data Polygamy: The many-many relationships among urban spatio-temporal data sets",
abstract = "The increasing ability to collect data from urban environments, coupled with a push towards openness by governments, has resulted in the availability of numerous spatio-temporal data sets covering diverse aspects of a city. Discovering relationships between these data sets can produce new insights by enabling domain experts to not only test but also generate hypotheses. However, discovering these relationships is difficult. First, a relationship between two data sets may occur only at certain locations and/or time periods. Second, the sheer number and size of the data sets, coupled with the diverse spatial and temporal scales at which the data is available, presents computational challenges on all fronts, from indexing and querying to analyzing them. Finally, it is nontrivial to differentiate between meaningful and spurious relationships. To address these challenges, we propose Data Polygamy, a scalable topology-based framework that allows users to query for statistically significant relationships between spatio-temporal data sets. We have performed an experimental evaluation using over 300 spatial-temporal urban data sets which shows that our approach is scalable and effective at identifying interesting relationships.",
author = "Fernando Chirigati and Harish Doraiswamy and Theodoros Damoulas and Juliana Freire",
year = "2016",
month = "6",
day = "26",
doi = "10.1145/2882903.2915245",
language = "English (US)",
volume = "26-June-2016",
pages = "1011--1025",
booktitle = "SIGMOD 2016 - Proceedings of the 2016 International Conference on Management of Data",
publisher = "Association for Computing Machinery",

}

TY - GEN

T1 - Data Polygamy

T2 - The many-many relationships among urban spatio-temporal data sets

AU - Chirigati, Fernando

AU - Doraiswamy, Harish

AU - Damoulas, Theodoros

AU - Freire, Juliana

PY - 2016/6/26

Y1 - 2016/6/26

N2 - The increasing ability to collect data from urban environments, coupled with a push towards openness by governments, has resulted in the availability of numerous spatio-temporal data sets covering diverse aspects of a city. Discovering relationships between these data sets can produce new insights by enabling domain experts to not only test but also generate hypotheses. However, discovering these relationships is difficult. First, a relationship between two data sets may occur only at certain locations and/or time periods. Second, the sheer number and size of the data sets, coupled with the diverse spatial and temporal scales at which the data is available, presents computational challenges on all fronts, from indexing and querying to analyzing them. Finally, it is nontrivial to differentiate between meaningful and spurious relationships. To address these challenges, we propose Data Polygamy, a scalable topology-based framework that allows users to query for statistically significant relationships between spatio-temporal data sets. We have performed an experimental evaluation using over 300 spatial-temporal urban data sets which shows that our approach is scalable and effective at identifying interesting relationships.

AB - The increasing ability to collect data from urban environments, coupled with a push towards openness by governments, has resulted in the availability of numerous spatio-temporal data sets covering diverse aspects of a city. Discovering relationships between these data sets can produce new insights by enabling domain experts to not only test but also generate hypotheses. However, discovering these relationships is difficult. First, a relationship between two data sets may occur only at certain locations and/or time periods. Second, the sheer number and size of the data sets, coupled with the diverse spatial and temporal scales at which the data is available, presents computational challenges on all fronts, from indexing and querying to analyzing them. Finally, it is nontrivial to differentiate between meaningful and spurious relationships. To address these challenges, we propose Data Polygamy, a scalable topology-based framework that allows users to query for statistically significant relationships between spatio-temporal data sets. We have performed an experimental evaluation using over 300 spatial-temporal urban data sets which shows that our approach is scalable and effective at identifying interesting relationships.

UR - http://www.scopus.com/inward/record.url?scp=84979656076&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84979656076&partnerID=8YFLogxK

U2 - 10.1145/2882903.2915245

DO - 10.1145/2882903.2915245

M3 - Conference contribution

AN - SCOPUS:84979656076

VL - 26-June-2016

SP - 1011

EP - 1025

BT - SIGMOD 2016 - Proceedings of the 2016 International Conference on Management of Data

PB - Association for Computing Machinery

ER -