Is this real? Generating synthetic data that looks real

Miro Mannino, Azza Abouzied

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Synner is a tool that helps users generate real-looking synthetic data by visually and declaratively specifying the properties of the dataset such as each field's statistical distribution, its domain, and its relationship to other fields. It provides instant feedback on every user interaction by updating multiple visualizations of the generated dataset and even suggests data generation specifications from a few user examples and interactions. Synner visually communicates the inherent randomness of statistical data generation. Our evaluation of Synner demonstrates its effectiveness at generating realistic data when compared with Mockaroo, a popular data generation tool, and with hired developers who coded data generation scripts for a fee.

Original languageEnglish (US)
Title of host publicationUIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology
PublisherAssociation for Computing Machinery, Inc
Pages549-561
Number of pages13
ISBN (Electronic)9781450368162
DOIs
StatePublished - Oct 17 2019
Event32nd Annual ACM Symposium on User Interface Software and Technology, UIST 2019 - New Orleans, United States
Duration: Oct 20 2019Oct 23 2019

Publication series

NameUIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology

Conference

Conference32nd Annual ACM Symposium on User Interface Software and Technology, UIST 2019
CountryUnited States
CityNew Orleans
Period10/20/1910/23/19

Fingerprint

Visualization
Specifications
Feedback

Keywords

  • Data generation
  • Mixed-initiative UI
  • Uncertainty visualization

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Software

Cite this

Mannino, M., & Abouzied, A. (2019). Is this real? Generating synthetic data that looks real. In UIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology (pp. 549-561). (UIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology). Association for Computing Machinery, Inc. https://doi.org/10.1145/3332165.3347866

Is this real? Generating synthetic data that looks real. / Mannino, Miro; Abouzied, Azza.

UIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology. Association for Computing Machinery, Inc, 2019. p. 549-561 (UIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Mannino, M & Abouzied, A 2019, Is this real? Generating synthetic data that looks real. in UIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology. UIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology, Association for Computing Machinery, Inc, pp. 549-561, 32nd Annual ACM Symposium on User Interface Software and Technology, UIST 2019, New Orleans, United States, 10/20/19. https://doi.org/10.1145/3332165.3347866
Mannino M, Abouzied A. Is this real? Generating synthetic data that looks real. In UIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology. Association for Computing Machinery, Inc. 2019. p. 549-561. (UIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology). https://doi.org/10.1145/3332165.3347866
Mannino, Miro ; Abouzied, Azza. / Is this real? Generating synthetic data that looks real. UIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology. Association for Computing Machinery, Inc, 2019. pp. 549-561 (UIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology).
@inproceedings{2c73c927752545aea0973ecb9247eaa7,
title = "Is this real? Generating synthetic data that looks real",
abstract = "Synner is a tool that helps users generate real-looking synthetic data by visually and declaratively specifying the properties of the dataset such as each field's statistical distribution, its domain, and its relationship to other fields. It provides instant feedback on every user interaction by updating multiple visualizations of the generated dataset and even suggests data generation specifications from a few user examples and interactions. Synner visually communicates the inherent randomness of statistical data generation. Our evaluation of Synner demonstrates its effectiveness at generating realistic data when compared with Mockaroo, a popular data generation tool, and with hired developers who coded data generation scripts for a fee.",
keywords = "Data generation, Mixed-initiative UI, Uncertainty visualization",
author = "Miro Mannino and Azza Abouzied",
year = "2019",
month = "10",
day = "17",
doi = "10.1145/3332165.3347866",
language = "English (US)",
series = "UIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology",
publisher = "Association for Computing Machinery, Inc",
pages = "549--561",
booktitle = "UIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology",

}

TY - GEN

T1 - Is this real? Generating synthetic data that looks real

AU - Mannino, Miro

AU - Abouzied, Azza

PY - 2019/10/17

Y1 - 2019/10/17

N2 - Synner is a tool that helps users generate real-looking synthetic data by visually and declaratively specifying the properties of the dataset such as each field's statistical distribution, its domain, and its relationship to other fields. It provides instant feedback on every user interaction by updating multiple visualizations of the generated dataset and even suggests data generation specifications from a few user examples and interactions. Synner visually communicates the inherent randomness of statistical data generation. Our evaluation of Synner demonstrates its effectiveness at generating realistic data when compared with Mockaroo, a popular data generation tool, and with hired developers who coded data generation scripts for a fee.

AB - Synner is a tool that helps users generate real-looking synthetic data by visually and declaratively specifying the properties of the dataset such as each field's statistical distribution, its domain, and its relationship to other fields. It provides instant feedback on every user interaction by updating multiple visualizations of the generated dataset and even suggests data generation specifications from a few user examples and interactions. Synner visually communicates the inherent randomness of statistical data generation. Our evaluation of Synner demonstrates its effectiveness at generating realistic data when compared with Mockaroo, a popular data generation tool, and with hired developers who coded data generation scripts for a fee.

KW - Data generation

KW - Mixed-initiative UI

KW - Uncertainty visualization

UR - http://www.scopus.com/inward/record.url?scp=85074861070&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85074861070&partnerID=8YFLogxK

U2 - 10.1145/3332165.3347866

DO - 10.1145/3332165.3347866

M3 - Conference contribution

AN - SCOPUS:85074861070

T3 - UIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology

SP - 549

EP - 561

BT - UIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology

PB - Association for Computing Machinery, Inc

ER -