Packing experiments for sharing and publication

Fernando Chirigati, Dennis Shasha, Juliana Freire

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Reproducibility is a core component of the scientific process. Revisiting and reusing past results allow science to move forward - "standing on the shoulders of giants", as Newton once said. An impediment to the adoption of computational reproducibility is that authors find it difficult to generate a compendium that encompasses all the required components to correctly reproduce their experiments. Even when a compendium is available, reviewers and readers may have difficulties in verifying the results on platforms different from the ones where the experiments were originally run. As a step towards simplifying the process of creating reproducible experiments, we have developed ReproZip, a tool that automatically captures the provenance of experiments and packs all the necessary files, library dependencies and variables to reproduce the results. Reviewers can then unpack and run the experiments without having to install any additional software. We will demonstrate real use cases for ReproZip, how packages are created, and how reviewers can validate and explore experiments.

Original language: English (US)
Title of host publication: SIGMOD 2013 - International Conference on Management of Data
Pages: 977-980
Number of pages: 4
DOI: https://doi.org/10.1145/2463676.2465269
State: Published - 2013
Event: 2013 ACM SIGMOD Conference on Management of Data, SIGMOD 2013 - New York, NY, United States
Duration: Jun 22 2013 to Jun 27 2013

Other

Other: 2013 ACM SIGMOD Conference on Management of Data, SIGMOD 2013
Country: United States
City: New York, NY
Period: 6/22/13 to 6/27/13

Keywords

  • Computational Reproducibility
  • Provenance
  • ReproZip

ASJC Scopus subject areas

  • Software
  • Information Systems

Cite this

Chirigati, F., Shasha, D., & Freire, J. (2013). Packing experiments for sharing and publication. In SIGMOD 2013 - International Conference on Management of Data (pp. 977-980) https://doi.org/10.1145/2463676.2465269

@inproceedings{ad6034d63c0d4a768f275479da8f4bcc,
title = "Packing experiments for sharing and publication",
abstract = "Reproducibility is a core component of the scientific process. Revisiting and reusing past results allow science to move forward - {"}standing on the shoulders of giants{"}, as Newton once said. An impediment to the adoption of computational reproducibility is that authors find it difficult to generate a compendium that encompasses all the required components to correctly reproduce their experiments. Even when a compendium is available, reviewers and readers may have difficulties in verifying the results on platforms different from the ones where the experiments were originally run. As a step towards simplifying the process of creating reproducible experiments, we have developed ReproZip, a tool that automatically captures the provenance of experiments and packs all the necessary files, library dependencies and variables to reproduce the results. Reviewers can then unpack and run the experiments without having to install any additional software. We will demonstrate real use cases for ReproZip, how packages are created, and how reviewers can validate and explore experiments.",
keywords = "Computational Reproducibility, Provenance, ReproZip",
author = "Fernando Chirigati and Dennis Shasha and Juliana Freire",
year = "2013",
doi = "10.1145/2463676.2465269",
language = "English (US)",
isbn = "9781450320375",
pages = "977--980",
booktitle = "SIGMOD 2013 - International Conference on Management of Data",

}
