No Workflow: Capturing and analyzing provenance of scripts

Leonardo Murta, Vanessa Braganholo, Fernando Chirigati, David Koop, Juliana Freire

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We propose noWorkflow, a tool that transparently captures provenance of scripts and enables reproducibility. Unlike existing approaches, noWorkflow is non-intrusive and does not require users to change the way they work – users need not wrap their experiments in scientific workflow systems, install version control systems, or instrument their scripts. The tool leverages Software Engineering techniques, such as abstract syntax tree analysis, reflection, and profiling, to collect different types of provenance, including detailed information about the underlying libraries. We describe how noWorkflow captures multiple kinds of provenance and the different classes of analyses it supports: graph-based visualization; differencing over provenance trails; and inference queries.

Original languageEnglish (US)
Title of host publicationProvenance and Annotation of Data and Processes - 5th International Provenance and Annotation Workshop, IPAW 2014, Revised Selected Papers
PublisherSpringer Verlag
Pages71-83
Number of pages13
Volume8628
ISBN (Print)9783319164618
DOIs
StatePublished - 2015
Event5th International Provenance and Annotation Workshop, IPAW 2014 - Cologne, Germany
Duration: Jun 10 2014Jun 11 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8628
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other5th International Provenance and Annotation Workshop, IPAW 2014
CountryGermany
CityCologne
Period6/10/146/11/14

Fingerprint

Provenance
Work Flow
Software engineering
Visualization
Control systems
Scientific Workflow
Reproducibility
Profiling
Software Engineering
Leverage
Experiments
Control System
Query
Graph in graph theory
Experiment

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Murta, L., Braganholo, V., Chirigati, F., Koop, D., & Freire, J. (2015). No Workflow: Capturing and analyzing provenance of scripts. In Provenance and Annotation of Data and Processes - 5th International Provenance and Annotation Workshop, IPAW 2014, Revised Selected Papers (Vol. 8628, pp. 71-83). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 8628). Springer Verlag. https://doi.org/10.1007/978-3-319-16462-5_6

No Workflow : Capturing and analyzing provenance of scripts. / Murta, Leonardo; Braganholo, Vanessa; Chirigati, Fernando; Koop, David; Freire, Juliana.

Provenance and Annotation of Data and Processes - 5th International Provenance and Annotation Workshop, IPAW 2014, Revised Selected Papers. Vol. 8628 Springer Verlag, 2015. p. 71-83 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 8628).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Murta, L, Braganholo, V, Chirigati, F, Koop, D & Freire, J 2015, No Workflow: Capturing and analyzing provenance of scripts. in Provenance and Annotation of Data and Processes - 5th International Provenance and Annotation Workshop, IPAW 2014, Revised Selected Papers. vol. 8628, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 8628, Springer Verlag, pp. 71-83, 5th International Provenance and Annotation Workshop, IPAW 2014, Cologne, Germany, 6/10/14. https://doi.org/10.1007/978-3-319-16462-5_6
Murta L, Braganholo V, Chirigati F, Koop D, Freire J. No Workflow: Capturing and analyzing provenance of scripts. In Provenance and Annotation of Data and Processes - 5th International Provenance and Annotation Workshop, IPAW 2014, Revised Selected Papers. Vol. 8628. Springer Verlag. 2015. p. 71-83. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-319-16462-5_6
Murta, Leonardo ; Braganholo, Vanessa ; Chirigati, Fernando ; Koop, David ; Freire, Juliana. / No Workflow : Capturing and analyzing provenance of scripts. Provenance and Annotation of Data and Processes - 5th International Provenance and Annotation Workshop, IPAW 2014, Revised Selected Papers. Vol. 8628 Springer Verlag, 2015. pp. 71-83 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{af071c94f639496684e520981e2c0267,
title = "No Workflow: Capturing and analyzing provenance of scripts",
abstract = "We propose noWorkflow, a tool that transparently captures provenance of scripts and enables reproducibility. Unlike existing approaches, noWorkflow is non-intrusive and does not require users to change the way they work – users need not wrap their experiments in scientific workflow systems, install version control systems, or instrument their scripts. The tool leverages Software Engineering techniques, such as abstract syntax tree analysis, reflection, and profiling, to collect different types of provenance, including detailed information about the underlying libraries. We describe how noWorkflow captures multiple kinds of provenance and the different classes of analyses it supports: graph-based visualization; differencing over provenance trails; and inference queries.",
author = "Leonardo Murta and Vanessa Braganholo and Fernando Chirigati and David Koop and Juliana Freire",
year = "2015",
doi = "10.1007/978-3-319-16462-5_6",
language = "English (US)",
isbn = "9783319164618",
volume = "8628",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "71--83",
booktitle = "Provenance and Annotation of Data and Processes - 5th International Provenance and Annotation Workshop, IPAW 2014, Revised Selected Papers",

}

TY - GEN

T1 - No Workflow

T2 - Capturing and analyzing provenance of scripts

AU - Murta, Leonardo

AU - Braganholo, Vanessa

AU - Chirigati, Fernando

AU - Koop, David

AU - Freire, Juliana

PY - 2015

Y1 - 2015

N2 - We propose noWorkflow, a tool that transparently captures provenance of scripts and enables reproducibility. Unlike existing approaches, noWorkflow is non-intrusive and does not require users to change the way they work – users need not wrap their experiments in scientific workflow systems, install version control systems, or instrument their scripts. The tool leverages Software Engineering techniques, such as abstract syntax tree analysis, reflection, and profiling, to collect different types of provenance, including detailed information about the underlying libraries. We describe how noWorkflow captures multiple kinds of provenance and the different classes of analyses it supports: graph-based visualization; differencing over provenance trails; and inference queries.

AB - We propose noWorkflow, a tool that transparently captures provenance of scripts and enables reproducibility. Unlike existing approaches, noWorkflow is non-intrusive and does not require users to change the way they work – users need not wrap their experiments in scientific workflow systems, install version control systems, or instrument their scripts. The tool leverages Software Engineering techniques, such as abstract syntax tree analysis, reflection, and profiling, to collect different types of provenance, including detailed information about the underlying libraries. We describe how noWorkflow captures multiple kinds of provenance and the different classes of analyses it supports: graph-based visualization; differencing over provenance trails; and inference queries.

UR - http://www.scopus.com/inward/record.url?scp=84928821658&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84928821658&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-16462-5_6

DO - 10.1007/978-3-319-16462-5_6

M3 - Conference contribution

SN - 9783319164618

VL - 8628

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 71

EP - 83

BT - Provenance and Annotation of Data and Processes - 5th International Provenance and Annotation Workshop, IPAW 2014, Revised Selected Papers

PB - Springer Verlag

ER -