The provenance of workflow upgrades

David Koop, Carlos E. Scheidegger, Juliana Freire, Cláudio T. Silva

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Provenance has become an increasingly important part of documenting, verifying, and reproducing scientific research, but as users seek to extend or share results, it may be impractical to start from the exact original steps due to system configuration differences, library updates, or new algorithms. Although there have been several approaches for capturing workflow provenance, the problem of managing upgrades of the underlying tools and libraries orchestrated by workflows has been largely overlooked. In this paper we consider the problem of maintaining and re-using the provenance of workflow upgrades. We propose different kinds of upgrades that can be applied, including automatic mechanisms, developer-specified, and user-defined. We show how to capture provenance from such upgrades and suggest how this provenance might be used to influence future upgrades. We also describe our implementation of these upgrade techniques.

Original languageEnglish (US)
Title of host publicationProvenance and Annotation of Data and Processes - Third International Provenance and Annotation Workshop, IPAW 2010, Revised Selected Papers
Pages2-16
Number of pages15
Volume6378 LNCS
DOIs
StatePublished - 2010
Event3rd International Provenance and Annotation Workshop, IPAW 2010 - Troy, NY, United States
Duration: Jun 15 2010Jun 16 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6378 LNCS
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other3rd International Provenance and Annotation Workshop, IPAW 2010
CountryUnited States
CityTroy, NY
Period6/15/106/16/10

Fingerprint

Provenance
Work Flow
Update
Configuration

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Koop, D., Scheidegger, C. E., Freire, J., & Silva, C. T. (2010). The provenance of workflow upgrades. In Provenance and Annotation of Data and Processes - Third International Provenance and Annotation Workshop, IPAW 2010, Revised Selected Papers (Vol. 6378 LNCS, pp. 2-16). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6378 LNCS). https://doi.org/10.1007/978-3-642-17819-1_2

The provenance of workflow upgrades. / Koop, David; Scheidegger, Carlos E.; Freire, Juliana; Silva, Cláudio T.

Provenance and Annotation of Data and Processes - Third International Provenance and Annotation Workshop, IPAW 2010, Revised Selected Papers. Vol. 6378 LNCS 2010. p. 2-16 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6378 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Koop, D, Scheidegger, CE, Freire, J & Silva, CT 2010, The provenance of workflow upgrades. in Provenance and Annotation of Data and Processes - Third International Provenance and Annotation Workshop, IPAW 2010, Revised Selected Papers. vol. 6378 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 6378 LNCS, pp. 2-16, 3rd International Provenance and Annotation Workshop, IPAW 2010, Troy, NY, United States, 6/15/10. https://doi.org/10.1007/978-3-642-17819-1_2
Koop D, Scheidegger CE, Freire J, Silva CT. The provenance of workflow upgrades. In Provenance and Annotation of Data and Processes - Third International Provenance and Annotation Workshop, IPAW 2010, Revised Selected Papers. Vol. 6378 LNCS. 2010. p. 2-16. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-642-17819-1_2
Koop, David ; Scheidegger, Carlos E. ; Freire, Juliana ; Silva, Cláudio T. / The provenance of workflow upgrades. Provenance and Annotation of Data and Processes - Third International Provenance and Annotation Workshop, IPAW 2010, Revised Selected Papers. Vol. 6378 LNCS 2010. pp. 2-16 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{a4a2838584474e6b9db37c3725f12c35,
title = "The provenance of workflow upgrades",
abstract = "Provenance has become an increasingly important part of documenting, verifying, and reproducing scientific research, but as users seek to extend or share results, it may be impractical to start from the exact original steps due to system configuration differences, library updates, or new algorithms. Although there have been several approaches for capturing workflow provenance, the problem of managing upgrades of the underlying tools and libraries orchestrated by workflows has been largely overlooked. In this paper we consider the problem of maintaining and re-using the provenance of workflow upgrades. We propose different kinds of upgrades that can be applied, including automatic mechanisms, developer-specified, and user-defined. We show how to capture provenance from such upgrades and suggest how this provenance might be used to influence future upgrades. We also describe our implementation of these upgrade techniques.",
author = "David Koop and Scheidegger, {Carlos E.} and Juliana Freire and Silva, {Cl{\'a}udio T.}",
year = "2010",
doi = "10.1007/978-3-642-17819-1_2",
language = "English (US)",
isbn = "3642178189",
volume = "6378 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "2--16",
booktitle = "Provenance and Annotation of Data and Processes - Third International Provenance and Annotation Workshop, IPAW 2010, Revised Selected Papers",

}

TY - GEN

T1 - The provenance of workflow upgrades

AU - Koop, David

AU - Scheidegger, Carlos E.

AU - Freire, Juliana

AU - Silva, Cláudio T.

PY - 2010

Y1 - 2010

N2 - Provenance has become an increasingly important part of documenting, verifying, and reproducing scientific research, but as users seek to extend or share results, it may be impractical to start from the exact original steps due to system configuration differences, library updates, or new algorithms. Although there have been several approaches for capturing workflow provenance, the problem of managing upgrades of the underlying tools and libraries orchestrated by workflows has been largely overlooked. In this paper we consider the problem of maintaining and re-using the provenance of workflow upgrades. We propose different kinds of upgrades that can be applied, including automatic mechanisms, developer-specified, and user-defined. We show how to capture provenance from such upgrades and suggest how this provenance might be used to influence future upgrades. We also describe our implementation of these upgrade techniques.

AB - Provenance has become an increasingly important part of documenting, verifying, and reproducing scientific research, but as users seek to extend or share results, it may be impractical to start from the exact original steps due to system configuration differences, library updates, or new algorithms. Although there have been several approaches for capturing workflow provenance, the problem of managing upgrades of the underlying tools and libraries orchestrated by workflows has been largely overlooked. In this paper we consider the problem of maintaining and re-using the provenance of workflow upgrades. We propose different kinds of upgrades that can be applied, including automatic mechanisms, developer-specified, and user-defined. We show how to capture provenance from such upgrades and suggest how this provenance might be used to influence future upgrades. We also describe our implementation of these upgrade techniques.

UR - http://www.scopus.com/inward/record.url?scp=78651060363&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78651060363&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-17819-1_2

DO - 10.1007/978-3-642-17819-1_2

M3 - Conference contribution

SN - 3642178189

SN - 9783642178184

VL - 6378 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 2

EP - 16

BT - Provenance and Annotation of Data and Processes - Third International Provenance and Annotation Workshop, IPAW 2010, Revised Selected Papers

ER -