Fine-grained provenance collection over scripts through program slicing

João Felipe Pimentel, Juliana Freire, Leonardo Murta, Vanessa Braganholo

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Collecting provenance from scripts is often useful for scientists to explain and reproduce their scientific experiments. However, most existing automatic approaches capture provenance at coarse-grain, for example, the trace of user-defined functions. These approaches lack information of variable dependencies. Without this information, users may struggle to identify which functions really influenced the results, leading to the creation of false-positive provenance links. To address this problem, we propose an approach that uses dynamic program slicing for gathering provenance of Python scripts. By capturing dependencies among variables, it is possible to expose execution paths inside functions and, consequently, to create a provenance graph that accurately represents the function activations and the results they affect.

Original languageEnglish (US)
Title of host publicationProvenance and Annotation of Data and Processes - 6th International Provenance and Annotation Workshop, IPAW 2016, Proceedings
PublisherSpringer Verlag
Pages199-203
Number of pages5
Volume9672
ISBN (Print)9783319405926
DOIs
StatePublished - 2016
Event6th International Provenance and Annotation Workshop, IPAW 2016 - McLean, United States
Duration: Jun 7 2016Jun 8 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9672
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other6th International Provenance and Annotation Workshop, IPAW 2016
CountryUnited States
CityMcLean
Period6/7/166/8/16

Fingerprint

Program Slicing
Provenance
Python
Activation Function
False Positive
Chemical activation
Trace
Path
Graph in graph theory
Experiments
Experiment

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Pimentel, J. F., Freire, J., Murta, L., & Braganholo, V. (2016). Fine-grained provenance collection over scripts through program slicing. In Provenance and Annotation of Data and Processes - 6th International Provenance and Annotation Workshop, IPAW 2016, Proceedings (Vol. 9672, pp. 199-203). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9672). Springer Verlag. https://doi.org/10.1007/978-3-319-40593-3_21

Fine-grained provenance collection over scripts through program slicing. / Pimentel, João Felipe; Freire, Juliana; Murta, Leonardo; Braganholo, Vanessa.

Provenance and Annotation of Data and Processes - 6th International Provenance and Annotation Workshop, IPAW 2016, Proceedings. Vol. 9672 Springer Verlag, 2016. p. 199-203 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9672).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Pimentel, JF, Freire, J, Murta, L & Braganholo, V 2016, Fine-grained provenance collection over scripts through program slicing. in Provenance and Annotation of Data and Processes - 6th International Provenance and Annotation Workshop, IPAW 2016, Proceedings. vol. 9672, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 9672, Springer Verlag, pp. 199-203, 6th International Provenance and Annotation Workshop, IPAW 2016, McLean, United States, 6/7/16. https://doi.org/10.1007/978-3-319-40593-3_21
Pimentel JF, Freire J, Murta L, Braganholo V. Fine-grained provenance collection over scripts through program slicing. In Provenance and Annotation of Data and Processes - 6th International Provenance and Annotation Workshop, IPAW 2016, Proceedings. Vol. 9672. Springer Verlag. 2016. p. 199-203. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-319-40593-3_21
Pimentel, João Felipe ; Freire, Juliana ; Murta, Leonardo ; Braganholo, Vanessa. / Fine-grained provenance collection over scripts through program slicing. Provenance and Annotation of Data and Processes - 6th International Provenance and Annotation Workshop, IPAW 2016, Proceedings. Vol. 9672 Springer Verlag, 2016. pp. 199-203 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{5f99bc0253ee42e6a56b04c606db9db7,
title = "Fine-grained provenance collection over scripts through program slicing",
abstract = "Collecting provenance from scripts is often useful for scientists to explain and reproduce their scientific experiments. However, most existing automatic approaches capture provenance at coarse-grain, for example, the trace of user-defined functions. These approaches lack information of variable dependencies. Without this information, users may struggle to identify which functions really influenced the results, leading to the creation of false-positive provenance links. To address this problem, we propose an approach that uses dynamic program slicing for gathering provenance of Python scripts. By capturing dependencies among variables, it is possible to expose execution paths inside functions and, consequently, to create a provenance graph that accurately represents the function activations and the results they affect.",
author = "Pimentel, {Jo{\~a}o Felipe} and Juliana Freire and Leonardo Murta and Vanessa Braganholo",
year = "2016",
doi = "10.1007/978-3-319-40593-3_21",
language = "English (US)",
isbn = "9783319405926",
volume = "9672",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "199--203",
booktitle = "Provenance and Annotation of Data and Processes - 6th International Provenance and Annotation Workshop, IPAW 2016, Proceedings",
address = "Germany",

}

TY - GEN

T1 - Fine-grained provenance collection over scripts through program slicing

AU - Pimentel, João Felipe

AU - Freire, Juliana

AU - Murta, Leonardo

AU - Braganholo, Vanessa

PY - 2016

Y1 - 2016

N2 - Collecting provenance from scripts is often useful for scientists to explain and reproduce their scientific experiments. However, most existing automatic approaches capture provenance at coarse-grain, for example, the trace of user-defined functions. These approaches lack information of variable dependencies. Without this information, users may struggle to identify which functions really influenced the results, leading to the creation of false-positive provenance links. To address this problem, we propose an approach that uses dynamic program slicing for gathering provenance of Python scripts. By capturing dependencies among variables, it is possible to expose execution paths inside functions and, consequently, to create a provenance graph that accurately represents the function activations and the results they affect.

AB - Collecting provenance from scripts is often useful for scientists to explain and reproduce their scientific experiments. However, most existing automatic approaches capture provenance at coarse-grain, for example, the trace of user-defined functions. These approaches lack information of variable dependencies. Without this information, users may struggle to identify which functions really influenced the results, leading to the creation of false-positive provenance links. To address this problem, we propose an approach that uses dynamic program slicing for gathering provenance of Python scripts. By capturing dependencies among variables, it is possible to expose execution paths inside functions and, consequently, to create a provenance graph that accurately represents the function activations and the results they affect.

UR - http://www.scopus.com/inward/record.url?scp=84976619648&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84976619648&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-40593-3_21

DO - 10.1007/978-3-319-40593-3_21

M3 - Conference contribution

AN - SCOPUS:84976619648

SN - 9783319405926

VL - 9672

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 199

EP - 203

BT - Provenance and Annotation of Data and Processes - 6th International Provenance and Annotation Workshop, IPAW 2016, Proceedings

PB - Springer Verlag

ER -