A first study on clustering collections of workflow graphs

Emanuele Santos, Lauro Lins, James P. Ahrens, Juliana Freire, Cláudio T. Silva

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

As workflow systems become more widely used, the number of workflows and the volume of provenance they generate have grown considerably. New tools and infrastructure are needed to allow users to interact with, reason about, and re-use this information. In this paper, we explore the use of clustering techniques to organize large collections of workflow and provenance graphs. We propose two different representations for these graphs and present an experimental evaluation, using a collection of 1,700 workflow graphs, where we study the trade-offs of these representations and the effectiveness of alternative clustering techniques.

Original language: English (US)
Title of host publication: Provenance and Annotation of Data and Processes - 2nd International Provenance and Annotation Workshop, IPAW 2008, Revised Selected Papers
Editors: Luc Moreau, Juliana Freire, David Koop
Publisher: Springer Verlag
Pages: 160-173
Number of pages: 14
ISBN (Print): 9783540899648
State: Published - Jan 1 2008
Event: 2nd International Provenance and Annotation Workshop, IPAW 2008 - Salt Lake City, United States
Duration: Jun 17 2008 → Jun 18 2008

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 5272
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 2nd International Provenance and Annotation Workshop, IPAW 2008
Country: United States
City: Salt Lake City
Period: 6/17/08 → 6/18/08

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science (all)

Cite this

Santos, E., Lins, L., Ahrens, J. P., Freire, J., & Silva, C. T. (2008). A first study on clustering collections of workflow graphs. In L. Moreau, J. Freire, & D. Koop (Eds.), Provenance and Annotation of Data and Processes - 2nd International Provenance and Annotation Workshop, IPAW 2008, Revised Selected Papers (pp. 160-173). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5272). Springer Verlag.