Provenance in dynamically adjusted and partitioned workflows

Daniel Goodman

    Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

    134 Downloads (Pure)

    Abstract

    In this paper we describe the provenance system built into the distributed Martlet middleware. Due to both the need for scientific reproducibility, and to determine exactly what has happened with any given piece of analysis, it is necessary for this middleware to record detailed and structured provenance data in an easily query-able form. This is achieved through the use of integer clocks and directed graphs. Using these, this system is capable of keeping a complete history of the creation of all data, including the ability to store in-depth information defined by the task about the operations performed. This allows the system to continue to gather provenance data regardless of the rough grained functions being wrapped by the middleware. The middleware was developed to support functions described in "Martlet", a workflow language developed to address the problem of how to analyse the data generated by the climateprerdiction.net experiment. This data is both highly distributed, and resides in a dynamic environment where the partitioning of data structures across the distributed nodes may change both in the number of pieces and their locations, and resources may come and go. This makes it necessary for the structure of the workflows to change from execution to execution. As such the provenance system is also required to be able to handle such a dynamic environment. © 2008 IEEE.
    Original languageEnglish
    Title of host publicationProceedings - 4th IEEE International Conference on eScience, eScience 2008|Proc. - IEEE Int. Conf. eScience, eScience
    Pages39-46
    Number of pages7
    Publication statusPublished - 2008
    Event4th IEEE International Conference on eScience, eScience 2008 - Indianapolis, IN
    Duration: 1 Jul 2008 → …

    Conference

    Conference4th IEEE International Conference on eScience, eScience 2008
    CityIndianapolis, IN
    Period1/07/08 → …

    Fingerprint

    Dive into the research topics of 'Provenance in dynamically adjusted and partitioned workflows'. Together they form a unique fingerprint.

    Cite this