An identity crisis in the life sciences

    Research output: Chapter in Book/Report/Conference proceedingConference contribution


    myGrid is an e-Science project assisting life scientists to build workflows that gather data from distributed, autonomous, replicated and heterogeneous resources. The provenance logs of workflow executions are recorded as RDF graphs. The log of one workflow run is used to trace the history of its execution process. However, by aggregating provenance logs of many workflow runs, one may gather the provenance of a common data product shared in multiple derivation paths, A successful aggregation relies on accurate and universal identification of each data product. The nature of bioinformatics data and services, however, makes this difficult. We describe the identity problem in bioinformatics data, and present a protocol for managing identity co-references and allocating identity to gathered and computed data products. The ability to overcome this problem means that the provenance of workflows in bioinformatics and other domains can be exploited to enhance the practice of e-Science. © Springer-Verlag Berlin Heidelberg 2006.
    Original languageEnglish
    Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)|Lect. Notes Comput. Sci.
    PublisherSpringer Nature
    Number of pages15
    ISBN (Print)354046302X, 9783540463023
    Publication statusPublished - 2006
    EventInternational Provenance and Annotation Workshop, IPAW 2006 - Chicago, IL
    Duration: 1 Jul 2006 → …

    Publication series

    NameLecture Notes in Computer Science


    ConferenceInternational Provenance and Annotation Workshop, IPAW 2006
    CityChicago, IL
    Period1/07/06 → …
    Internet address


    Dive into the research topics of 'An identity crisis in the life sciences'. Together they form a unique fingerprint.

    Cite this