Small Is Beautiful: Summarizing Scientific Workflows Using Semantic Annotations

Pinar Alper, Khalid Belhajjame, Carole Goble, Pinar Karagoz

Research output: Contribution to conference, Paper, peer-review


Abstract

Scientific workflows have become the workhorse of Big Data analytics for scientists. As well as being repeatable and optimizable pipelines that bring together datasets and analysis tools, workflows make up an important part of the provenance of data generated from their execution. By faithfully capturing all stages in the analysis, workflows play a critical part in building up the audit-trail (a.k.a. provenance) metadata for derived datasets and contribute to the veracity of results. Provenance is essential for reporting results, reporting the method followed, and adapting to changes in the datasets or tools. These functions, however, are hampered by the complexity of workflows and consequently the complexity of data-trails generated from their instrumented execution. In this paper we propose the generation of workflow description summaries in order to tackle workflow complexity. We elaborate reduction primitives for summarizing workflows, and show how primitives, as building blocks, can be used in conjunction with semantic workflow annotations to encode different summarization strategies. We report on the effectiveness of the method through experimental evaluation using real-world workflows from the Taverna system.
Original language: English
Pages: 318-325
Number of pages: 8
Publication status: Published - 27 Jun 2013
Event: 2013 IEEE International Congress on Big Data (BigData Congress) - Santa Clara, CA, USA
Duration: 27 Jun 2013 to 2 Jul 2013

Conference

Conference: 2013 IEEE International Congress on Big Data (BigData Congress)
Period: 27/06/13 to 2/07/13

Keywords

  • Scientific Workflow
  • Annotation
  • Rule-Based Summarization
  • Motif
  • Provenance
