• xmlui.mirage2.page-structure.header.title
    • français
    • English
  • Help
  • Login
  • Language 
    • Français
    • English
View Item 
  •   BIRD Home
  • LAMSADE (UMR CNRS 7243)
  • LAMSADE : Publications
  • View Item
  •   BIRD Home
  • LAMSADE (UMR CNRS 7243)
  • LAMSADE : Publications
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Browse

BIRDResearch centres & CollectionsBy Issue DateAuthorsTitlesTypeThis CollectionBy Issue DateAuthorsTitlesType

My Account

LoginRegister

Statistics

Most Popular ItemsStatistics by CountryMost Popular Authors
Thumbnail - No thumbnail

LabelFlow Framework for Annotating Workflow Provenance

Alper, Pinar; Belhajjame, Khalid; Curcin, Vasa; Goble, Carole (2018), LabelFlow Framework for Annotating Workflow Provenance, Informatics, 5, 1. 10.3390/informatics5010011

View/Open
informatics-05-00011-v2.pdf (2.561Mb)
Type
Article accepté pour publication ou publié
Date
2018
Journal name
Informatics
Volume
5
Number
1
Publisher
MDPI
Publication identifier
10.3390/informatics5010011
Metadata
Show full item record
Author(s)
Alper, Pinar cc
Luxembourg Centre For Systems Biomedicine [LCSB]
Belhajjame, Khalid
Laboratoire d'analyse et modélisation de systèmes pour l'aide à la décision [LAMSADE]
Curcin, Vasa cc
Department of Health Service and Population Research, Institute of Psychiatry, King's College London
Goble, Carole cc
School of Computer Science [Manchester]
Abstract (EN)
Scientists routinely analyse and share data for others to use. Successful data (re)use relies on having metadata describing the context of analysis of data. In many disciplines the creation of contextual metadata is referred to as reporting. One method of implementing analyses is with workflows. A stand-out feature of workflows is their ability to record provenance from executions. Provenance is useful when analyses are executed with changing parameters (changing contexts) and results need to be traced to respective parameters. In this paper we investigate whether provenance can be exploited to support reporting. Specifically; we outline a case-study based on a real-world workflow and set of reporting queries. We observe that provenance, as collected from workflow executions, is of limited use for reporting, as it supports queries partially. We identify that this is due to the generic nature of provenance, its lack of domain-specific contextual metadata. We observe that the required information is available in implicit form, embedded in data. We describe LabelFlow, a framework comprised of four Labelling Operators for decorating provenance with domain-specific Labels. LabelFlow can be instantiated for a domain by plugging it with domain-specific metadata extractors. We provide a tool that takes as input a workflow, and produces as output a Labelling Pipeline for that workflow, comprised of Labelling Operators. We revisit the case-study and show how Labels provide a more complete implementation of reporting queries.
Subjects / Keywords
Workflow; Provenance; Domain-specific annotation

Related items

Showing items related by title and author.

  • Thumbnail
    LabelFlow: Exploiting Workflow Provenance to Surface Scientific Data Provenance 
    Alper, Pinar; Belhajjame, Khalid; Goble, Carole; Karagoz, pinar (2015) Communication / Conférence
  • Thumbnail
    Static analysis of Taverna workflows to predict provenance patterns 
    Alper, Pinar; Belhajjame, Khalid; Goble, Carole (2017) Article accepté pour publication ou publié
  • Thumbnail
    Common motifs in scientific workflows: An empirical analysis 
    Goble, Carole; Gil, Yolanda; Corcho, Oscar; Belhajjame, Khalid; Garijo, Daniel; Alper, Pinar (2014) Article accepté pour publication ou publié
  • Thumbnail
    UP & DOWN: Improving Provenance Precision by Combining Workflow- and Trace-Level Information 
    Dey, Saumen; Belhajjame, Khalid; Koop, David; Song, Tianhong; Missier, Paolo; Ludäscher, Bertram (2014) Communication / Conférence
  • Thumbnail
    SHARP: Harmonizing and Bridging Cross-Workflow Provenance 
    Gaignard, Alban; Belhajjame, Khalid; Skaf-Molli, Hala (2017) Communication / Conférence
Dauphine PSL Bibliothèque logo
Place du Maréchal de Lattre de Tassigny 75775 Paris Cedex 16
Phone: 01 44 05 40 94
Contact
Dauphine PSL logoEQUIS logoCreative Commons logo