SHARP: Harmonizing and Bridging Cross-Workflow Provenance
hal.structure.identifier | ||
dc.contributor.author | Gaignard, Alban (HAL ID: 1448; ORCID: 0000-0002-3597-8557) | |
hal.structure.identifier | Laboratoire d'analyse et modélisation de systèmes pour l'aide à la décision [LAMSADE] | |
dc.contributor.author | Belhajjame, Khalid | |
hal.structure.identifier | ||
dc.contributor.author | Skaf-Molli, Hala (HAL ID: 12153; ORCID: 0000-0003-1062-6659) | |
dc.date.accessioned | 2020-05-25T10:25:24Z | |
dc.date.available | 2020-05-25T10:25:24Z | |
dc.date.issued | 2017 | |
dc.identifier.uri | https://basepub.dauphine.fr/handle/123456789/20773 | |
dc.description | Lecture Notes in Computer Science book series (LNCS, volume 10577) | en |
dc.language.iso | en | en |
dc.subject | Reproducibility | en |
dc.subject | Scientific Workflows | en |
dc.subject | Provenance | en |
dc.subject | Prov Constraints | en |
dc.subject.ddc | 004 | en |
dc.title | SHARP: Harmonizing and Bridging Cross-Workflow Provenance | en |
dc.type | Conference paper | |
dc.description.abstracten | PROV has been adopted by a number of workflow systems for encoding the traces of workflow executions. Exploiting these provenance traces is hampered by two main impediments. Firstly, workflow systems extend PROV differently to cater for system-specific constructs. The differences between the adopted PROV extensions yield heterogeneity in the generated provenance traces. This heterogeneity diminishes the value of such traces, e.g. when combining and querying provenance traces of different workflow systems. Secondly, the provenance recorded by workflow systems tends to be large, and as such difficult for a human user to browse and understand. In this paper, which extends work initially published at SeWeBMeDA’17, we propose SHARP, a Linked Data approach for harmonizing cross-workflow provenance. The harmonization is performed by chasing tuple-generating and equality-generating dependencies defined for workflow provenance. This results in a provenance graph that can be summarized using domain-specific vocabularies. We experimentally evaluate SHARP (i) on publicly available provenance documents and (ii) using a real-world omic experiment involving workflow traces generated by the Taverna and Galaxy systems. | en |
dc.identifier.citationpages | 219-234 | en |
dc.relation.ispartofeditor | Blomqvist, Eva | |
dc.relation.ispartofeditor | Hose, Katja | |
dc.relation.ispartofeditor | Paulheim, Heiko | |
dc.relation.ispartofpublname | Springer | en |
dc.relation.ispartofpublcity | Cham | en |
dc.relation.ispartofpages | 387 | en |
dc.relation.ispartofurl | 10.1007/978-3-319-70407-4 | en |
dc.identifier.urlsite | https://hal.archives-ouvertes.fr/hal-01768385 | en |
dc.subject.ddclabel | General computer science | en |
dc.relation.ispartofisbn | 978-3-319-70407-4 | en |
dc.relation.conftitle | The Semantic Web: ESWC 2017 Satellite Events | en |
dc.relation.confdate | 2017-05 | |
dc.relation.confcity | Portorož | en |
dc.relation.confcountry | Slovenia | en |
dc.relation.forthcoming | no | en |
dc.identifier.doi | 10.1007/978-3-319-70407-4_35 | en |
dc.description.ssrncandidate | no | en |
dc.description.halcandidate | no | en |
dc.description.readership | research | en |
dc.description.audience | International | en |
dc.relation.Isversionofjnlpeerreviewed | no | en |
dc.date.updated | 2020-05-25T10:21:33Z | |
hal.author.function | aut | |
hal.author.function | aut | |
hal.author.function | aut |