Repository logo
  • English
  • Italiano
Log In
New user? Click here to register.Have you forgotten your password?
  1. Home
  2. Affiliation
  3. INGV
  4. Conference materials
  5. Active provenance for Data-Intensive workflows: engaging users and developers
 
  • Details

Active provenance for Data-Intensive workflows: engaging users and developers

Author(s)
Spinuso, Alessandro  
Atkinson, Malcolm  
Magnoni, Federica  
Istituto Nazionale di Geofisica e Vulcanologia (INGV), Sezione ONT, Roma, Italia  
Type
Conference paper
Language
English
Obiettivo Specifico
3T. Fisica dei terremoti e Sorgente Sismica
3IT. Calcolo scientifico
Status
Published
Journal
Bridging from Concepts to Data and Computation for eScience Conference (BC2DC’19)  
Date Issued
September 2019
Conference Location
San Diego (CA, USA)
DOI
10.1109/eScience.2019.00077
URI
https://www.earth-prints.org/handle/2122/13262
Subjects

Reproducibility

Workflow management s...

Metadata

Collaborative work

Data flow computing...

Abstract
We present a practical approach for provenance capturing in Data-Intensive workflow systems. It provides contextualisation by recording injected domain metadata with the provenance stream. It offers control over lineage precision, combining automation with specified adaptations. We address provenance tasks such as extraction of domain metadata, injection of custom annotations, accuracy and integration of records from multiple independent workflows running in distributed contexts. To allow such flexibility, we introduce the concepts of programmable Provenance Types and Provenance Configuration.Provenance Types handle domain contextualisation and allow developers to model lineage patterns by re-defining API methods, composing easy-to-use extensions. Provenance Configuration, instead, enables users of a Data-Intensive workflow execution to prepare it for provenance capture, by configuring the attribution of Provenance Types to components and by specifying grouping into semantic clusters. This enables better searches over the lineage records. Provenance Types and Provenance Configuration are demonstrated in a system being used by computational seismologists. It is based on an extended provenance model, S-PROV.
File(s)
Loading...
Thumbnail Image
Name

eScienceProv.pdf

Size

1.71 MB

Format

Adobe PDF

Checksum (MD5)

6bfaa86b936b2c39624761ea86e577dc

rome library|catania library|milano library|napoli library|pisa library|palermo library
Explore By
  • Research Outputs
  • Researchers
  • Organizations
Info
  • Earth-Prints Open Archive Brochure
  • Earth-Prints Archive Policy
  • Why should you use Earth-prints?
Earth-prints working group
⚬Anna Grazia Chiodetti (Project Leader)
⚬Gabriele Ferrara (Technical and Editorial Assistant)
⚬Massimiliano Cascone
⚬Francesca Leone
⚬Salvatore Barba
⚬Emmanuel Baroux
⚬Roberto Basili
⚬Paolo Marco De Martini

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Privacy policy
  • End User Agreement
  • Send Feedback