1 / 16

A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints

A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints. Edoardo Pignotti, Peter Edwards, Alun Preece, Nick Gotts and Gary Polhill School of Natural & Computing Sciences University of Aberdeen The Macaulay Institute, Aberdeen. Content. Background Workflow Technologies

helmut
Download Presentation

A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints Edoardo Pignotti, Peter Edwards, Alun Preece, Nick Gotts and Gary Polhill School of Natural & Computing Sciences University of Aberdeen The Macaulay Institute, Aberdeen

  2. Content • Background • Workflow Technologies • Upper Deeside Case Study • Research Challenges • Scientist’s Intent Framework • Semantic Workflow Architecture • Evaluation & Conclusions

  3. Background • Semantic Grid Vision • Support for researchers to publish, share and re-use resources, integrate heterogeneous information and collaborate. (www.semanticgrid.org) • Central to this view is the integration of computational Grid technologies with Semantic Web technologies. • Fearlus-G Project (Pignotti et. al 2005) • Deployed existing social simulation model of land-use change onto the Grid. • Created metadata tools to support annotation/sharing of social simulation resources. • Fearlus-G was not designed to be a flexible problem-solving environment.

  4. Workflow in e-Science • Scientific workflows facilitate the creation and execution of experiments from a pool of available services. • Important part of in-silico experimentation to investigate or verify a hypothesis. • Taverna (myGrid, UK eScience) • Easy use of workflow and distributed compute technology. • Metadata to support the discovery of new services. • Kepler (University of California) • Provides Director component • Controls the execution environment (i.e. the “flow”). • Import OWL ontologies. • Metadata to describe activities, inputs and outputs.

  5. Evaluation of Workflow Technologies • Participants from several different disciplines. • Task • design an experiment using a Workflow tool; • Feedback via questionnaires, interviews and through direct observation. • Outcome • It is not possible to represent constraints (e.g. regarding data aggregation); • Contextual information about the experiment is missing; • At the moment, detailed technical documentation is needed in order to fully understand a workflow.

  6. Upper Deeside Case Study Constraint: If in any of the 50 runs, one land manager owns more than half of the land, ignore this parameter-set. Goal: Obtain at least one match where the real data falls within 95% confidence interval of the model value. Constraint: Simulation needs to run on a platform compatible with IEEE 754 floating point standard.

  7. Research Challenges • Need to go beyond low-level service composition by capturing higher-level descriptions of the scientific process. • Make the experimental conditions and goals of the experiment transparent. • Represent “scientist's intent”in such a way that: • it is meaningful to the researcher; • it can be reasoned about by a software application; • it can be re-used across different workflows; • it can be used as provenance (documenting the process that led to some result).

  8. Scientist's Intent Framework • Capture the logic behind scientist’s intent • Model of scientist’s intent based upon rules: PreCondition/Post Action • Based on workflow metadata support and SWRL (Semantic Web Rule Language). • Example GOAL: obtain at least one match where the real data falls within 95% confidence interval of the model value. Pre Condition: ParameterSet( ?x1 ) ^ DataSet( ?x2 ) ^ ComparisonTest( ?x3 ) ^ compares( ?x3, ?x1 ) ^ compares( ?x3, ?x2 ) ^ similarity( ?x3, ?x4 ) ^ [more-than ( ?x4, 95%) = true

  9. Scientist’s Intent for Monitoring Workflow CONSTRAINT: check if the simulation is running on a platform compatible with the IEEE 754 floating point standard. PreCondition: GridTask( ?x1 ) ^ Simulation( ?x2 ) ^ runsSimulation( ?x1, ?x2 ) ^ neg runsOnPlatform( ?x1, `IEEE754' ) ^ hasResult( ?x2, ?x3) ^ PostCondition: hasInvalidResults( ?x2, ?x3 ) ^ ACTION:resubmitTask(?x1) • Actions performed depend on the ability of the workflow to process events.

  10. Scientist’s Intent for Result Enrichment CONSTRAINT: if the similarity between the results of a simulation run and an existing dataset is below 10%, it is interesting to investigate the behavior of the run. PreCondition ParameterSet( ?x1 ) ^ DataSet( ?x2 ) ^ Simulation( ?x3 ) ^ hasSimulationRun( ?x3, ?x4 ) ^ ComparisonTest( ?x5 ) ^ compares( ?x5, ?x4 ) ^ compares( ?x5, ?x2 ) ^ similarity( ?x5, ?x6 ) ^ [less-than ( ?x6, 10%) = true] PostAction: runToExplore(?x3, ?x4 )

  11. Scientist’s Intent as Provenance • Scientist’s Intent Ontology • We have attempted to align out ontology with the core characteristics of the Open Provenance Model (OPM)

  12. Workflow Metadata Support • Possible sources of metadata: • metadata about the result(s) generated upon completion of the workflow; • metadata about the data generated at the end of an activity within the workflow or sub-workflow; • metadata about the status of an activity over time, e.g. while the workflow is running; • metadata describing the activity itself (e.g. type of service, platform, etc.) • Ontologies • Scientist’s Intent • PolicyGrid provenance framework

  13. Semantic Workflow Architecture

  14. Scientist’s Intent Interface

  15. Evaluation & Conclusions • Assessing the enhanced workflow representation • Expressiveness of the intent formalism • Reusability • Workflow execution • Creating and utilizing metadata is a non-trivial task • Scale issues with FEARLUS simulation metadata (~250,000,000 triples per experiment). • We aim to provide a closer connection between experimental workflows and the goals and constraints of the researcher, thus making experiments more transparent.

  16. Questions?

More Related