Workflow in Astronomy, the VO France Workflow Working Group experience



Similar documents
An Automated Workflow System Geared Towards Consumer Goods and Services Companies

Organization of VizieR's Catalogs Archival

How To Understand And Understand The Science Of Astronomy

SURVEY ON THE ALGORITHMS FOR WORKFLOW PLANNING AND EXECUTION

Multidimensional Data in the Virtual Observatory

VisIVO, an open source, interoperable visualization tool for the Virtual Observatory

BPMN PATTERNS USED IN MANAGEMENT INFORMATION SYSTEMS

M.L. Dubernet VOParis Data Centre

Process Mining. ^J Springer. Discovery, Conformance and Enhancement of Business Processes. Wil M.R van der Aalst Q UNIVERS1TAT.

A PRACTICAL APPROACH FOR A WORKFLOW MANAGEMENT SYSTEM

Astrophysics with Terabyte Datasets. Alex Szalay, JHU and Jim Gray, Microsoft Research

DAME Astrophysical DAta Mining Mining & & Exploration Exploration GRID

VisIVO, a VO-Enabled tool for Scientific Visualization and Data Analysis: Overview and Demo

The Virtual Observatory: What is it and how can it help me? Enrique Solano LAEFF / INTA Spanish Virtual Observatory

The astronomical Virtual Observatory : lessons learnt, looking forward. Françoise Genova - Forum VO-PDC d après ADASS XXI, Paris, nov.

Challenges and Opportunities for formal specifications in Service Oriented Architectures

Business Process Modeling Information Systems in Industry ( )

Course Syllabus For Operations Management. Management Information Systems

Realization of a UV fisheye hyperspectral camera

Chap 1. Introduction to Software Architecture

Virtual Atomic and Molecular Data Centre h4p:// (.org) Talk PCMI. Coordinator: M.L. Dubernet 1,2. Observatoire de Paris

Business Process Execution Language for Web Services

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova

Capturing provenance information with a workflow monitoring extension for the Kieker framework

MiSeq: Imaging and Base Calling

What Is NetBeans? Free and open-source based > Open source since June, 2000 > Large community of users and developers

To introduce software process models To describe three generic process models and when they may be used

Software Engineering. Software Processes. Based on Software Engineering, 7 th Edition by Ian Sommerville

WHITE PAPER. Harnessing the Power of Advanced Analytics How an appliance approach simplifies the use of advanced analytics

Testing mobile computing applications: toward a scenario language and tools

OLAP. Data Mining Decision

Analyses on functional capabilities of BizTalk Server, Oracle BPEL Process Manger and WebSphere Process Server for applications in Grid middleware

and the VO-Science Francisco Jiménez Esteban Suffolk University

Total Exploration & Production: Field Monitoring Case Study

Numerical Methods for Fusion. Lectures SMF session (19-23 July): Research projects: Organizers:

Monitoring of Business Processes in the EGI

Meta-Model specification V2 D

Harnessing the power of advanced analytics with IBM Netezza

Software Architecture Document

Developing a Service Oriented Process Management System for University Quality Assurance

NASA s Big Data Challenges in Climate Science

Picture of health. An integrated approach to asset health management

Bachelor of Games and Virtual Worlds (Programming) Subject and Course Summaries

TABLE OF CONTENTS ABSTRACT ACKNOWLEDGEMENT LIST OF FIGURES LIST OF TABLES

This software agent helps industry professionals review compliance case investigations, find resolutions, and improve decision making.

CHAPTER 6 MAJOR RESULTS AND CONCLUSIONS

Business Process Management: A personal view

DCA QUESTIONNAIRE V0.1-1 INTRODUCTION AND IDENTIFICATION OF THE DATA CENTRE

Big Data Mining Services and Knowledge Discovery Applications on Clouds

Winery A Modeling Tool for TOSCA-based Cloud Applications

Enterprise Architecture: Practical Guide to Logical Architecture

Classroom Exercise ASTR 390 Selected Topics in Astronomy: Astrobiology A Hertzsprung-Russell Potpourri

Structure of Presentation. The Role of Programming in Informatics Curricula. Concepts of Informatics 2. Concepts of Informatics 1

SSO Transmission Grating Spectrograph (TGS) User s Guide

Multiscale Object-Based Classification of Satellite Images Merging Multispectral Information with Panchromatic Textural Features

INTELLIGENT DEFECT ANALYSIS, FRAMEWORK FOR INTEGRATED DATA MANAGEMENT

HTML5 Data Visualization and Manipulation Tool Colorado School of Mines Field Session Summer 2013

The University of Jordan

ASTRONOMICAL SURVEYS and BIG DATA

Professional Organization Checklist for the Computer Science Curriculum Updates. Association of Computing Machinery Computing Curricula 2008

SERVICE-ORIENTED ARCHITECTURE FOR INTEGRATION OF BIOINFORMATIC DATA AND APPLICATIONS. A Dissertation. Submitted to the Graduate School

WebSphere Business Modeler

THE CCLRC DATA PORTAL

Transcription:

Astronomical Data Analysis Software and Systems 2007 Workflow in Astronomy, the VO France Workflow Working Group experience André Schaaff, Observatoire de Strasbourg Franck Le Petit, Observatoire de Paris Philippe Prugniel, Observatoire de Lyon Eric Slezak, Observatoire de la Côte d Azur Christian Surace, Laboratoire d Astrophysique de Marseille Journée ADASS Calculs 2007, et Données London distribués

Motivation 2 2 Many services are developed/deployed in the frame of the Virtual Observatory (registry, data services, Web Services, computing and Grid services, ). Complex implementation and coordination of the services are possible through workflows Evolution from an execution of one service to a combination of services (exchanging data, ) Question to be answered : Localisation and verification of the services (execution time, tests, results, ) Existing tools (how to take them into account in workflows) and constraints on the future developments How to integrate the VO Standards?.

Motivation (2) 3 3 Workflows are useful to capture scientific methodology and to provide provenance information for their results Workflows provide a formalization of the scientific analysis routines to be executed, data flow, execution details, Workflow are structures useful to manage computation at a large-scale Collaboratively designed, assembled, validated, analyzed

Quick State of the Art 4 4 Workflow languages (source : http://www.gridworkflow.org) AGWL, BPEL4WS, BPML, DGL, DPML, GJobDL, GSFL, GFDL, GWorkflowDL, MoML, SWFL, WSCL, WSCI, WSFL, XLANG, YAWL, SCUFL/XScufl, WPDL, PIF, PSL, OWL-S, xwfl, language formalisms Petri net, UML activity diagram, BPMN, DAG, IPO, GPSG, Workflow Patterns, Pi Calculus, Finite-State Machine, Gamma-calculus, mapping from abstract to concrete workflows Workflow CWG, ACWG, Grid Job Handler, GWES, engines BioPipe, BizTalk, BPWS4J, DAGMan, GridAnt, Grid Job Handler, GRMS, GWFE, GWES, IT Innovation Enactment Engine, JIGSA, JOpera, Kepler, Karajan, OSWorkflow, Pegasus (uses DAGMan), Platform Process Manager, ScyFLOW, SDSC Matrix, SHOP2, Taverna, Triana, wftk, YAWL Engine, WebAndFlo, WFEE, composition/designing tools ilog's BPMN Modeller, CAT, GWUI, XBaya GUI for Workflow Composition, Triana, JOpera, Platform Process Manager,...

Examples of workflow builders in astronomy 5 5 The ESO Recipe Flexible Execution Workbench (based on Taverna) R. Hook et al., ESO We had a useful discussion with R. Hook and P. Järvelaïnen The AstroGrid workflow system http://www.eso.org/sampo/reflex/ http://www2.astrogrid.org/

VO France Workflow Working Group 6 6 In a first step it was necessary to give our own definition of a workflow : a sequence of tasks executed within a controlled context by an architecture taking into account VO standards Main goals Definition of use cases of general interest in different domains Suggestion of solutions for designing and exploiting easily such workflows Identification of the simplest workflow structure allowing portability Definition of elementary bricks

Use cases 7 7 Illustration : some use cases presented and discussed in the frame of the working group

Image processing use case 8 8 operation detection and evaluation of related objects in 1 band image subjacent model diffuse disjoined tasks in emission on a bottom slowly variable without defects method cartography of the background thresholding by segmentation adjustment of an ellipse of form evaluation of the azimuth profile of brightness calculation of measurements of form and flow E. Slezak Observatoire de la Côte d Azur small part of a workflow use case which is under construction

Use case in Simulation 9 9 1 - PDR code modelisation of diffuse clouds gives theoretical absorption spectra 2 - Stellar spectra databases provides synthetic stellar spectra 3 - FUSE database provides observations & PSF 4 - Compare directly models to observations F. Le Petit et al. Observatoire de Paris-Meudon (C. Martin-Zaïdi et al. 2006)

Use case in Data Mining 10 See poster P1.021 J. Berthier et al., IMCCE

Use case for TITAN and NOAR codes 11 TITAN input of parameters LIST OF MODELS ready or in progress FIT VISUALIZATION 2-curves graph (parameters + χ 2 as a legend) Analysis tool of the model when ready L. Chevallier Obs. Paris Meudon Iteration >= max? NO YES VO registries DONE TITAN NOAR YES FIT OK? NO AUTO FIT NOAR Iteration += 1

Use case implementation 12 The aim of the working group is not to decide which tool is better than another In a first step we decided to use a tool (AÏDA Astronomical Image processing Distribution Architecture) in which development some of us were involved Easy to modify sources, to add data types, VO standards enabling was discussed during the development

AÏDA architecture 13 AÏDA architecture F. Bonnarel, M. Louys, A. Schaaff CDS & LSIIT AÏDA developments J. Beugnot J.-J. Claudon, G. Mantelet, C. Pestel, CDS Other clients Aladin, Web Services (SOAP), AÏDA workflow engine Hyper spectral image Genetic algorithm reduction Reduced image Segmentation Segmentation map AÏDA client Colored representation tool Colored image AÏDA Access methods Work done in the frame of the French «Massive Data in Astronomy» project (2003-2006), OV France and Euro VOTech Image processing tasks Genetic algorithm Segmentation reduction Colored representation tool Module N+1 Module N+2 Computing resources Local or remote execution

Example of implementation for an image processing workflow 14 Use case E. Slezak Observatoire de la Côte d Azur Simulation use case (F. Le Petit) will be implemented next month

Workflow builder plugin in Aladin 15 Based on AÏDA graphical libraries and engine C. Pestel T. Boch CDS

Remarks about the work done 16 Useful to describe workflow use cases and to be able to implement them with a light and flexible workflow tool Real help for astronomers Easy to reuse in new workflows old applications developed in the past by trainees, Ph D. students, engineers and astronomers in different languages Good training before using more heavy tools Experience sharing with people in different domains (image, spectroscopy, data mining, simulation, )

Ongoing work 17 Study how to involve the Characterization IVOA standard in the workflow process Work on bricks development extraction of subparts in n dimensions hypercube separation of structures in n dimensions hypercube resolution adaptation for imagery and spectrum multi components fit image addition, spectrum addition Interoperability of languages, checking with IT community what are the finalities of the exchanges between workflows, the best approach for standardization, etc.

VO France Workflow WG 18 Thanks to all the contributors and participants to the working group : Christophe Barache Obs. Paris-Meudon, Jérôme Berthier - IMCCE, Thomas Boch - CDS, Frédéric Boone - LERMA, François Bonnarel - CDS, Loïc Chevallier Obs. Paris-Meudon, Jean-Julien Claudon - CDS, André Csillaghy HES Switzerland, Bernard Debray - LAOB, Jean-Michel Desert - IAP, Pierre Didelon - CEA, Marie-Lise Dubernet Obs. Paris-Meudon, Anabela C. Goncalves - LUTH, Jesus Iglesias - IMCCE, Gaëlle Labourot - UVSQ, Martin France - CRAL, Pierre Le Sidaner Obs. Paris-Meudon, Mireille Louys - CDS-LSIIT, Guillaume Mella - LAOG, Areg Mickaelian BAO Armenia, Johan Montagnat - ESSI Sophia Antipolis, Nicolas Moreau - LERMA, Steven Morin - UVSQ, Jonathan Normand Obs. Paris-Meudon Cyril Pestel-CDS, Fabrice Roy - LUTH, Lena Sargsyan - Yerevan State University, Alain Sarkissian - Service d'aéronomie, Renaud Savalle - Obs. Paris-Meudon, Françoise Tran Minh - LERMA Obs. Paris-Meudon, Frédéric Vachier IMCCE, Bruno Voisin - NUI Galway, http://www.france-ov.org/twiki/bin/view/groupestravail/workflow Contact : schaaff@astro.u-strasbg.fr