DataNet Flexible Metadata Overlay over File Resources




Daniel Harężlak (1), Marek Kasztelnik (1), Maciej Pawlik (1), Bartosz Wilk (1), Marian Bubak (1,2)
(1) ACC Cyfronet AGH
(2) AGH University of Science and Technology, Institute of Computer Science AGH

Workshop on Cloud Services for File Synchronisation and Sharing, November 17-18, 2014, CERN

Presentation Plan
- PL-Grid Computing Infrastructure
- Motivation behind DataNet
- Metadata Management Requirements
- Architecture Description
- Deployment
- Conclusions

DataNet PLGrid Landscape
PL-Grid Programme
- PL-Grid
  - Computing power: ca. 230 Tflops
  - Storage: ca. 3600 Tbytes
  - Basic infrastructure services
- PL-Grid PLUS
  - Additional 500 Tflops and 4.4 Pbytes
  - Support for domain Grids
Diagram labels: Hardware/Middleware, Services, Users, Domain Experts

DataNet Rationale and Objectives
Rationale
- Data management is a common requirement in computational sciences
- Workflow and scripting engines provide only a little support
- Each application is different and requires a dedicated metadata/data model
Objectives
- Provide means for ad-hoc metadata model creation and deployment of corresponding storage facilities
- Create a research space for metadata model exchange and discovery, with associated data repositories and access restrictions in place
- Support different types of storage sites and data transfer protocols
- Support the exploratory paradigm by making the models evolve together with data

DataNet PLGrid Requirements
- PLGrid infrastructure supporting different e-science domains
  - Various applications coming from different scientific communities
  - Common computational resources
- Deployment of model data as repositories
  - Robust enablement of a dedicated interface
  - Access control capabilities
  - Exploitation of available storage infrastructure
- Universal availability of the repository
  - Platform independent
  - Facilitated by existing standards

DataNet Architecture
- The Web Interface is used by users to create, extend and discover metadata models
- Model repositories are deployed in the PaaS Cloud layer for scalable and reliable access from computing nodes through REST interfaces
- Data items from Storage Sites are linked from the model repositories

DataNet Data Model
- Set of entities with fields
  - Simple types
  - Array types
  - File type
- Relations
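The data model listed above (entities with simple, array and file fields, plus relations) can be pictured with a small Python sketch. This is only an illustration under stated assumptions: the class names, the `kind` and `type_name` attributes, and the set of simple type names are hypothetical and do not come from DataNet itself.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical simple type names; the slides name only the categories
# (simple, array, file, relation), not concrete identifiers.
SIMPLE_TYPES = {"string", "integer", "float", "boolean"}
FIELD_KINDS = ("simple", "array", "file", "relation")


@dataclass
class Field:
    name: str
    kind: str            # one of FIELD_KINDS
    type_name: str = ""  # simple type, array element type, or related entity name


@dataclass
class Entity:
    name: str
    fields: List[Field] = field(default_factory=list)


def validate_model(entities: List[Entity]) -> List[str]:
    """Return a list of problems found in the model (empty if none)."""
    entity_names = {e.name for e in entities}
    problems = []
    for entity in entities:
        for f in entity.fields:
            label = f"{entity.name}.{f.name}"
            if f.kind not in FIELD_KINDS:
                problems.append(f"{label}: unknown kind {f.kind!r}")
            elif f.kind in ("simple", "array") and f.type_name not in SIMPLE_TYPES:
                problems.append(f"{label}: unknown type {f.type_name!r}")
            elif f.kind == "relation" and f.type_name not in entity_names:
                problems.append(f"{label}: unknown entity {f.type_name!r}")
    return problems


# Example model: a Sample holds a file and points to its Experiment.
experiment = Entity("Experiment", [
    Field("title", "simple", "string"),
    Field("temperatures", "array", "float"),
])
sample = Entity("Sample", [
    Field("raw_data", "file"),
    Field("experiment", "relation", "Experiment"),
])
model = [experiment, sample]
```

Calling `validate_model(model)` on this example returns an empty list; a relation that names an entity missing from the model would be reported as a problem.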

DataNet Repository
- Repositories are accessed through REST
- Data view through a web application
- Configurable access control
  - Public
  - Private (within a group of users)
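The two access modes named above (public, or private within a group of users) amount to a simple membership check. The sketch below is a minimal illustration, not DataNet's implementation; the `RepositoryAcl` class and its attributes are hypothetical names.

```python
from dataclasses import dataclass, field
from typing import Set


@dataclass
class RepositoryAcl:
    """Hypothetical access settings for a single repository."""
    public: bool = False
    members: Set[str] = field(default_factory=set)  # users allowed when private

    def can_access(self, user: str) -> bool:
        # Public repositories are open to everyone; private ones
        # only to users listed in the group.
        return self.public or user in self.members


private_repo = RepositoryAcl(public=False, members={"alice", "bob"})
public_repo = RepositoryAcl(public=True)
```

Here `private_repo.can_access("alice")` is true and `private_repo.can_access("carol")` is false, while `public_repo` admits any user.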

DataNet Repository Access
- Data sent over with JSON or FORM
- REST methods
  - POST: submit new data
  - PUT: modify data
  - DELETE: remove data
  - GET: retrieve data
- Queries with URL
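The mapping of operations to REST methods above can be sketched with Python's standard library. The base URL, the entity name and the URL layout (`/entity/id`, query as URL parameters) are assumptions made for illustration; the slides do not specify DataNet's actual paths. The requests are only constructed, not sent.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request

# Hypothetical repository address, for illustration only.
BASE_URL = "https://repository.example.org"

# Operation names are illustrative; the HTTP methods follow the slide.
METHODS = {"create": "POST", "update": "PUT", "delete": "DELETE", "read": "GET"}


def build_request(action, entity, item_id=None, data=None, query=None):
    """Build (but do not send) an HTTP request for one repository operation."""
    url = f"{BASE_URL}/{entity}"
    if item_id is not None:
        url += f"/{item_id}"
    if query:
        # Queries are expressed as URL parameters.
        url += "?" + urlencode(query)
    body = None
    headers = {}
    if data is not None:
        # JSON is one of the two payload formats mentioned (JSON or FORM).
        body = json.dumps(data).encode("utf-8")
        headers["Content-Type"] = "application/json"
    return Request(url, data=body, headers=headers, method=METHODS[action])


create = build_request("create", "sample", data={"title": "run1"})
update = build_request("update", "sample", item_id="42", data={"title": "run2"})
remove = build_request("delete", "sample", item_id="42")
search = build_request("read", "sample", query={"title": "run1"})
```

Passing any of these objects to `urllib.request.urlopen` would send the request; authentication is left out of the sketch because the slides do not say how credentials are attached.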

DataNet PLGrid Deployment
- PLGrid users
  - Access with a regular PLGrid account
  - REST interface
  - Web application for simple use cases
- Domain applications
  - User proxy delegation retrieved from the PLGrid OpenID provider
  - REST interface
- DataNet as a Service already in place
  - Deployment procedure followed

DataNet Service Summary
Status: Production
Number of users: 5-10 (30-40 planned)
Default and maximum quota: 5 GB (Home), 100 GB (Storage)
Linux/Mac/Win user ratio: 3/0.5/2
Desktop/Mobile/Web access ratio: Web only
Technology: Cloud Foundry, Java, Ruby
Target communities: Researchers in various fields
Integration in your current environment: Using the same storage elements
Risk factors: Network reliability (upload interruptions)
Most important functionality: REST access
Missing functionality (if any): None so far

DataNet User Feedback
- Easy integration in any programming language
- Possibility to view data via a web application
- Easy extension to other metadata engines
- Not end-user software

DataNet DONEs and TODOs
DONEs
- A custom Cloud Foundry environment was set up as a PaaS platform to ensure quick deployment of the required application and storage services
- A schema for metadata model creation was elaborated and evaluated with the MongoDB NoSQL storage service
- Storage site access libraries were implemented and tested
- A web-based tool to create, discover and manage metadata models was deployed
TODOs
- Support various types of metadata storage services to fulfil different application requirements (if required)
- Prototype a utility for data migration between model versions

Thank You
Acknowledgements
This research has been partially supported by the European Regional Development Fund program no. POIG.02.03.00-00-096/10 as part of the PL-Grid PLUS project.

Contact us and help make DataNet better.
Visit http://dice.cyfronet.pl for more information.
See DataNet in action at https://datanet.cyfronet.pl (PLGrid account required).