Cloud-based Linked Data Geoprocessing: Implementing Kriging as WPS on the Cloud



Similar documents
Return on Experience on Cloud Compu2ng Issues a stairway to clouds. Experts Workshop Nov. 21st, 2013

Cloud based platforms for Linking and Managing Geodata

Linked Data Management and INSPIRE Compliant Data Export

GetLOD - Linked Open Data and Spatial Data Infrastructures

InGeoCloudS: open Cloud-based services for Geospatial Data management in an INSPIRE context

Investigating Hadoop for Large Spatiotemporal Processing Tasks

Cloud-based Infrastructures. Serving INSPIRE needs

Portal Version 1 - User Manual

Lift your data hands on session

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

The ORIENTGATE data platform

A Web services solution for Work Management Operations. Venu Kanaparthy Dr. Charles O Hara, Ph. D. Abstract

Combining Drupal Content Management System with OGC Web Services

Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study

GIS Databases With focused on ArcSDE

Introduction to Web Development with R

GeoKettle: A powerful open source spatial ETL tool

Hunk & Elas=c MapReduce: Big Data Analy=cs on AWS

Oracle Platform GIS & Location-Based Services. Fred Louis Solution Architect Ohio Valley

Resource Oriented Architecture and REST

A Performance Analysis of Distributed Indexing using Terrier

Data Management in the Cloud: Limitations and Opportunities. Annies Ductan

2) Xen Hypervisor 3) UEC

Experiences from a Large Scale Ontology-Based Application Development

Choosing the right GIS framework for an informed Enterprise Web GIS Solution

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

D3.4.1: Data Fusion Tools

Design Requirements for an AJAX and Web-Service Based Generic Internet GIS Client

How To Extend An Enterprise Bio Solution

How To Install Linux Titan

Leveraging Cloud-Based Mapping Solutions

HDFS Cluster Installation Automation for TupleWare

ArcGIS. Server. A Complete and Integrated Server GIS

MathCloud: From Software Toolkit to Cloud Platform for Building Computing Services

An Approach to Implement Map Reduce with NoSQL Databases

LinkZoo: A linked data platform for collaborative management of heterogeneous resources

Cloud Computing and Government Services August 2013 Serdar Yümlü SAMPAŞ Information & Communication Systems

A Novel Cloud Based Elastic Framework for Big Data Preprocessing

Analysis of the Free GIS Software Applications in respect to INSPIRE services and OGC standards

Applying Semantic Web Technologies in Service-Oriented Architectures

High Performance Big Data Analy5cs powered by Unique Web Accelera5on and NoSQL. The Big Data Engine

An HTML tool for exploiting geospatial web services

Chapter 3. Database Architectures and the Web Transparencies

Big Data: Using ArcGIS with Apache Hadoop. Erik Hoel and Mike Park

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets

Tools for Web Archiving: The Java/Open Source Tools to Crawl, Access & Search the Web. NLA Gordon Mohr March 28, 2012

ArcGIS Viewer for Silverlight An Introduction

Application of GIS Analyzes with Cloud Computing

Certified Apache CouchDB Professional VS-1045

Chapter 1: Introduction to ArcGIS Server

Geospatial Data and the Semantic Web. The GeoKnow Project. Sebastian Hellmann AKSW/KILT research group, Leipzig University & DBpedia Association

Hadoop: A Framework for Data- Intensive Distributed Computing. CS561-Spring 2012 WPI, Mohamed Y. Eltabakh

MOBILE APPLICATIONS AND CLOUD COMPUTING. Roberto Beraldi

Big Data on Microsoft Platform

Environment Canada Data Management Program. Paul Paciorek Corporate Services Branch May 7, 2014

Interoperable Solutions in Web-based Mapping

Introduction to Service Oriented Architectures (SOA)

Monitoring HP OO 10. Overview. Available Tools. HP OO Community Guides

Fraunhofer FOKUS. Fraunhofer Institute for Open Communication Systems Kaiserin-Augusta-Allee Berlin, Germany.

MongoDB and Couchbase

IAAA Grupo de Sistemas de Información Avanzados

Where is... How do I get to...

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum

Vector Web Mapping Past, Present and Future. Jing Wang MRF Geosystems Corporation

Standardized data sharing through an open-source Spatial Data Infrastructure: the Afromaison project

MS 20487A Developing Windows Azure and Web Services

2015 Ironside Group, Inc. 2

Google Web Toolkit (GWT) Architectural Impact on Enterprise Web Application

Harnessing the Power of the Microsoft Cloud for Deep Data Analytics

A Brief Introduction to Apache Tez

What s new in Carmenta Server 4.2

TopBraid Insight for Life Sciences

EUR-Lex 2012 Data Extraction using Web Services

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

Cloud Computing Now and the Future Development of the IaaS

Architecture and Mode of Operation

Documentation of open source GIS/RS software projects

Big RDF Data Partitioning and Processing using hadoop in Cloud

THE EUROPEAN DATA PORTAL

Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc

Web Map Service Architecture for Topographic Data in Finland

Alfresco Enterprise on AWS: Reference Architecture

ATLAS job monitoring in the Dashboard Framework

Epimorphics Linked Data Publishing Platform

TEST AUTOMATION FRAMEWORK

MarkLogic Semantics in Healthcare and Life Sciences for LIDER COPYRIGHT 2015 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.

Access to ESPON Database by third- party applications An expertise on the potentialities of web services for accessing the ESPON metadata and data

SUMMER SCHOOL ON ADVANCES IN GIS

The full setup includes the server itself, the server control panel, Firebird Database Server, and three sample applications with source code.

Data Integration Checklist

Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013

Transcription:

Cloud-based Linked Data Geoprocessing: Implementing Kriging as WPS on the Cloud Elias Grinias Department of Civil Engineering, Surveying and Geomatics, TEI of Central Macedonia and Dimitris Kotzinos ETIS Lab, Department of Computer Science, University of Cergy Pontoise, FR and Institute of Computer Science, FORTH, GR 1

Geoprocessing on the Cloud Statistical Geoprocessing Time and storage demanding activity Cloud computing Efficient (distributed) solving of complex geoprocessing problems Scalable and elastic management of large (input) geospatial datasets Linked Data (LD) Data inter-linked to each other Data indentified by unique URIs Re-usability, interconnection and combination of data on the fly Linked Open (Geo)Data: Linked (Geo)Data freely available and open on the Web 2

Web Processing Service (WPS) OGC Web Processing Service (OGC-WPS) Standard for implementing/publishing geoprocesses as a web service Operations GetCapabilities, DescribeProcess, Execute In practice: HTTP XML submission to service s URL Execute operation XML containing information about Process identifier Inputs/parameters and form of process outputs 3

Implementation Schema Geoprocessing on the cloud WPS based on 52 North WPS implementation Kriging WPS process based on R-Gstat Input fetched from Linked Open GeoData repository through (Geo)SPARQL queries posted to Virtuoso server. Usage of more processing and storage instances when needed. Client Implementation Modification of 52 North WPS Client Implementation Based on latest javascript open source software: Ext, Openlayers and JQuery modules. 4

Kriging on the Cloud Ordinary Kriging Geostatistical interpolation method Fact: as distance between points increases, the similarity, defined by the covariance or correlation between points, decreases. Local neighborhood prediction: predict the unknown value at a location x0 using data values in the neighborhood of this location. x 0 r 5

Kriging on the Cloud Implemenation Adoption of 52 North WPS 3.1.1 implementation of OGC-WPS 1.0.0 standard Ordinary kriging using R- Gstat Global or localneighborhood prediction Prediction on non-projected data using great circle distance between points Fast enough (its main functionality is coded in C) Interconnection between R and the Java module located at the WPS Container through TCP/IP server Rserve. Machine Web Java Server WPS Container R Kriging module Rserve Gstat HTTP GET/POST XML files Client 6

Client Implementation WPS Process Description Modification of the open source 52 North Openlayers WPS Client WPS Process Description Openlayers panel construction after an HTTP GET request to WPS Container for Kriging inputs/parameters and output description 7

Client Implementation Vector Data Loading on Map Openlayers Vector Layers Data in CRS WGS84 (non projected data) Layer sources: User s own data in Excel format Selection box on already loaded layers Linked GeoData fetched from Virtuoso server using fixed SPARQL queries Linked GeoData fetched from Virtuoso server using partially parameterized SPARQL queries 8

Client Implementation Kriging Execution Use of the panel for selecting Input layer Kriging parameters to construct the XML for the Execute operation Execute operation: XML transfer to WPS Container through an HTTP POST request Preview in PNG Format (one of process s output) Selected Data 9

Cloud implementation Using standard AWS load balancing New instance generated when CPU load reaches >75% Use of a shared/ distributed file system (GlusterFS) 10

Linked Open GeoData Linked Data (LD) Data inter-linked to each other Data identified by unique URIs Re-usability, interconnection and combination of data on the fly Linked Open (Geo)Data Linked (Geo)Data freely available and open on the Web 11

External Resources Linked (Open) Data as a Service Extensible Applica/on Pool A P I 12 Rel DB Visualiza*on Data Integra*on Layer Rel DB Collabora*on Query Update Import Export Query Engine Linked Data Excel files Cross Data sets' Querying XML files Abstrac*on layer for data access abstract the applica-ons from the specific setup of the data management service (such as local vs. remote, federa-on, and distribu-on) Beyond Data Access Enabling automa*on of discovery, composi*on, and use of datasets Data Markets Online Visualiza*on Services Data Publishing Solu*ons Data Aggregators BI / Analy*cs as a Service

Linked Data Services addxslmappings ldquery geoldquery tsv/csv xml json GeoJSON geoldtransform KML ShapeFile. addr2rmlmappings ldexport xml trig trix n3 formats ldimport Inspire compliant inspire_export/ inspire_query_export 13

Cloud implementation Replicated Data Repository WPS Load Balancer REST HTTP Calls Linked Data as a Service Load Balancer Using standard AWS load balancing New instance generated when CPU load exceeds 70% Use of Virtuoso RDF triplestore REST API calls are posted independently by each WPS instance 14

Conclusions Geoprocessing on the cloud WPS based on 52 North WPS implementation Kriging process based on R-Gstat Input fetched from Linked Open GeoData repository Usage of more processing and storage instances when needed 15

Future Work Addition of more geo-processes in WPS Full parallelization of geoprocessing algorithms, when possible Adoption of kriging algorithms under the MapReduce computing paradigm Work presented here is part of the InGeoClouds Project www.ingeoclouds.eu 16

Questions? Dimitrios.Kotzinos@u-cergy.fr 17