320473 Databases & Web Applications Lab 320454 Big Data Project A

Similar documents
Adding Big Earth Data Analytics to GEOSS

Agile Analytics on Extreme-Size Earth Science Data

Agile Retrieval of Big Data with. EarthServer. ECMWF Visualization Week, Reading, 2015-sep-29

WCS as a Download Service for Big (and Small) Data

Handling Heterogeneous EO Datasets via the Web Coverage Processing Service

On the Efficient Evaluation of Array Joins

A Big Picture for Big Data

Big Data Volume & velocity data management with ERDAS APOLLO. Alain Kabamba Hexagon Geospatial

HPC technology and future architecture

<Insert Picture Here> Data Management Innovations for Massive Point Cloud, DEM, and 3D Vector Databases

The distribution of marine OpenData via distributed data networks and Web APIs. The example of ERDDAP, the message broker and data mediator from NOAA

NetCDF and HDF Data in ArcGIS

RDA PROPOSAL FOR Array Database Working Group (AD-WG) Peter Baumann, Jacobs University

VITO Centre of Image Processing

GIS Initiative: Developing an atmospheric data model for GIS. Olga Wilhelmi (ESIG), Jennifer Boehnert (RAP/ESIG) and Terri Betancourt (RAP)

PART 1. Representations of atmospheric phenomena

Databases for 3D Data Management: From Point Cloud to City Model

Use of OGC Sensor Web Enablement Standards in the Meteorology Domain. in partnership with

EED Task Order. Contract: NNG10HP02C Contractor: Raytheon Task Type:

Training for Big Data

CSE 544 Principles of Database Management Systems. Magdalena Balazinska (magda) Spring 2006 Lecture 1 - Class Introduction

Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel

Arrays in database systems, the next frontier?

Big Data in the context of Preservation and Value Adding

Cloud Computing and Advanced Relationship Analytics

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

GIS Databases With focused on ArcSDE

A New Cloud-based Deployment of Image Analysis Functionality

INTEROPERABLE IMAGE DATA ACCESS THROUGH ARCGIS SERVER

Mr. Apichon Witayangkurn Department of Civil Engineering The University of Tokyo

The USGS Landsat Big Data Challenge

The Arctic Observing Network and its Data Management Challenges Florence Fetterer (NSIDC/CIRES/CU), James A. Moore (NCAR/EOL), and the CADIS team

Where is... How do I get to...

Big Data in OpenTopography

Sextant. Spatial Data Infrastructure for Marine Environment. C. Satra Le Bris, E. Quimbert, M. Treguer

Norwegian Satellite Earth Observation Database for Marine and Polar Research USE CASES

NASA Earth System Science: Structure and data centers

GLOBAL DATA SPATIALLY INTERRELATE SYSTEM FOR SCIENTIFIC BIG DATA SPATIAL-SEAMLESS SHARING

MEETING TOMORROW S EARTH OBSERVATION CHALLENGES _ WHITE PAPER

Welcome to the first Workshop on Big data Open Source Systems (BOSS)

Use of ISO standards by NERC (a snapshot!)

Deploying ArcGIS for Server Using Managed Services

Developing Fleet and Asset Tracking Solutions with Web Maps

Scalable Distributed Service Integrity Attestation for Software-as-a-Service Clouds

A Web services solution for Work Management Operations. Venu Kanaparthy Dr. Charles O Hara, Ph. D. Abstract

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper

NASA Earth Science Research in Data and Computational Science Technologies Report of the ESTO/AIST Big Data Study Roadmap Team September 2015

Big Data R&D Initiative

Cloud-based Geospatial Data services and analysis

Analyse, Collaborate and Publish Statistics for Measuring Progress in our Society using Storytelling. The most ancient of social rituals

A standards-based open source processing chain for ocean modeling in the GEOSS Architecture Implementation Pilot Phase 8 (AIP-8)

Microsoft SQL Server 2012: What to Expect

Cleveland State University

MyOcean Copernicus Marine Service Architecture and data access Experience

ISQS 3358 BUSINESS INTELLIGENCE FALL 2014

ARIS 9ARIS 9.6 map and Future Directions Die nächste Generation des Geschäftsprozessmanagements

CSE 544 Principles of Database Management Systems. Magdalena Balazinska (magda) Winter 2009 Lecture 1 - Class Introduction

Oklahoma s Open Source Spatial Data Clearinghouse: OKMaps

One-Size-Fits-All: A DBMS Idea Whose Time has Come and Gone. Michael Stonebraker December, 2008

NASA s Big Data Challenges in Climate Science

How To Use Gis

Distributed Computing. Mark Govett Global Systems Division

MONTHLY REMINDERS FOR 2013

ECS 165A: Introduction to Database Systems

Cloud JPL Science Data Systems

Big Data Explained. An introduction to Big Data Science.

Integration of mobile automated monitoring systems with decision support tools for smart HAB management. VITO Jaap van Nes Göteborg, May 2015

Web and Mobile GIS Applications Development

CISC 432/CMPE 432/CISC 832 Advanced Database Systems

Metadata for Data Discovery: The NERC Data Catalogue Service. Steve Donegan

CSE 544 Principles of Database Management Systems. Magdalena Balazinska (magda) Fall 2007 Lecture 1 - Class Introduction

ENVI THE PREMIER SOFTWARE FOR EXTRACTING INFORMATION FROM GEOSPATIAL IMAGERY.

Transcription:

320473 Databases & Web Applications Lab 320454 Big Data Project A Instructor: Peter Baumann email: p.baumann@jacobs-university.de tel: -3178 office: room 88, Research 1 320302 Databases & Web Applications (P. Baumann)

Big Science Data [OGC Ocean Science Interoperability Experiment; image source: Mbari] 2

OGC Coverage Types Coverage = digital representation of space/time varying phenomenon n-d MultiSolid Coverage MultiSurface Coverage MultiCurve CoverageMultiPoint Coverage «FeatureType» Abstract Coverage Referenceable GridCoverage Grid Coverage Rectified GridCoverage 3

Facing the Coverage Deluge sensor feeds [OGC SWE] coverage server 4 4

Taming the Coverage Deluge sensor feeds [OGC SWE] coverage server 5 5

Let s Take a Closer Look... t Divergent access patterns for ingest and retrieval Server must mediate between access patterns 6

Our Research Large-Scale Scientific Information Services (L-SIS) Research Group flexible, scalable services on massive multi-dimensional scientific data Particular focus: n-d arrays Massive = multi-tb multi-pb per object Results: rasdaman array DBMS (www.rasdaman.org), demo at www.earthlook.org Geoservice standards: OGC WCS suite, http://external.opengeospatial.org/twiki_public/coveragesdwg/webhome ISO 9075 SQL Part 15: MDA (under work) 7

rasdaman: Scalable Array Analytics raster data manager : Array Database = SQL + n-d arrays select img.green[x0:x1,y0:y1] > 130 from LandsatArchive as img tile streaming architecture: scaling from laptop to cloud rasdaman Web visitors www.rasdaman.org 8

Use Case: Satellite ImageTime Series [Diedrich et al 2001] 9

EarthServer Big Earth Data Analytics Up to 130 TB databases for all Earth sciences + planetary science EU FP7-INFRA, 3 years, 5.85 meur Platform: rasdaman; strictly open standards Cryospheric Science landcover mapping Airborne Science high-altitude drones Atmospheric Science climate variables Geology geological models Oceanography marine model runs + in-situ data Planetary Science Mars geology 10

Database Visualization select encode( struct { red: (char) s.image.b7[x0:x1,x0:x1], green: (char) s.image.b5[x0:x1,x0:x1], blue: (char) s.image.b0[x0:x1,x0:x1], alpha: (char) scale( d.elev, 20 ) }, "image/png" ) from SatImage as s, DEM as d [JacobsU, Fraunhofer 2012; [data courtesy BGS, ESA] [JacobsU, Fraunhofer; data courtesy BGS, ESA] 11

Parallel / Distributed Query Processing ad-hoc federation mixed hardware Dataset D select max((a.nir - A.red) / (A.nir + A.red)) - max((b.nir - B.red) / (B.nir + B.red)) - max((c.nir - C.red) / (C.nir + C.red)) - max((d.nir - D.red) / (D.nir + D.red)) from A, B, C, D Dataset C Dataset A Dataset B 12

Secured Archive Integration First-ever direct, ad-hoc mix from protected NASA & ESA services in OGC WCS/WCPS Web client (EarthServer + CobWeb) 13

Demo 14

Next: On-Board Query Intelligence [OPS-SAT: ESA CubeSat] Democratize direct data access [imagery courtesy ESA, NASA] 15

Summary Project work embedded in international projects & collaborations Present Publish 16

Big Picture 320302 Databases and Web Applications Fall lecture, undergrad + grad Advanced course in spring: Information Architectures 320473 Databases and Web Applications Lab Lab, grad 320454 Big Data Project A Project, grad New meeting slot: Tue 09:45, Research 1, room 88 17

Project Task Pick a topic http://www.faculty.jacobs-university.de/pbaumann/iubremen.de_pbaumann//courses/researchtopics/ Perform task planful: Spec document 20% -- Sep 26 Oct 03 Prototype 1: breakthrough implementation 20% -- Oct 17 Prototype 2: ready for benchmark 20% -- Oct 31 Benchmark results 20% -- Nov 14 Publication 10% -- Nov 28 Prototype 3: ready for handover 10% -- Dec 05 18

Resources rasdaman website www.rasdaman.org demo www.earthlook.org Our publications http://www.faculty.jacobs-university.de/pbaumann/iu-bremen.de_pbaumann//pubs.php Instructor: p.baumann@......and the rasdaman team 19

Main Evaluation Criteria complete wrt. requirements Solid engineering bug-free, project & code documentation, coding quality,... user-friendliness and appealing look&feel complexity (in absolute terms and in comparison to other teams' work) Good writeup Specification, documentation, paper (no particular order) 20

Project/Lab Topics http://www.faculty.jacobs-university.de/pbaumann/iubremen.de_pbaumann/teaching.php -> course list -> list of topics 21