May 2012 Oracle Spatial User Conference



Similar documents
<Insert Picture Here> Data Management Innovations for Massive Point Cloud, DEM, and 3D Vector Databases

Big Data Volume & velocity data management with ERDAS APOLLO. Alain Kabamba Hexagon Geospatial

Using PCI Geomatics Software with Oracle 10g Spatial and GeoRaster A PCI Geomatics Whitepaper

Big Data Spatial Performance with Oracle Database 12c A Practitioner s Panel

Oracle Spatial and Graph. Jayant Sharma Director, Product Management

<Insert Picture Here>

Oracle Spatial 10g. An Oracle White Paper August 2005

Unstructured Data Management with Oracle Database 12c O R A C L E W H I T E P A P E R S E P T E M B E R

An Oracle White Paper January Advanced Compression with Oracle Database 11g

2009 Oracle Corporation 1

Automatic Data Optimization

Geodatabase Programming with SQL

ADVANCED DATA STRUCTURES FOR SURFACE STORAGE

FME 2007 Release Giving Raster the Vector Treatment. By Mary Jo Wagner

Oracle Big Data Spatial and Graph

OLTP Meets Bigdata, Challenges, Options, and Future Saibabu Devabhaktuni

An Introduction to Open Source Geospatial Tools

Big Data Spatial Performance with Oracle Database 12c A Practitioner s Panel

Oracle Big Data Spatial and Graph: Spatial Features

Managing Imagery and Raster Data in ArcGIS

Enterprise Architectures for Large Tiled Basemap Projects. Tommy Fauvell

Oklahoma s Open Source Spatial Data Clearinghouse: OKMaps

Safe Harbor Statement

May 2012 Oracle Spatial User Conference

Program Agenda. Safe Harbor Statement. What are SecureFiles? Performance. Features DBFS. Internals. Wrap Up 11/22/2014

An Oracle White Paper February Managing Unstructured Data with Oracle Database 11g

In Memory Accelerator for MongoDB

ArcGIS Server Performance and Scalability Optimization and Testing. Andrew Sakowicz

Databases for 3D Data Management: From Point Cloud to City Model

Technologies Enabling the Evolution of a Smart City. Brad Schmidt, Intergraph Canada

GIS Databases With focused on ArcSDE

Project eharta: a collaborative initiative to digitally preserve and freely share old cartographic documents in Romania

RESEARCH ON SPATIAL DATABASE DESIGN AND TUNING BASED ON ORACLE AND ARCSDE

ArcSDE Configuration and Tuning Guide for Oracle. ArcGIS 8.3

Administering your Oracle Geodatabase

SkylineGlobe - Products Overview

#9011 GeoMedia WebMap Performance Analysis and Tuning (a quick guide to improving system performance)

Oracle Database 11g R1 & R2: New Features for Administrators

Advanced Image Management using the Mosaic Dataset

Getting Started with Oracle Spatial

Big Data and Analytics: Getting Started with ArcGIS. Mike Park Erik Hoel

Top 10 Performance Tips for OBI-EE

GeoPackage, The Shapefile Of The Future

An Oracle White Paper November Hybrid Columnar Compression (HCC) on Exadata

Informatica Master Data Management Multi Domain Hub API: Performance and Scalability Diagnostics Checklist

OTM Performance OTM Users Conference Jim Mooney Vice President, Product Development August 11, 2015

The ORIENTGATE data platform

NetCDF and HDF Data in ArcGIS

SuperGIS Server 3.2 Standard Edition Specification

Database Scalability and Oracle 12c

Oracle Platform GIS & Location-Based Services. Fred Louis Solution Architect Ohio Valley

ArcGIS. Image Server tutorial

Optimizing the Performance of the Oracle BI Applications using Oracle Datawarehousing Features and Oracle DAC

Oracle Database 12c: Performance Management and Tuning NEW

ArcGIS for Server Performance and Scalability: Testing Methodologies. Andrew Sakowicz, Frank Pizzi,

Oracle Database In-Memory The Next Big Thing

An Oracle White Paper July Advanced Compression Option (ACO) with Oracle Database 11g

<Insert Picture Here> Best Practices for Extreme Performance with Data Warehousing on Oracle Database

SAP HANA In-Memory Database Sizing Guideline

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

EMBL-EBI. Database Replication - Distribution

Server GIS. What Server GIS software do we provide?

SQL DATABASE PROGRAMMING (PL/SQL AND T-SQL)

Tips and Tricks for Using Oracle TimesTen In-Memory Database in the Application Tier

Enterprise GIS Architecture Deployment Options. Andrew Sakowicz

INTEROPERABLE IMAGE DATA ACCESS THROUGH ARCGIS SERVER

Geodatabase Tuning and Performance. Gillian Silvertand Greg Cunningham

Managing Lidar (and other point cloud) Data. Lindsay Weitz Cody Benkelman

Oracle8i Spatial: Experiences with Extensible Databases

What's new in gvsig Desktop 2.0

3D Data Management From Point Cloud to City Model GeoSmart Africa 2016, Cape Town

Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam

Introduction to Imagery and Raster Data in ArcGIS

Best Practices for Sharing Imagery using Amazon Web Services. Peter Becker

Oracle Database 11g Comparison Chart

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

<Insert Picture Here> Move to Oracle Database with Oracle SQL Developer Migrations

news from Tom Bacon about Monday's lecture

Boost Your Database Performance 10x with Oracle SecureFiles

Big Data and Its Impact on the Data Warehousing Architecture

Who am I? Copyright 2014, Oracle and/or its affiliates. All rights reserved. 3

Big Data: Using ArcGIS with Apache Hadoop. Erik Hoel and Mike Park

1 Copyright 2012, Oracle and/or its affiliates. All rights reserved. Public Information

Performance and Scalability Overview

Reading Raster Data with GDAL

System Requirements Table of contents

Department of Defence. Spatial Data Management Plan. DRMS Document Id: Version: Defence Support and Reform Group R

<Insert Picture Here> Oracle Database Directions Fred Louis Principal Sales Consultant Ohio Valley Region

An Oracle White Paper June High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database

ArcGIS for Server Performance and Scalability-Testing and Monitoring Tools. Amr Wahba

Oracle Architecture, Concepts & Facilities

WAN OPTIMIZATION. Srinivasan Padmanabhan (Padhu) Network Architect Texas Instruments, Inc.

Data Integrator Performance Optimization Guide

Transcription:

1 May 2012 Oracle Spatial User Conference

May 23, 2012 Ronald Reagan Building and International Trade Center Washington, DC USA

Siva Ravada Director of Development

New Performance Enhancements in Oracle Spatial

Program Agenda Introduction Faster Operators Improved GeoRaster operations Router and Geocoder enhancements Data Management Improvements Discussion

What are Performance Enhancements? Better utilization of DB resources and new features Securefiles, compression Improved and faster Algorithms for solving problems Spatial operators Tools to improve business workflows Transportable tablespaces Hardware related enhancements Exadata

Securefiles and SDO_GEOMETRY The SDO_ORDINATES in the geometry can be stored as securefiles lob CREATE TABLE MAP_WORLD_SEC ( COUNTRY" VARCHAR2(50), CAPITAL VARCHAR2(30), GEOMETRY MDSYS.SDO_GEOMETRY ) VARRAY "GEOMETRY"."SDO_ELEM_INFO" STORE AS SECUREFILE LOB VARRAY "GEOMETRY"."SDO_ORDINATES" STORE AS SECUREFILE LOB Several options for securefiles storage SECUREFILE LOB (CACHE) SECUREFILE LOB (CACHE COMPRESS) SECUREFILE LOB (CACHE COMPRESS HIGH) 7

Securefiles and SDO_GEOMETRY Test with a simple SQL operation select sum(sdo_util.getnumvertices(geometry)) from map_world_xxx Read Performance for different options 8

How to get the most out of Securefiles for SDO_GEOMETRY VARRAY s have 3 types of storage based on the size and storage options Always store as a lob (always stored in lob segments) Store small varrays (less than 4K) inline and store bigger varrays as out of line lob segments When the VARRAY storage is specified explicitly, it always stores it as a lob, even for smaller varrays Sepcify ENABLE STORAGE IN ROW to make sure inline storage is enabled for smaller VARRAYs Set this parameter in init.ora db_block_checking=false 9

Securefiles and GeoRaster Used the dataset of 50 DOQs to conduct the tests The results show very good performance improvement of SecureFiles over BasicFiles : load: mosaic: pyramiding: copy: changeformatcopy: subset: 1.16 times faster 2.5 times faster 3.3 times faster 4.15 times faster 4.23 times faster 3.4 times faster 10

Redo Log Enhancements Problem: Excessive redo generated for DML operations on tables with Spatial indexes MDRT tables store the index nodes as lobs and these lobs tend to generate lot of redo Several techniques are used to reduce the redo in 11.2.0.3 Reduce the updates to the index nodes Delayed write of the index nodes Better bulk insert/delete algorithm to reduce the number of modified nodes 11

Redo Log Enhancements Improvements with delayed write: 380K rows Normal Insert 1,509,711,160 07:34.09 00:06.26 Delayed Write 1,246,742,644 06:36.33 00:05.47 New redo algorithms New Code : 2,577,576,572 blocks; Elapsed: 00:12:24.92 Old Code : 1,034,054,352 blocks; Elapsed: 00:07:02.97 Real World proof points I/O reduced by about 90% for index insert/delete operations Throughput increased by 50% for DML operations 12

Redo Log Enhancements Improvements with delayed write: 380K rows Normal Insert 1,509,711,160 07:34.09 00:06.26 Delayed Write 1,246,742,644 06:36.33 00:05.47 New redo algorithms New Code : 2,577,576,572 blocks; Elapsed: 00:12:24.92 Old Code : 1,034,054,352 blocks; Elapsed: 00:07:02.97 Real World proof points I/O reduced by about 90% for index insert/delete operations Throughput increased by 50% for DML operations 13

Query Processing Enhancements Better use of R-tree indexing technology to improve the performance of all operators (in memory R-tree) Improve both Geodetic and non-geodetic operations Interior tile optimizations in 10.2 is based in quad-tree tiles This does not extend well to Geodetic data This new approach addresses 4 areas High performance Self-tuning no user tuning Small footprint high scalability for enterprise users Handle Earth s curved surface, poles, equator, international date line 14

Use in memory R-tree in two steps MBR (A) is from a R-tree index, either leaf node, or non-leaf node: 1) MBR (A) intersects polygon? 2) MBR (A) is inside polygon? A 15

Query Processing Enhancements Data Sets 1: An energy company: 39,000 locations in USA, a query polygon 4 times as big as Texas, 197,146 line segments 2: A transportation system: 3 million locations and 30 regions around world as query polygons: e.g. Canada/Mountain, American/Honolulu, Germany. average: 59,000 line segments, Norway 343,395 line segments 3: US Business Area data set: 10 million locations 50 US states, average: 1,755 line segments, Alaska 18,603 line segments; 1061 local regions, average: 520 line segments, Long Island : 6,915 line segments 16

Query Processing Enhancements: ANYINTERACT ANYINTERACT Operator (geode?c) Data Set 1 Data Set 2 Data Set 3 Data Set 4 10.2 204 20883 2559 11685 11.2.0.3 1.4 88 52 81 ANYINTERACT Operator (cartesian) Data Set 1 Data Set 2 Data Set 3 Data Set 4 10.2 71 7778 935 2158 11.2.0.3 2.44 190 58 64 17

Query Processing Enhancements: INSIDE/CONTAINS INSIDE Operator (geode?c) INSIDE Operator (cartesian) Data Set A 10.2 22 minutes 11.2.0.3 1 minute Data Set A 10.2 12 minutes 11.2.0.3 30 seconds Data set A: Table 1 has 260,520 geometries. Table 2 has 12,795 geometries. Both tables have polygon geometries. Experiment: Nested Loop join: 12,795 calls to the operator. 18

Query Processing Enhancements: SDO_WITHIN_DISTANCE WITHIN_DISTANCE Operator (geode?c) WITHIN_DISTANCE Operator (cartesian) Data Set with 10.2 2hr 28 minutes 11.2.0.3 4 minute Data Set 10.2 21 minutes 11.2.0.3 3 minutes Data set A: Table 1 has 260,520 geometries. Table 2 has 12,795 geometries. Both tables have polygon geometries. Experiment: Nested Loop join: 12,795 calls to the operator. 19

GeoRaster Load Performance

ETL tool the GDAL GeoRaster Driver GDAL is the best open source geospatial ETL tool/api for raster data It now natively supports importing and exporting many formats, to/from SDO_GEORASTER, including GeoTIFF, JPEG2000, ECW, NITF, HDF, NetCDF, ERDAS IMG, USGS DEM, SPOT, and more. GDAL is written in C++ and has very good performance It provides C/C++, Java, Python API for accessing GeoRaster It provides many tools. Two of the important ones are: gdal_translate utility to translate raster formats to/from GeoRaster objects gdalinfo utility to view information about a raster, such as a GeoRaster object 21

Understanding Image Structure Loading a 800 MB GeoTiff File is compressed LZW Uncompressed size is 5GB 4 band image Loading using GDAL with default blocking (256x256x4) takes 4 hours! Loading with a blocking of (256x256x1) takes 4 minutes! Most of the time is CPU: used by gdal_translate 22

Understanding Image Structure Check the structure of the input TIFF file: gdalinfo 129-286.tif Driver: GTiff/GeoTIFF Files: D:\Files\Data\129-286.tif Size is 42252, 31198... Band 1 Block=42252x32 Type=Byte, ColorInterp=Red Band 2 Block=42252x32 Type=Byte, ColorInterp=Green Band 3 Block=42252x32 Type=Byte, ColorInterp=Blue Band 4 Block=42252x32 Type=Byte, ColorInterp=Undefined Image is blocked in very wide and very narrow strips Very costly for the GeoTIFF driver to collect the pixels to fill our blocks! 23

GDAL and Memory GDAL uses little memory by default Set larger caches using GDAL_CACHEMAX option gdal_translate -of georaster sf1.tif --config GDAL_CACHEMAX 1024 georaster:scott/tiger@orcl111,us_rasters,georaster -co blockxsize=512 -co blockysize=512 -co blockbsize=3 -co interleave=bip -co srid=26943 -co "insert=values (1, 'sf1.tif', 'Aerial photo San Francisco 1', sdo_geor.init('us_rasters_rdt_01', 1),null,null)" Loading the 800 MB Tiff with a blocking of (256x256x4) now completes in less than 3 minutes! 24

Loading and Compression Option 1 Load and compress, then generate pyramid Option 2 Load, then generate pyramid, then compress the result The second option is more efficient 25

Q&A