HP Vertica and MicroStrategy 10: a functional overview including recommendations for performance optimization. Presented by: Ritika Rahate



Similar documents
SQL Server and MicroStrategy: Functional Overview Including Recommendations for Performance Optimization. MicroStrategy World 2016

Integrating MicroStrategy Analytics Platform with Microsoft SQL Server

Integrating MicroStrategy Analytics Platform with Oracle

Integrating MicroStrategy With Netezza

CitusDB Architecture for Real-Time Big Data

Oracle Database 11g Comparison Chart

The Sierra Clustered Database Engine, the technology at the heart of

SQL Server Parallel Data Warehouse: Architecture Overview. José Blakeley Database Systems Group, Microsoft Corporation

IBM DB2 Near-Line Storage Solution for SAP NetWeaver BW

Microsoft SQL Database Administrator Certification

SAP Business Objects Business Intelligence platform Document Version: 4.1 Support Package Data Federation Administration Tool Guide

Cass Walker TLG Learning

SAP HANA PLATFORM Top Ten Questions for Choosing In-Memory Databases. Start Here

Monitor and Manage Your MicroStrategy BI Environment Using Enterprise Manager and Health Center

Big Data Analytics in LinkedIn. Danielle Aring & William Merritt

ORACLE OLAP. Oracle OLAP is embedded in the Oracle Database kernel and runs in the same database process

MicroStrategy Course Catalog

Innovative technology for big data analytics

Microsoft Analytics Platform System. Solution Brief

Graph Mining on Big Data System. Presented by Hefu Chai, Rui Zhang, Jian Fang

SQL Server 2012 Gives You More Advanced Features (Out-Of-The-Box)

SAP HANA - Main Memory Technology: A Challenge for Development of Business Applications. Jürgen Primsch, SAP AG July 2011

SQL Server 2012 Performance White Paper

In-Memory Data Management for Enterprise Applications

Plug-In for Informatica Guide

Best Practices for Hadoop Data Analysis with Tableau

Inge Os Sales Consulting Manager Oracle Norway

SQL Server Administrator Introduction - 3 Days Objectives

Exploring the Synergistic Relationships Between BPC, BW and HANA

Performance and Scalability Overview

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

SQL Performance for a Big Data 22 Billion row data warehouse

Native Connectivity to Big Data Sources in MSTR 10

Vertica Live Aggregate Projections

ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION

!"#"$%&'(()!!!"#$%&'())*"&+%

INTRODUCING DRUID: FAST AD-HOC QUERIES ON BIG DATA MICHAEL DRISCOLL - CEO ERIC TSCHETTER - LEAD METAMARKETS

<Insert Picture Here> Best Practices for Extreme Performance with Data Warehousing on Oracle Database

Who am I? Copyright 2014, Oracle and/or its affiliates. All rights reserved. 3

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server

ORACLE DATABASE 10G ENTERPRISE EDITION

Performance and Scalability Overview

In-Memory Analytics: A comparison between Oracle TimesTen and Oracle Essbase

Optimizing the Performance of the Oracle BI Applications using Oracle Datawarehousing Features and Oracle DAC

SAP HANA SAP s In-Memory Database. Dr. Martin Kittel, SAP HANA Development January 16, 2013

BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB

System Protection for Hyper-V Whitepaper

Tiber Solutions. Understanding the Current & Future Landscape of BI and Data Storage. Jim Hadley

Oracle Database In-Memory The Next Big Thing

Power BI Performance. Tips and Techniques

OLAP Services. MicroStrategy Products. MicroStrategy OLAP Services Delivers Economic Savings, Analytical Insight, and up to 50x Faster Performance

<Insert Picture Here> Enhancing the Performance and Analytic Content of the Data Warehouse Using Oracle OLAP Option

Reporting Services. White Paper. Published: August 2007 Updated: July 2008

Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010

<Insert Picture Here> Oracle Retail Data Model Overview

Safe Harbor Statement

CHAPTER 5: BUSINESS ANALYTICS

How, What, and Where of Data Warehouses for MySQL

Session# - AaS 2.1 Title SQL On Big Data - Technology, Architecture and Roadmap

LearnFromGuru Polish your knowledge

<Insert Picture Here> Oracle and/or Hadoop And what you need to know

How to Choose Between Hadoop, NoSQL and RDBMS

Affordable, Scalable, Reliable OLTP in a Cloud and Big Data World: IBM DB2 purescale

SAP BO 4.1 COURSE CONTENT

The Methodology Behind the Dell SQL Server Advisor Tool

MS SQL Performance (Tuning) Best Practices:

System Protection for Hyper-V User Guide

Efficient Data Access and Data Integration Using Information Objects Mica J. Block

A Comprehensive Review of Self-Service Data Visualization in MicroStrategy. Vijay Anand January 28, 2014

Online Courses. Version 9 Comprehensive Series. What's New Series

SQL Server 2012 Business Intelligence Boot Camp

Tuning Tableau and Your Database for Great Performance PRESENT ED BY

SQL SERVER TRAINING CURRICULUM

Best Practices for Implementing Oracle Data Integrator (ODI) July 21, 2011

Oracle9i Database Release 2 Product Family

How To Manage A Database Server 2012

Monetizing Millions of Mobile Users with Cloud Business Analytics

SAP Business One analytics powered by SAP HANA An Overview

Integrating Apache Spark with an Enterprise Data Warehouse

This is a training module for Maximo Asset Management V7.1. It demonstrates how to use the E-Audit function.

Creating Connection with Hive

A Comparison of Approaches to Large-Scale Data Analysis

OBIEE 11g Data Modeling Best Practices

Building Advanced Data Models with SAP HANA. Werner Steyn Customer Solution Adoption, SAP Labs, LLC.

Creating BI solutions with BISM Tabular. Written By: Dan Clark

Up Your R Game. James Taylor, Decision Management Solutions Bill Franks, Teradata

SQL Server Replication Guide

Navigating the Big Data infrastructure layer Helena Schwenk

The Kentik Data Engine

Microsoft SQL Server 2008 Step by Step

Actian Vector in Hadoop

Trafodion Operational SQL-on-Hadoop

Oracle Big Data SQL Technical Update

In-memory data pipeline and warehouse at scale using Spark, Spark SQL, Tachyon and Parquet

CHAPTER 4: BUSINESS ANALYTICS

Managing User Accounts

Enhance your Analytics using Logical Data Warehouse and Data Virtualization thru SAP HANA smart data access SESSION CODE: 0210

Optimize Oracle Business Intelligence Analytics with Oracle 12c In-Memory Database Option

Sage Intelligence Financial Reporting for Sage ERP X3 Version 6.5 Installation Guide

Distributed Architecture of Oracle Database In-memory

Transcription:

HP Vertica and MicroStrategy 10: a functional overview including recommendations for performance optimization Presented by: Ritika Rahate

MicroStrategy Data Access Workflows There are numerous ways for MicroStrategy to interact with HP Vertica Ad-hoc Schema Live Connect In-Memory Cube Modeled Schema

MicroStrategy Is Most Commonly Used To Send Analytical Queries to HP Vertica Analytical Queries have specific characteristics that differentiate them from operational queries Analytical queries involve processing of massive quantities of data Accessing large data volumes Processing massive data volumes Typical Challenge Achieve interactive response time Vertica offers the following key features: Columnar Orientation Compression Projections Massively Parallel Processing

Schema Design Is Essential for Analytical Query Performance All key features are implemented as part of schema design Pro Tip: Use the Database Designer Tool offered by Vertica

MicroStrategy Unique Optimizations for HP Vertica Vertica-specific SQL syntax Analytical functions (OLAP functions) CASE expressions Full outer joins Set operators Subqueries Multi-pass SQL for Analytical Sophistication Use of temporary tables Use Read Optimized Storage Analyze statistics on temporary tables Middle-tier computation of calculations not available in Vertica Support for key Vertica features Massively Parallel Processing Label queries for simplified analysis High Availability and Load Balancing Secure connectivity Extensions to Vertica functionality Aggregate awareness with physical summary tables Middle-tier caching via In-Memory Cubes Report caching Application-level partitioning

MicroStrategy Generates Vertica-Specific SQL Syntax MicroStrategy integrates with HP Vertica s broad list of database functions and SQL functionality to improve analytical performance 99 35 37 Function patterns pushed down Unique VLDB properties Data types supported

MicroStrategy Unique Optimizations for HP Vertica Vertica-specific SQL syntax Analytical functions (OLAP functions) CASE expressions Full outer joins Set operators Subqueries Multi-pass SQL for Analytical Sophistication Use of temporary tables Use Read Optimized Storage Analyze statistics on temporary tables Middle-tier computation of calculations not available in Vertica Support for key Vertica features Massively Parallel Processing Label queries for simplified analysis High Availability and Load Balancing Secure connectivity Extensions to Vertica functionality Aggregate awareness with physical summary tables Middle-tier caching via In-Memory Cubes Report caching Application-level partitioning

MicroStrategy Generates Multi-Pass SQL Queries For Analytical Richness By default MicroStrategy creates temporary tables to hold intermediate result sets user-session visibility session-scoped data create local temporary table ZZSP00 on commit preserve rows as select a13.year_id YEAR_ID, a12.subcat_id SUBCAT_ID, sum(a11.tot_unit_sales) WJXBFS1 from ITEM_MNTH_SLS a11, LU_ITEM a12, LU_MONTH a13 where a11.item_id = a12.item_id and a11.month_id = a13.month_id group by a13.year_id, a12.subcat_id unsegmented all nodes data replication & distribution

Large Intermediate Result Sets Can Bypass Write Store for Better Performance Query hint forces storage to Read Optimized Storage (ROS) Data stored in the Read Optimized Storage(ROS) Highly organized by compression Indexed Example: create local temporary table ZZMD00 on commit preserve rows as select /*+ DIRECT */ a11.year_id YEAR_ID, sum(a11.tot_unit_sales) WJXBFS1 from ITEM_MNTH_SLS a11

Analyzing Large Intermediate Result Sets Improves Query Execution MicroStrategy can instruct Vertica to generate statistics on temporary tables Optimal Query plan Vertica Query Optimizer Analyze_statistics for temp table 101110010101

MicroStrategy Provides Middle-tier Computations for Analytical Sophistication Combining multiple insert statements, removes the overhead of parsing structurally identical statements repeatedly Row-by-Row Inserts are Slow Requires time-consuming locking/unlocking of table Bulk-Inserts are Fast Uses Parameterized Statements to insert blocks of data all at once vs. Row Insert Row Insert Row Insert Row Insert Bulk Insert

Enabling Parameterized Inserts in MicroStrategy Significantly Improves Response Time Parameterized inserts are enabled a DB Instance level in MicroStrategy Navigate to DB instance DB connection Click Modify to edit the Database Connection. Check Use parameterized queries is located on the Advanced tab

SQL Pass #N-1 SQL Pass #N-1 SQL Pass #3 Single SQL Pass SQL Pass #2 SQL Pass #2 SQL Pass #1 SQL Pass #1 SQL Pass #1 MicroStrategy Avoids Unnecessary Workload on Vertica Enabling SQL Global Optimization reduces the number of SQL passes improving query performance Before Global Optimization Redundant SQL Pass After Global Optimization Level 1 Redundant SQL Pass automatically removed Before Global Optimization Metric definitions force different SQL passes After Global Optimization Level 2 SQL Engine automatically combines different SQL passes into a single SQL pass FROM HAVING Category Sum(Sales)>50000 FROM HAVING Category Sum(Sales)>50000 FROM GROUPBY.. Sum(Revenue) ITEM_MTH_SLS FROM HAVING Category Sum(Sales)>50000 FROM GROUPBY.. Count(item) ITEM_MTH_SLS Units Sold Units Received FROM SQL Pass # N-2 SQL Pass # N-1 Units Sold Units Received FROM SQL Pass # N-3 SQL Pass # N-2 Sum(Revenue) Count(Item) FROM SQL Pass # 1 SQL Pass # 2 FROM Sum(Revenue) Count(Item) ITEM_MTH_SLS.

SQL Pass #4 Single SQL Pass SQL Pass #1 SQL Pass #2 SQL Pass #1 SQL Pass #1 SQL Pass #1 MicroStrategy Pushes Smart SQL to Vertica SQL Global Optimization is enabled by default for Vertica Before Global Optimization Filter Conditions Force Four Separate SQL Passes FROM SLS.REGION =LU_REGION.REGION AND REGION = Northeast FROM SLS.REGION =LU_REGION.REGION AND REGION = Central Northeast Revenue, Central Revenue, Southeast Revenue FROM SQL Pass # 1 SQL Pass # 2 SQL Pass # 3. Separate SQL Passes After Global Optimization Level 3 Resolve Filter Conditions into a Single SQL Pass FROM Sum(SLS.Revenue(IF LU_REGION.REGION = Northeast,0), Sum(SLS.Revenue(IF LU_REGION.REGION = Central), Sum(SLS.Revenue(IF LU_REGION.REGION = Southeast), SLS.REGION =LU_REGION.REGION Before Global Optimization Intermediate results stored in multiple temp tables CREATE TABLE ZZMD01 AS Category FROM Region = Northeast AND CREATE TABLE ZZMD02 AS Category. FROM Region = Central AND Intermediate Table ZZMD01 stores Northeast Categories Intermediate Table ZZMD02 stores Central Categories After Global Optimization Level 4 Intermediate results stored in one temp table CREATE TABLE ZZMD01 AS Category Sum(Revenue(IF.REGION = Northeast,0)), Sum(Revenue(IF REGION = Central,0)) FROM

MicroStrategy Unique Optimizations for HP Vertica Vertica-specific SQL syntax Analytical functions (OLAP functions) CASE expressions Full outer joins Set operators Subqueries Multi-pass SQL for Analytical Sophistication Use of temporary tables Use Read Optimized Storage Analyze statistics on temporary tables Middle-tier computation of calculations not available in Vertica Support for key Vertica features Massively Parallel Processing Label queries for simplified analysis High Availability and Load Balancing Secure connectivity Extensions to Vertica functionality Aggregate awareness with physical summary tables Middle-tier caching via In-Memory Cubes Report caching Application-level partitioning

Easily Identify MicroStrategy Workloads in Vertica for Profiling and Debugging purposes Monitoring MicroStrategy workloads in Vertica using Query Labels to ensure efficient utilization of available resources MicroStrategy queries can be easily identified in vertica using the /*+label(label-name)*/ hint These query label hints can be passed as a prefix to, INSERT statements. select identifier, projections_used, query_duration_us, query_start, user_name, processed_row_count from QUERY_PROFILES where identifier = 'MSTR_39C90CD7475A6AAFEFB125BE4FB8B0D7';

Configuring High Availability and Load Balancing in an MPP Environment Leverage Vertica-specific features Vertica cluster failover scenarios Initiator node goes down Session ends and the query is lost Executor node goes down If node goes down in the middle of processing a multi-pass sql job, a query failure error message is sent to the end user and the report has to be rerun If No results have been sent back yet. In this case the end user query will be processed by another node in the Vertica cluster

Connect Securely to HP Vertica MicroStrategy recommends using encrypted data connections Security features in Vertica Client Authentication Connection Encryption Client Authorization

Summary MicroStrategy and HP Vertica continue to have a strong partnership Multi-faceted technical integration of products Continued optimization provides a seamless reporting experience

Resources Link to integration paper: TN47683 Contact: Ritika Rahate rrahate@microstrategy.com

Questions