James Serra Sr BI Architect JamesSerra3@gmail.com http://jamesserra.com/

Similar documents
SQL Server PDW. Artur Vieira Premier Field Engineer

Building an Effective Data Warehouse Architecture James Serra

SQL Server Parallel Data Warehouse: Architecture Overview. José Blakeley Database Systems Group, Microsoft Corporation

HP Enterprise Data Warehouse Deep Dive. Steve Tramack, Sr. Engineering Manager, I2A Solutions, HP

Microsoft Analytics Platform System. Solution Brief

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.

PSAM, NEC PCIe SSD Appliance for Microsoft SQL Server (Reference Architecture) September 11 th, 2014 NEC Corporation

The HP Neoview data warehousing platform for business intelligence Die clevere Alternative

Parallel Data Warehouse

HP high availability solutions for Microsoft SQL Server Fast Track Data Warehouse using SQL Server 2012 failover clustering

Maximum performance, minimal risk for data warehousing

SAN Conceptual and Design Basics

Big Data Processing: Past, Present and Future

HP ProLiant DL580 Gen8 and HP LE PCIe Workload WHITE PAPER Accelerator 90TB Microsoft SQL Server Data Warehouse Fast Track Reference Architecture

SQL Server 2012 Parallel Data Warehouse. Solution Brief

MOC 20467B: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

Server Consolidation with SQL Server 2008

Upgrading to Microsoft SQL Server 2008 R2 from Microsoft SQL Server 2008, SQL Server 2005, and SQL Server 2000

Innovative technology for big data analytics

SMB Direct for SQL Server and Private Cloud

HP Enterprise Data Warehouse Appliance architecture overview and performance guide

LEARNING SOLUTIONS website milner.com/learning phone

Comparing SMB Direct 3.0 performance over RoCE, InfiniBand and Ethernet. September 2014

HP 85 TB reference architectures for Microsoft SQL Server 2012 Fast Track Data Warehouse: HP ProLiant DL980 G7 and P2000 G3 MSA Storage

All Silicon Data Warehouse: Violin Memory Fast Track Data Warehouse Reference Architecture. Installation and Configuration Guide

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

2009 Oracle Corporation 1

MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

TekSouth Fights US Air Force Data Center Sprawl with iomemory

SQL Server 2012 Performance White Paper

The Evolution of Microsoft SQL Server: The right time for Violin flash Memory Arrays

Introduction to Database as a Service

IBM PureData System for Transactions. Technical Deep Dive. Jonathan Rossi, PureSystems Specialist

Hadoop Size does Hadoop Summit 2013

Dell Microsoft SQL Server 2008 Fast Track Data Warehouse Performance Characterization

Planning the Installation and Installing SQL Server

Please give me your feedback

Evolving Solutions Disruptive Technology Series Modern Data Warehouse

Open-E Data Storage Software and Intel Modular Server a certified virtualization solution

QuickSpecs. What's New Support for two InfiniBand 4X QDR 36P Managed Switches

Oracle Exadata Database Machine for SAP Systems - Innovation Provided by SAP and Oracle for Joint Customers

Designing Business Intelligence Solutions with Microsoft SQL Server 2012 Course 20467A; 5 Days

Microsoft SQL Server 2014 Fast Track

The Pros and Cons of Data Warehouse Appliances

Quickly Deploy Microsoft Private Cloud and SQL Server 2012 Data Warehouse on Hitachi Converged Solutions. September 25, 2013

Performance characterization report for Microsoft Hyper-V R2 on HP StorageWorks P4500 SAN storage

Preview of Oracle Database 12c In-Memory Option. Copyright 2013, Oracle and/or its affiliates. All rights reserved.

Virtualizing Microsoft SQL Server 2008 on the Hitachi Adaptable Modular Storage 2000 Family Using Microsoft Hyper-V

A Breakthrough Platform for Next-Generation Data Warehousing and Big Data Solutions

Structured data meets unstructured data in Azure and Hadoop

Dell Virtualization Solution for Microsoft SQL Server 2012 using PowerEdge R820

SUN ORACLE EXADATA STORAGE SERVER

Brocade and EMC Solution for Microsoft Hyper-V and SharePoint Clusters

HP ProLiant BL660c Gen9 and Microsoft SQL Server 2014 technical brief

How To Use Hp Vertica Ondemand

SQLCAT Largest SQL Server Projects in the World

Private cloud computing advances

How To Write An Article On An Hp Appsystem For Spera Hana

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

SQL diagnostic manager Management Pack for Microsoft System Center. Overview

Optimizing Large Arrays with StoneFly Storage Concentrators

SQL Server Virtualization

Dell s SAP HANA Appliance

System Requirements. Version 8.2 November 23, For the most recent version of this document, visit our documentation website.

The Methodology Behind the Dell SQL Server Advisor Tool

Einsatzfelder von IBM PureData Systems und Ihre Vorteile.

Mind Q Systems Private Limited

Inge Os Sales Consulting Manager Oracle Norway

Instant-On Enterprise

High Availability Databases based on Oracle 10g RAC on Linux

SQL Server 2008 Performance and Scale

SQL Server 2012 Database Administration With AlwaysOn & Clustering Techniques

Michael Kagan.

Administering Microsoft SQL Server 2012 Databases

System Requirements Version 8.0 July 25, 2013

Implementing a Data Warehouse with Microsoft SQL Server 2012 (70-463)

The New Economics of SAP Business Suite powered by SAP HANA SAP AG. All rights reserved. 2

SQL Server 2005 Features Comparison

SQL Server What s New? Christopher Speer. Technology Solution Specialist (SQL Server, BizTalk Server, Power BI, Azure) v-cspeer@microsoft.

SAP HANA SAP s In-Memory Database. Dr. Martin Kittel, SAP HANA Development January 16, 2013

EMC Unified Storage for Microsoft SQL Server 2008

VMware vsphere 5.1 Advanced Administration

Course Outline. Module 1: Introduction to Data Warehousing

White Paper. Recording Server Virtualization

Deploying Microsoft SQL Server 2008 R2 with Logical Partitioning on the Hitachi Virtual Storage Platform with Hitachi Dynamic Tiering

Diablo and VMware TM powering SQL Server TM in Virtual SAN TM. A Diablo Technologies Whitepaper. May 2015

Exadata Database Machine

VMware vsphere 5.0 Boot Camp

SQL Server Consolidation Using Cisco Unified Computing System and Microsoft Hyper-V

Overview: X5 Generation Database Machines

Express5800 Scalable Enterprise Server Reference Architecture. For NEC PCIe SSD Appliance for Microsoft SQL Server

ONSITE TRAINING CATALOG

Transcription:

James Serra Sr BI Architect JamesSerra3@gmail.com http://jamesserra.com/

Our Focus: Microsoft Pure-Play Data Warehousing & Business Intelligence Partner Our Customers: Our Reputation: "B.I. Voyage came in and proved their subject matter expertise and worked well with our crew. Our CEO is now sold on the business value and corporate impact of having modern day business intelligence solutions!" email from Steve Bonanno CIO, Direct Edge I do count you guys as leading partners and this view is represented throughout the PDW business. email from Christian Kleinerman, Head of PDW Engineering Microsoft

Agenda Data Warehouse Fast Track Data Warehouse (FTDW) Business Data Warehouse Appliance (BDW) Business Decision Appliance (BDA) Database Consolidation Appliance (DBC) Parallel Data Warehouse (PDW)

Legacy applications + data marts = chaos Production Control MRP Inventory Control Parts Management Logistics Shipping Raw Goods Finance Marketing Sales Accounting Management Reporting Engineering Actuarial Enterprise data warehouse = order Continuity Consolidation Control Compliance Collaboration Single version of the truth Enterprise Data Warehouse Order Control Purchasing Human Resources Every question = decision

SERVER CACHE SQL SERVER WINDOWS CPU CORES FC SWITCH CACHE FC HBA FC HBA A B A B A B STORAGE CONTROLLER A B A B DISK DISK LUN DISK DISK LUN CPU Feed Rate SQL Server HBA Port Rate Switch Port Rate SP Port Rate Read Ahead Rate LUN Read Rate Disk Feed Rate

Guidance Reference Architectures, Fast Track brand Appliances Build it yourself Custom configurations High IT expertise Cooking recipe Very fast time to value Probably higher success No options (besides size ) Can be sold to customers Tied to HW vendor

Software: SQL Server 2008 R2 Enterprise Windows Server 2008 Configuration guidelines: Physical table structures Indexes Compression SQL Server settings Windows Server settings Loading Hardware: Tight specifications for servers, storage and networking Per core building block

You put together after receiving all the hardware (you need to install the OS, the edition of SQL Server that you ve purchased, and any other products such as SharePoint and PowerPivot for SharePoint) Eliminates guesswork and is designed to save you months of configuration, setup, testing and tuning

You put together after receiving all the hardware (you need to install the OS, the edition of SQL Server that you ve purchased, and any other products such as SharePoint and PowerPivot for SharePoint) Eliminates guesswork and is designed to save you months of configuration, setup, testing and tuning

Option Pros Cons 1. Basic Evaluation Very fast system set-up and procurement (days to weeks) Minimize cost of design and evaluation Lower infrastructure skill requirements Possibility of over-specified storage or under-specified CPU 2. Full Evaluation Predefined reference architecture tailored to expected workload Potential for cost-saving on hardware Increased confidence in solution Evaluation takes effort and time (weeks to months) Requires detailed understanding of target workload 3. User-defined Reference Architecture Potential to reuse existing hardware Potential to incorporate latest hardware System highly tailored for your usecase Process takes several months Requires significant infrastructure expertise Requires significant SQL Server expertise

These metrics are use to both validate and position Fast Track RA s Maximum Consumption Rate (MCR) Ability of SQL Server to process data for a specific CPU and Server combination and a standard SQL query Benchmark Consumption Rate (BCR) Ability of SQL Server to process data for a specific CPU and Server combination and a user workload or query User Data Capacity (UDC) Maximum available SQL Server storage for a specific Fast Track RA assuming 2.5:1 page compression factor and 300 GB 15K SAS. 30% of this storage should be reserved for DBA operations

Offered as a reference architecture

Months Design, Build & Deploy in weeks rather than months Design Build Deploy Custom-built solution Assess and understand workload Define architecture Evaluate alternatives Design specific implementation Acquire HW & SW components Build solution Load data Proof Of Concept & Validation Tune & Balance HW & SW Integrate in environment Burn in & Stability 1 2 3 4 Design Build Deploy Use Integrated & Optimized Appliance Choose appliance for workload Acquire appliance Install appliance Extract & load data Stand-up in production Monitor & Manage Extract and manipulate data Generate reports Make decisions 1 2 3 4 Monitor and troubleshoot Use Extract and manipulate data Generate reports Make decisions 5

Scale Out for both Performance and Capacity simultaneously by adding racks A prepackaged or pre-configured balanced set of hardware (servers, memory, storage and I/O channels), software (operating system, DBMS and management software), service and support, sold as a unit with built-in redundancy for high availability positioned as a platform for data warehousing. Control Rack 10 Node Data Rack HP PDW 1 Rack 17 Servers 22 Processors /132 Cores 125 TB HP PDW 4 Rack: 47 Servers 82 Processors / 492 Cores 500 TB

SMP HW advancements increasing ability to scale-up Scaling is limited High end SMP very expensive Extremely high concurrency for some workloads Less than 1-2 TB of data SMP will almost always be better. Usually <10TB Full SQL Server functionality HA must be architected in MPP with PDW HW advancements increasing ability to scale-up & scale-out Scaling to 1 PB+ Scale out is relatively low cost Relatively high concurrency for complex workloads > 10 TB (typically) up to 1 PB Limited SQL Server functionality HA is built in

Control Node (2) Single connection point for SQL queries. Single touch point for DBAs. Management Servers (2) Patch management. Active Directory. Node Image. Landing Zone (1) Server and storage dedicated to loading data. Backup Node (1) Server and storage dedicated to handling backup. Failover Clusters Dual networks Mirrored drives Hot swap drive Dual power supplies Dual cooling fans Storage Node (10) 11 or 24 Disks each. Dual network cards. Dedicated SAN. Compute Node (11) A SQL Server 2008 Instance. Highly Tuned SMP. 8 Cores each. 8 Disks each (TempDB).

Query 1?????????? Query 1 is standard T-SQL submitted to SQL Server on Control Node Query is executed on all 10 Nodes Results are sent back to client

??????????? L????????? L?????????? L???????? L???????? L?????????? L???????? L???????? Load L L???????? L???????? L???????? Multiple queries are simultaneously executed across all nodes. PDW supports querying while data is loading.

Replicated A table structure that exists as A full copy within each discrete DBMS instance. Distributed A table structure that is hashed on a single column and uniformly distributed across all nodes on the appliance. Each distribution is A separate physical table in the DBMS. Ultra shared nothing The ability to design a schema of both distributed and replicated tables to minimize data movement Small sets of data can be more efficiently stored in full. Certain set operations are more efficient against full sets of data.

As an end user or DBA you think about 1 table: LineItem. You run select * from LineItem PDW is an appliance, simple to use! You don t care or need to know that there are actually 320 tables representing your 1 logical table. That each of those 320 is using it own clustered index and has range partitioning.

Departmental Reporting SQL Server Regional Reporting Central EDW Hub High-Performance Reporting SQL Server FastTrack SQL Server Analysis Services Landing Zone ETL Tools

http://bit.ly/w1cah5 http://bit.ly/ywep9c

http://bit.ly/yyuelc http://bit.ly/y7bxy5 http://bit.ly/xazy9h http://bit.ly/ybz2qj http://bit.ly/aqmtm8 http://bit.ly/yks1pd http://bit.ly/wzssdd http://ibm.co/zmyihu http://bit.ly/y5deuy http://bit.ly/xutanu http://bit.ly/avzq79