How to Move Your Business to Big Data: The Next Generation Enterprise Architecture



Similar documents
High Availability Databases based on Oracle 10g RAC on Linux

The Virtualization Practice

Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

Implementing Multi-Tenanted Storage for Service Providers with Cloudian HyperStore. The Challenge SOLUTION GUIDE

How To Use Arcgis For Free On A Gdb (For A Gis Server) For A Small Business

SQL Server Virtualization 101. David Klee, Group Principal and Practice Lead. SQL PASS Virtualization VC,

Data movement for globally deployed Big Data Hadoop architectures

Designing, Optimizing and Maintaining a Database Administrative Solution for Microsoft SQL Server 2008

ZADARA STORAGE. Managed, hybrid storage EXECUTIVE SUMMARY. Research Brief

ICAS4108B Complete database back-up and recovery

Introduction to Database as a Service

MS Design, Optimize and Maintain Database for Microsoft SQL Server 2008

Requirements Checklist for Choosing a Cloud Backup and Recovery Service Provider

NET ACCESS HIPAA COMPLIANT FLEXCloud

How To Run A Modern Business With Microsoft Arknow

Accelerating Wordpress for Pagerank and Profit

Brian LaGoe, Systems Administrator Benjamin Jellema, Systems Administrator Eastern Michigan University

MySQL Strategy. Morten Andersen, MySQL Enterprise Sales. Copyright 2014 Oracle and/or its affiliates. All rights reserved.

How To Run A Cloud Computer System

MANAGED EXCHANGE SOLUTIONS Secure, Scalable and Compliant Hosted Environments

Frequently Asked Questions about Cloud and Online Backup

Software-Defined Networks Powered by VellOS

VirtualclientTechnology 2011 July

Long term retention and archiving the challenges and the solution

Choosing Storage Systems

Simplifying Storage Operations By David Strom (published 3.15 by VMware) Introduction

Disaster Recovery Strategies

Non-Stop Hadoop Paul Scott-Murphy VP Field Techincal Service, APJ. Cloudera World Japan November 2014

Kent State University s Cloud Strategy

Brochure. Data Protector 9: Nine reasons to upgrade

Bringing the Cloud into Focus. A Whitepaper by CMIT Solutions and Cadence Management Advisors

Building Private Cloud Architectures

Upgrading Your SQL Server 2000 Database Administration (DBA) Skills to SQL Server 2008 DBA Skills Course 6317A: Three days; Instructor-Led

Cloud OS Vision. Modern platform for the world s apps

How To Write A Server On A Flash Memory On A Perforce Server

VMware Virtual Infrastucture From the Virtualized to the Automated Data Center

Course Outline: Course 6317: Upgrading Your SQL Server 2000 Database Administration (DBA) Skills to SQL Server 2008 DBA Skills

Daymark DPS Enterprise - Agentless Cloud Backup and Recovery Software

Guide to Database as a Service (DBaaS) Part 2 Delivering Database as a Service to Your Organization

Reference Architecture and Best Practices for Virtualizing Hadoop Workloads Justin Murray VMware

Big data management with IBM General Parallel File System

Expand Your Infrastructure with the Elastic Cloud. Mark Ryland Chief Solutions Architect Jenn Steele Product Marketing Manager

Hadoop in the Hybrid Cloud

Requirements Checklist for Choosing a Cloud Backup and Recovery Service Provider

XTM Web 2.0 Enterprise Architecture Hardware Implementation Guidelines. A.Zydroń 18 April Page 1 of 12

Backup and Recovery: The Benefits of Multiple Deduplication Policies

Cloud DBMS: An Overview. Shan-Hung Wu, NetDB CS, NTHU Spring, 2015

Introduction to Cloud : Cloud and Cloud Storage. Lecture 2. Dr. Dalit Naor IBM Haifa Research Storage Systems. Dalit Naor, IBM Haifa Research

Best Practices Report

How To Store Data On A Server Or Hard Drive (For A Cloud)

Microsoft SQL Database Administrator Certification

Microsoft SharePoint Architectural Models

The Porticor Virtual Private Data solution includes two or three major components:

EMC XtremSF: Delivering Next Generation Storage Performance for SQL Server

STORAGE CENTER WITH NAS STORAGE CENTER DATASHEET

Demystifying the Cloud Computing

Who is SharePoint Joel?

6231A - Maintaining a Microsoft SQL Server 2008 Database

Uncompromised business agility with Oracle, NetApp and VMware

Dimension Data Enabling the Journey to the Cloud

Data Backup and Restore (DBR) Overview Detailed Description Pricing... 5 SLAs... 5 Service Matrix Service Description

Unified network traffic monitoring for physical and VMware environments

Array Networks & Microsoft Exchange Server 2010

Oracle s Cloud Computing Strategy

Saving Millions through Data Warehouse Offloading to Hadoop. Jack Norris, CMO MapR Technologies. MapR Technologies. All rights reserved.

Highly available, scalable and secure data with Cassandra and DataStax Enterprise. GOTO Berlin 27 th February 2014

Redefining Backup for VMware Environment. Copyright 2009 EMC Corporation. All rights reserved.

Apache Hadoop. Alexandru Costan

VERITAS NetBackup 6.0 Enterprise Server INNOVATIVE DATA PROTECTION DATASHEET. Product Highlights

Business Continuity with the. Concerto 7000 All Flash Array. Layers of Protection for Here, Near and Anywhere Data Availability

Best Practices for Monitoring Databases on VMware. Dean Richards Senior DBA, Confio Software

Improve performance and availability of Banking Portal with HADOOP

Using EonStor FC-host Storage Systems in VMware Infrastructure 3 and vsphere 4

Best Practices for Using MySQL in the Cloud

Understanding ArcGIS Deployments in Public and Private Cloud. Marwa Mabrouk

COMPUTER MEASUREMENT GROUP - India Hyderabad Chapter. Strategies to Optimize Cloud Costs By Cloud Performance Monitoring

Big Data With Hadoop

Information Builders Mission & Value Proposition

Storage and Disaster Recovery

Product Overview. UNIFIED COMPUTING Managed Hosting - Storage Data Sheet

Oracle Database Backup Service. Secure Backup in the Oracle Cloud

TABLE OF CONTENTS THE SHAREPOINT MVP GUIDE TO ACHIEVING HIGH AVAILABILITY FOR SHAREPOINT DATA. Introduction. Examining Third-Party Replication Models

Providing Self-Service, Life-cycle Management for Databases with VMware vfabric Data Director

STREAM FRBC

Testing Automation for Distributed Applications By Isabel Drost-Fromm, Software Engineer, Elastic

Business white paper. environments. The top 5 challenges and solutions for backup and recovery

Explain how to prepare the hardware and other resources necessary to install SQL Server. Install SQL Server. Manage and configure SQL Server.

Exchange Data Protection: To the DAG and Beyond. Whitepaper by Brien Posey

MOC 20467B: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

WHITEPAPER. A Technical Perspective on the Talena Data Availability Management Solution

EMC XtremSF: Delivering Next Generation Performance for Oracle Database

Transcription:

How to Move Your Business to Big Data: The Next Generation Enterprise Architecture Jim Scott Director, Enterprise Strategy & Architecture @kingmesal #BigDataEverywhere London 1

Agenda Current State History Moving Forward The Next Enterprise Architecture Business Implications Concrete Implementations 2

Current State 2014 2014 MapR MapR Technologies Technologies 3

Study History to Prepare for the Future A data center was built The servers were statically partitioned If we want to break the cycle we have to break the partitions and become dynamic 4

Understanding the Why s Isolation of resources Assists in troubleshooting Prevents the analytics team from impacting production Maximum throughput of an application Guaranteed volume (maximum): compute, memory and storage Business Continuity We know exactly what is backed up, when, and where Difficult to perfect and to test 5

Issues with Isolated Workloads Segregated servers lead to under utilized hardware Wasted capacity and energy Complicated processes to move data to the required processing servers Operational impact, including extra monitoring Time delays moving data (not real-time) Troubleshooting time when there are issues Difficult to thoroughly test DEV vs. QA vs. Production Environments have different shapes and sizes They will not have identical configurations 6

Goals Moving Forward Leverage all existing hardware Create isolation in a different way Improve production operational processes Fix process of moving from DEV to QA to Production Support real-time business continuity 7

The Next Last Enterprise Architecture 2014 2014 MapR MapR Technologies Technologies 8

The Next Generation Enterprise Architecture Dynamic compute resources Common storage platform Real-time application support Flexible programming models Deployment management Solution based approach Applications to operate a business * This is a pluggable architecture 9

Technologies That Work 10

We Will Call This Architecture 11

What s in a Name The letter Z is the last letter in the English alphabet, but Zeta is not the last letter of the Greek alphabet But this is the last generalized architecture you will need. Sixth letter of the Greek alphabet Hexagon represents the 6 surrounding pieces Zeta represents the number 7 7 total components in this architecture Components work with a global resource manager 12

Origin Story of the Zeta Architecture Cultivated by Jim Scott Created the pretty diagrams Put a nice name on it Documented the concepts Not really a new concept Google pretty much pioneered these technology concepts They have never really discussed it cohesively in this way 13

Zeta Architecture at Google 14

Concrete Implementations 2014 2014 MapR MapR Technologies Technologies 15

Web Server Logs Web server generates logs Land on local disk Logs periodically rotated Shipped to other servers Run jobs on logs 16

Web Server Logs Web server generates logs Land on DFS Logs still rotate Logs now tolerant of a server failure prior to rotation Logs are instantaneously available for computation Run jobs on logs Data locality 17

Advertising Platform 18

Advertising Platform - Simplified 19

Advertising Platform on Zeta 20

Business Implications 2014 2014 MapR MapR Technologies Technologies 21

Integration of Existing Systems Use standards like NFS to connect existing systems Pluggable security models fit into your companies current standards Database that have been tested to work MySQL, Vertica, Sybase IQ ElasticSearch, SOLR May work, but not tested Oracle, DB2, SQL Server, PostgreSQL Applications in this architecture can still use them If they start supporting these technologies things change 22

Rethink the Data Center All Servers Run Mesos Participate in the Distributed File System Dynamic Allocation of Resources Spin up more web servers Custom Business Applications Big Data Analytics Data Locality No more shipping data Store and process the data where it was created 23

Simplified Architecture Less moving parts Less things to go wrong Better resource utilization Scale any application up or down on demand Common deployment model (new isolation model) Repeatability between environments (dev, qa, production) Shared file system Get at the data anywhere in the cluster Simplifies business continuity 24

Business Continuity Resilience Redundancy High Availability Spare Capacity Production Research Recovery Snapshots Disaster Recovery Datacenter 1 WAN Datacenter 2 Contingency Protect against the unforeseen Multisite Capability Production WAN EC2 25

Platform-wide Security and Compliance Authentication, Authorization, Auditing Users and jobs All tiers Data protection Wire-level encryption between servers Masking Regulatory Compliance Automatic expiration of old data Data locality supported by distributed file system 26

Net Benefit Reduced operating expenses (OPEX) Better utilization of available capacity and data center space Reduced capital expenses (CAPEX) Less total hardware needed Improves time to market Streamlined deployments Environments become consistent and predictable Delivers a competitive advantage Via platform scaling Performance improvements 27

Recap Saves valuable time and money Enables stronger business continuity capabilities Google has been doing this for years Real-time is the crux of everything Google does Time for the rest of us to operate at Google scale The technologies are there and they play together nicely Process changes must occur internally to achieve this architecture This approach will become the traditional way of thinking Don t get beat to it by your competitors 28

Go Forth and Implement the Zeta Architecture 29

Resources https://www.mapr.com/solutions/zeta-architecture http://radar.oreilly.com/2015/04/zeta-architecturehexagon-is-the-new-circle.html https://www.mapr.com/zeta-architecture https://www.mapr.com/data-centric-enterprise 30

$50M $50M in Free Training www.mapr.com/odt 31

Q & A Engage with us! @kingmesal maprtech mapr-technologies MapR jscott@mapr.com maprtech 32