A modern, flexible approach to Hadoop implementation incorporating innovations from HP Vertica & IDOL
|
|
|
- Joanna O’Connor’
- 10 years ago
- Views:
Transcription
1 A modern, flexible approach to Hadoop implementation incorporating innovations from HP Vertica & IDOL Gilles Noisette, HP EMEA Big Data CoE London 2015
2 Agenda Hadoop in the HP Big Data picture HP Platforms for Hadoop HP Reference Architectures for Hadoop HP Big Data Reference Architecture HP Haven & Hadoop HP Vertica Fast analytics on Hadoop HP IDOL Smart Hadoop Data Lake HP SecureData for Hadoop Trafodion SQL DBMS on Hadoop HP Big Data Services
3 The HP Haven Big Data Platform Powering Big Data Analytics to Applications Turn 100% of your data into action. Human Data Machine Data Business Data Haven Big Data Platform Insight Haven Enterprise SQL / BI / Reporting Predictive Analytics Machine Learning Log Analytics Search Image / Audio / Video Haven OnHadoop Secure Data Lake Exploration Open Data Format Governance Native support for MapR, Hortonworks & Cloudera Haven OnDemand Open APIs Rapid POCs & deployment Elastic / Multi-tenant Private Cloud-ready Pay-as-you-go HP Vertica, HP IDOL, KeyView, HP 3 Distributed R Predictive Analytics HP Vertica SQL on Hadoop, HP IDOL for Hadoop HP Vertica OnDemand & HP IDOL OnDemand
4 UID ProLiant DL380e Gen8 500 GB 500 GB UID GB 500 GB UID UID UID UID GB 500 GB UID ProLiant SL4540 Gen8 HP Big Data platform Hadoop centric view an HP company Analytics Data Intelligence Security SQL DBMS HP Vertica HP IDOL HP SecureData Trafodion Open Source Hadoop Ecosystem Open Source HP ProLiant / Converged Infrastructure DL380, Apollo 4200, Apollo 4530, Moonshot 1500, Network Cluster Operation HP BSM / HP DSM / HP CMU 4
5 HP Reference Architectures for Hadoop
6 UID ProLiant DL380e Gen8 UID UID UID UID UID UID ProLiant SL4540 Gen8 UID ProLiant DL380e Gen8 UID A B Moonshot 1500 HP Reference Architecture(s) for Hadoop Flexible, pre-approved & optimized configurations + Scaling from 4 to thousands of HP ProLiant Servers Sized to customer s workload and storage needs Impressive Processor and Storage density A set of pre-tested hardware components Processor, Drives, Network, 1TB/8TB disk size etc... Breakthrough economics, density, simplicity DL GB 500 GB Apollo GB 500 GB 500 GB 500 GB Apollo 4200 HP GbE HP GbE x 2 Network Switches 3 x DL360 Gen9 Head s 24 x HP ProLiant Apollo 4530 Worker s Moonshot 1500 HP Apollo 4000 example PB raw storage 630 TB Hadoop usable 756 Xeon E5 cores for a full rack 3.5 PB raw storage 900 TB Hadoop usable 960 Xeon E5 cores for a full rack 4.26 PB raw storage 1 PB Hadoop usable 756 Xeon E5 cores for a full rack 1620 Xeon E3 cores 3240 Linux CPUs for a full rack
7 HP Apollo Bringing Big Data storage server density to enterprise The enterprise bridge to Big Data - Available June 1, 2015 Storage density Plug and play Performance and efficiency Leadership storage density 28 LFF or 50 SFF HDD Enterprise bridge Fits traditional enterprise/sme rack server data centers deploy today, no cost of change Configuration flexibility Balanced capacity, performance and throughput with flexible options - Disks, CPUs, I/O and interconnects Highest storage density in a traditional 2U rack server TB 7
8 HP Apollo 4530 System - Massive density for Hadoop and Big Data Analytics Purpose-built for Hadoop and Big Data analytics - Available June 1, 2015 Analytics At scale Versatile performance Hadoop optimized 3 servers in 4U chassis ideal for Hadoop-based analytics with 3-copy data replication Efficient analytics scaling Up to 30 servers with 15 HDDs/SSDs each and 3.6 PB capacity per 42U rack For Big Data variety Customize for Hadoop workload variety and NoSQL analytics with disk, CPU, I/O and interconnect options Unleash the full value of Big Data with Hadoop 8
9 You need more than good servers to get a good cluster It s also about Networking and Cluster operation + HP Networking Network matters for Hadoop clusters HP s perfect Top of Rack and Aggregation switch offer Hadoop likes the HP Deep Buffer caching feature HP IRF simplifies architecture of server access networks and enables massive scalability HP FlexFabric 5930 Switch Series : 32 x 40GbE + 6 x 40G uplink ports family of high-density, ultra-low-latency Aggregation switches HP FlexFabric 5900 Switch Series : 48 x 10GbE + 4 x 40GbE ports Family of low-latency Top of Racks (ToR) switches HP Switch HP Insight Cluster Management utility Designed to operate top500 clusters Provision thousand of nodes in minutes Monitor clusters of any size (2D instant view, 3D time view) Control thousand of servers like one Perfectly fits Hadoop cluster operation needs 1GbE, 10GbE or 40GbE Hadoop cluster behavior real time analysis 9
10 HP Big Data Reference Architecture 10
11 Interesting released Hadoop feature Architecture trends YARN Labelling (-labels / jira YARN-796) Capability to create groups of similar nodes to run different types of applications with different workload, each, on the most appropriate group of node Admin tags nodes with labels (e.g.: GPU, Storm) One node can have more than one label (e.g.: GPU, m710) Applications can include labels in container requests I want a GPU Application Master 11 Manager [Storm] Manager [Analytic, XL230a] HP Apollo 6000 blade Manager [GPU, m710] Moonshot cartridge Enabling the next Generation of Hadoop Applications...
12 Interesting released Hadoop features Architecture trends HDFS Tiering / Heterogeneous Storage Tiers (HDFS-2832) For example, HBase can request that its data files (Hfiles) be stored on SSD. Then when HBase does writes and reads from HDFS, these requests will hit SSD and provide the latency requirements that HBase needs for supporting near real time applications. Phase2: HDFS Application APIs for heterogeneous storage HDFS SSD storage tier HDFS Memory as a storage tier (beta) HDFS Archival Storage Design (HDFS-6584) Introduces a new concept of storage policies. For accommodating future storage technology and different cluster characteristics, cluster administrators will be able to modify the predefined storage policies and/or define custom storage policies. Data policy names : Very Hot Hot Warm Luke Warm Cold 12
13 New approach to address Big Data demands Modern and Flexible Current traditional Big Data approach New HP Big Data approach Compute and storage are always collocated All servers are identical Data is partitioned across servers on direct-attached storage (DAS) Separate compute and storage tiers connected by Ethernet networking Standard Hadoop installed asymmetrically with storage components on the storage servers and yarn applications on the compute servers Compute Optimized Servers YARN Applications Two Socket, 2U Servers YARN Applications, HDFS, ORC Files, Parquet, Hbase, Cassandra HDFS, ORC Files, Parquet, Hbase, Cassandra Storage Optimized Servers 14
14 Benefits of HP Big Data Reference Architecture HP Moonshot and HP Apollo servers addresse a variety of enterprise big data needs Compute HP Moonshot Storage HP Apollo Ethernet (RoCE) Cluster consolidation Multiple big data environments can directly access a shared pool of data Flexibility to scale Scale compute and storage independently Maximum elasticity Rapidly provision compute without affecting storage Breakthrough economics Significantly better density, cost and power through workload optimized components 15
15 HP Apollo and Moonshot - HP Big Data Reference Architecture 2X Hadoop MapReduce performance with the same footprint 2.5X HBase performance with the same footprint 2 X Higher Density versus 20% more Memory 46% Less Power (Watts) Traditional architecture 16 Big Data Reference Architecture Note: Comparison configuration is ProLiant DL380 Gen9 servers
16 Maximum Elasticity for Big Data workloads Hadoop Labels feature (jira YARN-796) HP contributed IP into the Hadoop trunk Specifying labels on nodes allows for scheduling of YARN containers to specific pools of nodes - Admins able to target workloads at optimized platforms Combined with the HP Big Data Reference Architecture, compute nodes can be dynamically assigned - No data repartitioning 12am 6am Hadoop Cluster 1 Hadoop Cluster 2 6am 12am Hadoop Cluster 1 Hadoop Cluster 2 Vertica Analytics Spark 18 Storage Storage
17 HP Haven & Hadoop
18 HP IDOL for Hadoop To Build a Smarter data Lake
19 HP Intelligent Data Operating Layer (IDOL) The OS for human information Single processing layer to handle the continuum of human information Connect Understand Act & Automate Access virtually any source of information Form an understanding of information, including docs, s, databases, social media, rich media, etc. Over 500 functions to derive actionable insights aka: HP Autonomy IDOL 23
20 A Smarter Data Lake Needs HP IDOL Features Integration points with Hadoop Breakdown information silos across enterprise Understand myriad file formats and types Improved, intuitive visibility to contents Automatically analyse rich media Connectors & Policies KeyView + IDOL to Vertica IDOL Server (incl HDFS Sync) Image Server & Video Server Knowledge Graph Advanced Speech-to-Text 24
21 HP Vertica SQL on Hadoop Fast analytics on Hadoop
22 HP Vertica Analytics platform 7 High-performance data analytics platform purpose-built for big data - columnar database engine Blazing fast analytics Gain insight into your data in near-real time by running queries 50x -1,000x faster than legacy products Massive scalability - PBs Infinitely scale your solution by adding an unlimited number of industry-standard servers Open architecture Protect and embrace your investment in hardware and software with built-in support for Hadoop, R, and a range of ETL and BI tools Optimized data storage Store 10x-30x more data per server than row databases with patented columnar compression Load & analyze growing forms of semi-structured data Quickly and easily load, explore, analyze emerging and rapidly growing forms of semi-structured data. Easy Set-Up and Administration Get to market quickly with your analytics initiatives at low cost of administration and maintenance 26 Speed, scalability, and openness at lower TCO
23 HP Vertica Data Storage Options and Performance HP Vertica SQL on Hadoop Query Engine Vertica ANSI SQL-99 Vertica ANSI SQL-99 Vertica ANSI SQL-99 Vertica ANSI SQL-99 Vertica ANSI SQL-99 Format Vertica ROS Vertica ROS Hadoop Format* Flex Tables Flat Files File System EXT4 HDFS HDFS HDFS HDFS Fastest Analytics Performance Slowest Discovery Structured Semi-Structured *Supported Hadoop file formats : Parquet, ORC 29
24 HP Secure Data for Hadoop To Secure your data
25 HP SecureData Data-Centric Encryption and Tokenization HP SecureData Key Servers HP SecureData Central Management Console HP Stateless Key Management No key database to store or manage High performance, unlimited scalability Both encryption & tokenization technologies Format Preserving Encryption (FPE) for De-Identification Secure Stateless Tokenization (SST) for Payment Card Industry Customize solution to meet your exact requirements Broad Platform Support On-premise / cloud / Big Data Structured / Unstructured Linux, Hadoop, Windows, AWS, IBM z/os, HP NonStop, Teradata, etc Quick time-to-value Complete end-to-end protection within a common platform Format-preservation dramatically reduces implementation effort FPE AES HP SecureData Web Services API Tax ID 8juYE%Uks&dDFa2345^WFLERG HP SecureData Command Line and Automated Parsers Credit Card HP SecureData Native APIs (C, Java, C#,.NET) First Name: Gunther Last Name: Robertson SSN: DOB: First Name: Uywjlqo Last Name: Muwruwwbp SSN: DOB: Ija&3k24kQotugDF2390^32 0OWioNu2(*872weW Oiuqwriuweuwr%oIUOw1@ Tax ID SST Partial SST Obvious SST AZ UYTZ AZS-UXD
26 Options for Securing Data in Hadoop with HP Security Voltage Hadoop Cluster Applications & Data HP Security Voltage 1 4 Hadoop Jobs & Analytics Applications & Data 2 Landing Zone ETL & Batch HP Security Voltage HDFS 5 Hadoop Jobs & Analytics HP Security Voltage Applications, Analytics & Data Applications & Data Hadoop Jobs HP Security Voltage 6 Egress Zone ETL & Batch HP Security Voltage Applications, Analytics & Data 7 HP Security Voltage BI Tools & Downstream Applications Legend: Unprotected Data De-Identified Data Application with HP Security Voltage Interface Point Standard Application 35
27 HP Trafodion v1.0.0 ( Open Source since June 2014) Forrester - Mike Gualtieri (October 22nd, 2013) The Future of Hadoop is real time and transactional Doug Cutting (October 30th, 2013) We're in the middle of a revolution in data processing it is inevitable that we will see just about every kind of workload be moved to this platform even OnLine Transaction Processing (OLTP) Copyright Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
28 Addresses an under-served Hadoop market segment Operational Real-time insights SQL DBMS = OLTP + Summary BI Interactive Parameterized reports Drilldown visualization Exploration Non-interactive Data preparation Incremental batch processing Dashboards, scorecards Batch Current Market Focus: Data Warehousing and Analytics Operational batch processing Enterprise reports Data mining Trafodion Focus Sub-second Response Time Hours Adds Value to Hadoop 37 Transaction Support Data Integrity Real-time Performance Operational Optimizations Workload Management
29 Trafodion Trafodion is a joint HP Labs and HP-IT research project to develop operational SQL on Hadoop database capabilities Complete : Full-function SQL Reuse existing SQL skills and improve developer productivity Protected : Distributed ACID transactions Guarantees data consistency across multiple rows, tables, SQL statements Efficient : Optimized for low-latency read and write transactions Supports real-time, high concurrency, transaction processing applications Interoperable : Standard ODBC/JDBC access Works with existing tools and applications Open : Hadoop and Linux distribution neutral Easy to add to your existing infrastructure and no vendor lock-in Hadoop + Operational SQL Open source project sponsorship and investment from HP 38 Production ready version 1.0 release available at
30 HP Big Data Services
31 Advisory and Discovery Services for Big Data Advisory Our industry and technical experts can support people in technology assessments and strategy development. Big Data TW Used to define Big Data strategy Transformation Workshop format Discovery Workshop Used to identify/prioritize use-cases Validate functional and technical viability Discovery Experience Discovery Lab Time boxed engagement to run a pilot Based on use-cases from workshop Run on Haven cloud environment Insert a Haven lab in the customer ecosystem Platform, platform management and lab function management (on-premise or cloud) 41
32 HP Services for Hadoop Bringing value to the customer Technical Services Analytics Services Hadoop Roadmap Service Enterprise Design Services Advisory & Discovery Services Information Management Services Hadoop Proof of Concept Cluster Implementation Services Hadoop Solutions & Applications Development Data Science Services Support/Management Services Cluster Support Managed Services As-a-Service 42
33 Summary +
34 Summary HP offers industry leading capability for Hadoop Open systems Deep expertise Complete support Ongoing innovation Leading Partnerships Contribution to Apache community Collaboration with Hortonworks Full portfolio of consulting services Projects Moonshot HP ProLiant Gen9 HP Apollo Industry Standard Solutions HP Insight CMU HP BSM HP DSM Global Solution Center Haven Big Data Platform Designed for Big Data an HP company 45
35 Thank You
36 Learn more about HP Haven Solution brochure Technical white paper HP Vertica SQL on Hadoop FAQ Customer analytics use case 47
37 HP Big data Reference Architecture External Collateral White papers: HP Big Data Reference Architecture: A Modern Approach HP Big Data Reference Architecture: Cloudera Enterprise reference architecture implementation HP Big Data Reference Architecture: Hortonworks Data Platform reference architecture implementation Blog posts: HP Blog post (from Greg Battas) Hortonworks blog post Joseph George s blog post (The HP Big Data Reference Architecture: It s Worth Taking a Closer Look ) Silicon Angle Blog post Forrester Blog Post Videos: Steve Tramack interview on The Cube at Discover 48
38 Monitoring Hadoop with HP Insight Cluster Management Utility Hadoop worker-nodes Timed View Hadoop cluster behavior real time analysis 49
Trafodion Operational SQL-on-Hadoop
Trafodion Operational SQL-on-Hadoop SophiaConf 2015 Pierre Baudelle, HP EMEA TSC July 6 th, 2015 Hadoop workload profiles Operational Interactive Non-interactive Batch Real-time analytics Operational SQL
Enterprise Operational SQL on Hadoop Trafodion Overview
Enterprise Operational SQL on Hadoop Trafodion Overview Rohit Jain Distinguished & Chief Technologist Strategic & Emerging Technologies Enterprise Database Solutions Copyright 2012 Hewlett-Packard Development
How To Use Hp Vertica Ondemand
Data sheet HP Vertica OnDemand Enterprise-class Big Data analytics in the cloud Enterprise-class Big Data analytics for any size organization Vertica OnDemand Organizations today are experiencing a greater
Big Data Analytics: Today's Gold Rush November 20, 2013
Copyright 2013 Vivit Worldwide Big Data Analytics: Today's Gold Rush November 20, 2013 Brought to you by Copyright 2013 Vivit Worldwide Hosted by Bernard Szymczak Vivit Leader Ohio Chapter TQA SIG Copyright
News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren
News and trends in Data Warehouse Automation, Big Data and BI Johan Hendrickx & Dirk Vermeiren Extreme Agility from Source to Analysis DWH Appliances & DWH Automation Typical Architecture 3 What Business
Il mondo dei DB Cambia : Tecnologie e opportunita`
Il mondo dei DB Cambia : Tecnologie e opportunita` Giorgio Raico Pre-Sales Consultant Hewlett-Packard Italiana 2011 Hewlett-Packard Development Company, L.P. The information contained herein is subject
Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack
Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack HIGHLIGHTS Real-Time Results Elasticsearch on Cisco UCS enables a deeper
SQL Server 2012 Parallel Data Warehouse. Solution Brief
SQL Server 2012 Parallel Data Warehouse Solution Brief Published February 22, 2013 Contents Introduction... 1 Microsoft Platform: Windows Server and SQL Server... 2 SQL Server 2012 Parallel Data Warehouse...
HPE Vertica & Hadoop. Tapping Innovation to Turbocharge Your Big Data. #SeizeTheData
HPE Vertica & Hadoop Tapping Innovation to Turbocharge Your Big Data #SeizeTheData The HPE Vertica portfolio One Vertica Engine running on Cloud, Bare Metal, or Hadoop Data Nodes HPE Vertica OnDemand &
Oracle Big Data SQL Technical Update
Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical
Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>
s Big Data solutions Roger Wullschleger DBTA Workshop on Big Data, Cloud Data Management and NoSQL 10. October 2012, Stade de Suisse, Berne 1 The following is intended to outline
SEIZE THE DATA. 2015 SEIZE THE DATA. 2015
1 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. BIG DATA CONFERENCE 2015 Boston August 10-13 Predicting and reducing deforestation
Hadoop: Embracing future hardware
Hadoop: Embracing future hardware Suresh Srinivas @suresh_m_s Page 1 About Me Architect & Founder at Hortonworks Long time Apache Hadoop committer and PMC member Designed and developed many key Hadoop
Oracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya
Oracle Database - Engineered for Innovation Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya Oracle Database 11g Release 2 Shipping since September 2009 11.2.0.3 Patch Set now
Oracle Database 12c Plug In. Switch On. Get SMART.
Oracle Database 12c Plug In. Switch On. Get SMART. Duncan Harvey Head of Core Technology, Oracle EMEA March 2015 Safe Harbor Statement The following is intended to outline our general product direction.
Luncheon Webinar Series May 13, 2013
Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration
HP ConvergedSystem 900 for SAP HANA Scale-up solution architecture
Technical white paper HP ConvergedSystem 900 for SAP HANA Scale-up solution architecture Table of contents Executive summary... 2 Solution overview... 3 Solution components... 4 Storage... 5 Compute...
Data-Centric security and HP NonStop-centric ecosystems. Andrew Price, XYPRO Technology Corporation Mark Bower, Voltage Security
Title Data-Centric security and HP NonStop-centric ecosystems A breakthrough strategy for neutralizing sensitive data against advanced threats and attacks Andrew Price, XYPRO Technology Corporation Mark
Platfora Big Data Analytics
Platfora Big Data Analytics ISV Partner Solution Case Study and Cisco Unified Computing System Platfora, the leading enterprise big data analytics platform built natively on Hadoop and Spark, delivers
HP Big Data Reference Architecture: A Modern Approach
Technical white paper HP Big Data Reference Architecture: A Modern Approach HP BDRA with Apollo 2000 System and Apollo 4200 Servers Table of contents Executive summary... 2 Introduction... 2 Breaking the
Turning Data Into Answers With HP Vertica
Turning Data Into Answers With HP Vertica Sekher Seshadri March, 2014 Agenda Big Data Challenges and Opportunities HP Vertica Overview Customer Use Cases Q&A 2 Big Data Challenges & Opportunities Completing
Introducing Oracle Exalytics In-Memory Machine
Introducing Oracle Exalytics In-Memory Machine Jon Ainsworth Director of Business Development Oracle EMEA Business Analytics 1 Copyright 2011, Oracle and/or its affiliates. All rights Agenda Topics Oracle
Einsatzfelder von IBM PureData Systems und Ihre Vorteile.
Einsatzfelder von IBM PureData Systems und Ihre Vorteile [email protected] Agenda Information technology challenges PureSystems and PureData introduction PureData for Transactions PureData for Analytics
How To Write An Article On An Hp Appsystem For Spera Hana
Technical white paper HP AppSystem for SAP HANA Distributed architecture with 3PAR StoreServ 7400 storage Table of contents Executive summary... 2 Introduction... 2 Appliance components... 3 3PAR StoreServ
Ubuntu and Hadoop: the perfect match
WHITE PAPER Ubuntu and Hadoop: the perfect match February 2012 Copyright Canonical 2012 www.canonical.com Executive introduction In many fields of IT, there are always stand-out technologies. This is definitely
Datenverwaltung im Wandel - Building an Enterprise Data Hub with
Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees
Pentaho High-Performance Big Data Reference Configurations using Cisco Unified Computing System
Pentaho High-Performance Big Data Reference Configurations using Cisco Unified Computing System By Jake Cornelius Senior Vice President of Products Pentaho June 1, 2012 Pentaho Delivers High-Performance
Big Data Technologies Compared June 2014
Big Data Technologies Compared June 2014 Agenda What is Big Data Big Data Technology Comparison Summary Other Big Data Technologies Questions 2 What is Big Data by Example The SKA Telescope is a new development
Actian SQL in Hadoop Buyer s Guide
Actian SQL in Hadoop Buyer s Guide Contents Introduction: Big Data and Hadoop... 3 SQL on Hadoop Benefits... 4 Approaches to SQL on Hadoop... 4 The Top 10 SQL in Hadoop Capabilities... 5 SQL in Hadoop
Real-Time Big Data Analytics SAP HANA with the Intel Distribution for Apache Hadoop software
Real-Time Big Data Analytics with the Intel Distribution for Apache Hadoop software Executive Summary is already helping businesses extract value out of Big Data by enabling real-time analysis of diverse
HadoopTM Analytics DDN
DDN Solution Brief Accelerate> HadoopTM Analytics with the SFA Big Data Platform Organizations that need to extract value from all data can leverage the award winning SFA platform to really accelerate
TE's Analytics on Hadoop and SAP HANA Using SAP Vora
TE's Analytics on Hadoop and SAP HANA Using SAP Vora Naveen Narra Senior Manager TE Connectivity Santha Kumar Rajendran Enterprise Data Architect TE Balaji Krishna - Director, SAP HANA Product Mgmt. -
BIG DATA TRENDS AND TECHNOLOGIES
BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using onhand database management tools.
Microsoft Analytics Platform System. Solution Brief
Microsoft Analytics Platform System Solution Brief Contents 4 Introduction 4 Microsoft Analytics Platform System 5 Enterprise-ready Big Data 7 Next-generation performance at scale 10 Engineered for optimal
Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database
Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database Built up on Cisco s big data common platform architecture (CPA), a
Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
The Future of Data Management
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class
Convergence is accelerating the path to the New Style of Business
Convergence is accelerating the path to the New Style of Business Paul Durzan Vice President Converged Data Center Infrastructure Hewlett Packard Enterprise Copyright 2012 Hewlett-Packard Development Company,
Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments
Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Important Notice 2010-2015 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, Impala, and
Please give me your feedback
Please give me your feedback Session BB4089 Speaker Claude Lorenson, Ph. D and Wendy Harms Use the mobile app to complete a session survey 1. Access My schedule 2. Click on this session 3. Go to Rate &
ENTERPRISE-CLASS MONITORING SOLUTION FOR EVERYONE ALL-IN-ONE OPEN-SOURCE DISTRIBUTED MONITORING
ENTERPRISE-CLASS MONITORING SOLUTION FOR EVERYONE ALL-IN-ONE OPEN-SOURCE DISTRIBUTED MONITORING 1 CONTENTS About Zabbix Software... 2 Main Functions... 3 Architecture... 4 Installation Requirements...
HDP Hadoop From concept to deployment.
HDP Hadoop From concept to deployment. Ankur Gupta Senior Solutions Engineer Rackspace: Page 41 27 th Jan 2015 Where are you in your Hadoop Journey? A. Researching our options B. Currently evaluating some
Evolution from Big Data to Smart Data
Evolution from Big Data to Smart Data Information is Exploding 120 HOURS VIDEO UPLOADED TO YOUTUBE 50,000 APPS DOWNLOADED 204 MILLION E-MAILS EVERY MINUTE EVERY DAY Intel Corporation 2015 The Data is Changing
Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software
WHITEPAPER Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software SanDisk ZetaScale software unlocks the full benefits of flash for In-Memory Compute and NoSQL applications
Netezza and Business Analytics Synergy
Netezza Business Partner Update: November 17, 2011 Netezza and Business Analytics Synergy Shimon Nir, IBM Agenda Business Analytics / Netezza Synergy Overview Netezza overview Enabling the Business with
HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics
HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics ESSENTIALS EMC ISILON Use the industry's first and only scale-out NAS solution with native Hadoop
Hadoop & Spark Using Amazon EMR
Hadoop & Spark Using Amazon EMR Michael Hanisch, AWS Solutions Architecture 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda Why did we build Amazon EMR? What is Amazon EMR?
Enabling High performance Big Data platform with RDMA
Enabling High performance Big Data platform with RDMA Tong Liu HPC Advisory Council Oct 7 th, 2014 Shortcomings of Hadoop Administration tooling Performance Reliability SQL support Backup and recovery
Cost-Effective Business Intelligence with Red Hat and Open Source
Cost-Effective Business Intelligence with Red Hat and Open Source Sherman Wood Director, Business Intelligence, Jaspersoft September 3, 2009 1 Agenda Introductions Quick survey What is BI?: reporting,
Ganzheitliches Datenmanagement
Ganzheitliches Datenmanagement für Hadoop Michael Kohs, Senior Sales Consultant @mikchaos The Problem with Big Data Projects in 2016 Relational, Mainframe Documents and Emails Data Modeler Data Scientist
HP ProLiant BL660c Gen9 and Microsoft SQL Server 2014 technical brief
Technical white paper HP ProLiant BL660c Gen9 and Microsoft SQL Server 2014 technical brief Scale-up your Microsoft SQL Server environment to new heights Table of contents Executive summary... 2 Introduction...
Intel RAID SSD Cache Controller RCS25ZB040
SOLUTION Brief Intel RAID SSD Cache Controller RCS25ZB040 When Faster Matters Cost-Effective Intelligent RAID with Embedded High Performance Flash Intel RAID SSD Cache Controller RCS25ZB040 When Faster
HDP Enabling the Modern Data Architecture
HDP Enabling the Modern Data Architecture Herb Cunitz President, Hortonworks Page 1 Hortonworks enables adoption of Apache Hadoop through HDP (Hortonworks Data Platform) Founded in 2011 Original 24 architects,
SQL Server 2012 Performance White Paper
Published: April 2012 Applies to: SQL Server 2012 Copyright The information contained in this document represents the current view of Microsoft Corporation on the issues discussed as of the date of publication.
In-memory computing with SAP HANA
In-memory computing with SAP HANA June 2015 Amit Satoor, SAP @asatoor 2015 SAP SE or an SAP affiliate company. All rights reserved. 1 Hyperconnectivity across people, business, and devices give rise to
Microsoft Hybrid Cloud IaaS Platforms
IronPOD 400 Converged Systems Series Microsoft Hybrid Cloud IaaS Platforms IronPOD System 400 Series System Overview IRON Networks converged system products simplify Microsoft infrastructure deployments
Dell In-Memory Appliance for Cloudera Enterprise
Dell In-Memory Appliance for Cloudera Enterprise Hadoop Overview, Customer Evolution and Dell In-Memory Product Details Author: Armando Acosta Hadoop Product Manager/Subject Matter Expert [email protected]/
IRON Big Data Appliance Platform for Hadoop
IRON HDPOD Big Data Appliance Commodity Hadoop Cluster Platforms for Enterprises IRON Big Data Appliance Platform for Hadoop IRON Networks Big Data Appliance HDPOD is a comprehensive Hadoop Big Data platform,
Oracle Exadata: The World s Fastest Database Machine Exadata Database Machine Architecture
Oracle Exadata: The World s Fastest Database Machine Exadata Database Machine Architecture Ron Weiss, Exadata Product Management Exadata Database Machine Best Platform to Run the
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics
Real World Big Data Architecture - Splunk, Hadoop, RDBMS
Copyright 2015 Splunk Inc. Real World Big Data Architecture - Splunk, Hadoop, RDBMS Raanan Dagan, Big Data Specialist, Splunk Disclaimer During the course of this presentagon, we may make forward looking
Virtualizing Apache Hadoop. June, 2012
June, 2012 Table of Contents EXECUTIVE SUMMARY... 3 INTRODUCTION... 3 VIRTUALIZING APACHE HADOOP... 4 INTRODUCTION TO VSPHERE TM... 4 USE CASES AND ADVANTAGES OF VIRTUALIZING HADOOP... 4 MYTHS ABOUT RUNNING
The Future of Data Management with Hadoop and the Enterprise Data Hub
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah Cofounder & CTO, Cloudera, Inc. Twitter: @awadallah 1 2 Cloudera Snapshot Founded 2008, by former employees of Employees
Nutanix Solutions for Private Cloud. Kees Baggerman Performance and Solution Engineer
Nutanix Solutions for Private Cloud Kees Baggerman Performance and Solution Engineer Nutanix: Web-Scale Converged Infrastructure ü Founded in 2009 ü Now on fourth generation ü Core team from industry leaders
Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84
Index A Amazon Web Services (AWS), 50, 58 Analytics engine, 21 22 Apache Kafka, 38, 131 Apache S4, 38, 131 Apache Sqoop, 37, 131 Appliance pattern, 104 105 Application architecture, big data analytics
Successfully Deploying Alternative Storage Architectures for Hadoop Gus Horn Iyer Venkatesan NetApp
Successfully Deploying Alternative Storage Architectures for Hadoop Gus Horn Iyer Venkatesan NetApp Agenda Hadoop and storage Alternative storage architecture for Hadoop Use cases and customer examples
Reference Architecture, Requirements, Gaps, Roles
Reference Architecture, Requirements, Gaps, Roles The contents of this document are an excerpt from the brainstorming document M0014. The purpose is to show how a detailed Big Data Reference Architecture
The HP Neoview data warehousing platform for business intelligence
The HP Neoview data warehousing platform for business intelligence Ronald Wulff EMEA, BI Solution Architect HP Software - Neoview 2006 Hewlett-Packard Development Company, L.P. The inf ormation contained
Microsoft Private Cloud Fast Track Reference Architecture
Microsoft Private Cloud Fast Track Reference Architecture Microsoft Private Cloud Fast Track is a reference architecture designed to help build private clouds by combining Microsoft software with NEC s
Interactive data analytics drive insights
Big data Interactive data analytics drive insights Daniel Davis/Invodo/S&P. Screen images courtesy of Landmark Software and Services By Armando Acosta and Joey Jablonski The Apache Hadoop Big data has
RED HAT STORAGE PORTFOLIO OVERVIEW
RED HAT STORAGE PORTFOLIO OVERVIEW Andrew Hatfield Practice Lead Cloud Storage and Big Data MILCIS November 2015 THE RED HAT STORAGE MISSION To offer a unified, open software-defined storage portfolio
Real-Time Analytics for Big Market Data with XAP In-Memory Computing
Real-Time Analytics for Big Market Data with XAP In-Memory Computing March 2015 Real Time Analytics for Big Market Data Table of Contents Introduction 03 Main Industry Challenges....04 Achieving Real-Time
Session 1: IT Infrastructure Security Vertica / Hadoop Integration and Analytic Capabilities for Federal Big Data Challenges
Session 1: IT Infrastructure Security Vertica / Hadoop Integration and Analytic Capabilities for Federal Big Data Challenges James Campbell Corporate Systems Engineer HP Vertica [email protected] Big
CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data
Research Report CA Technologies Big Data Infrastructure Management Executive Summary CA Technologies recently exhibited new technology innovations, marking its entry into the Big Data marketplace with
HP BladeSystem Advantage over Cisco s UCS
HP BladeSystem Advantage over Cisco s UCS Priority #1: Enabling Applications Applications 2 Architectural Stability IT Best Practice: Out of Band Management The UCS way: In Band Management #1 Stability
SPEED your path to virtualization.
SPEED your path to virtualization. 2011 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice Introducing HP VirtualSystem Chief pillar of
2009 Oracle Corporation 1
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,
Securing Hadoop Data Big Data Everywhere - Atlanta January 27, 2015
Securing Hadoop Data Big Data Everywhere - Atlanta January 27, 2015 2015 Voltage Security, Inc. A History of Excellence Company: Founded in 2002 Out of Stanford University Based in Cupertino, California
Dell Reference Configuration for DataStax Enterprise powered by Apache Cassandra
Dell Reference Configuration for DataStax Enterprise powered by Apache Cassandra A Quick Reference Configuration Guide Kris Applegate [email protected] Solution Architect Dell Solution Centers Dave
HP HAVEn: See the big picture in Big Data
HP HAVEn: See the big picture in Big Data Table of contents The HAVEn vision All Big Data matters Profit from 100 percent of your data with HP HAVEn The HP HAVEn engines The Haven ecosystem Big Data usage
Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments
Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Important Notice 2010-2016 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, Impala, and
HP Vertica. Echtzeit-Analyse extremer Datenmengen und Einbindung von Hadoop. Helmut Schmitt Sales Manager DACH
HP Vertica Echtzeit-Analyse extremer Datenmengen und Einbindung von Hadoop Helmut Schmitt Sales Manager DACH Big Data is a Massive Disruptor 2 A 100 fold multiplication in the amount of data is a 10,000
Big Data. Value, use cases and architectures. Petar Torre Lead Architect Service Provider Group. Dubrovnik, Croatia, South East Europe 20-22 May, 2013
Dubrovnik, Croatia, South East Europe 20-22 May, 2013 Big Data Value, use cases and architectures Petar Torre Lead Architect Service Provider Group 2011 2013 Cisco and/or its affiliates. All rights reserved.
James Serra Sr BI Architect [email protected] http://jamesserra.com/
James Serra Sr BI Architect [email protected] http://jamesserra.com/ Our Focus: Microsoft Pure-Play Data Warehousing & Business Intelligence Partner Our Customers: Our Reputation: "B.I. Voyage came
Big Data Analytics - Accelerated. stream-horizon.com
Big Data Analytics - Accelerated stream-horizon.com Legacy ETL platforms & conventional Data Integration approach Unable to meet latency & data throughput demands of Big Data integration challenges Based
Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances
INSIGHT Oracle's All- Out Assault on the Big Data Market: Offering Hadoop, R, Cubes, and Scalable IMDB in Familiar Packages Carl W. Olofson IDC OPINION Global Headquarters: 5 Speen Street Framingham, MA
SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform
SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform David Lawler, Oracle Senior Vice President, Product Management and Strategy Paul Kent, SAS Vice President, Big Data What
A HIGH-PERFORMANCE, SCALABLE BIG DATA APPLIANCE LAURA CHU-VIAL, SENIOR PRODUCT MARKETING MANAGER JOACHIM RAHMFELD, VP FIELD ALLIANCES OF SAP
A HIGH-PERFORMANCE, SCALABLE BIG DATA APPLIANCE LAURA CHU-VIAL, SENIOR PRODUCT MARKETING MANAGER JOACHIM RAHMFELD, VP FIELD ALLIANCES OF SAP WEBTECH EDUCATIONAL SERIES A HIGH-PERFORMANCE, SCALABLE BIG
Overview: X5 Generation Database Machines
Overview: X5 Generation Database Machines Spend Less by Doing More Spend Less by Paying Less Rob Kolb Exadata X5-2 Exadata X4-8 SuperCluster T5-8 SuperCluster M6-32 Big Memory Machine Oracle Exadata Database
Copyright 2013, Oracle and/or its affiliates. All rights reserved.
1 Oracle SPARC Server for Enterprise Computing Dr. Heiner Bauch Senior Account Architect 19. April 2013 2 The following is intended to outline our general product direction. It is intended for information
Big Data and Its Impact on the Data Warehousing Architecture
Big Data and Its Impact on the Data Warehousing Architecture Sponsored by SAP Speaker: Wayne Eckerson, Director of Research, TechTarget Wayne Eckerson: Hi my name is Wayne Eckerson, I am Director of Research
Moving From Hadoop to Spark
+ Moving From Hadoop to Spark Sujee Maniyam Founder / Principal @ www.elephantscale.com [email protected] Bay Area ACM meetup (2015-02-23) + HI, Featured in Hadoop Weekly #109 + About Me : Sujee
IBM PureData System for Transactions. Technical Deep Dive. Jonathan Rossi, PureSystems Specialist [email protected]
IBM expert integrated system Technical Deep Dive Maria N. Schwenger, PureSystems Specialist [email protected] Jonathan Rossi, PureSystems Specialist [email protected] IBM PureData System for Transactions
Copyright 2012, Oracle and/or its affiliates. All rights reserved.
1 Oracle Big Data Appliance Releases 2.5 and 3.0 Ralf Lange Global ISV & OEM Sales Agenda Quick Overview on BDA and its Positioning Product Details and Updates Security and Encryption New Hadoop Versions
