Cloudera Enterprise Data Hub. GCloud Service Definition Lot 3: Software as a Service

Size: px
Start display at page:

Download "Cloudera Enterprise Data Hub. GCloud Service Definition Lot 3: Software as a Service"

Transcription

1 Cloudera Enterprise Data Hub GCloud Service Definition Lot 3: Software as a Service December 2014

2 1 SERVICE OVERVIEW & SOLUTION Service Overview Introduction to Cloudera Cloudera Enterprise Overview Cloudera Enterprise Data Hub Components CDH Cloudera Manager Cloudera Navigator Cloudera Support INFORMATION ASSURANCE BACKUP/RESTORE AND DISASTER RECOVERY PROVISION ON-BOARDING AND OFF-BOARDING PROCESSES On-Boarding Off-Boarding SECURITY SERVICE MANAGEMENT DETAILS Technical Boundary Support Boundary User Authorization and Roles General Support details SERVICE CONSTRAINTS Planned Maintenance Emergency Maintenance SERVICE LEVELS Case Priority Definitions Support SLAs Escalation Timelines Award of Service Credits: Payment of Service Credits: Financial recompense TRAINING INVOICING PROCESS TERMINATION TERMS DATA EXTRACTION /REMOVAL CRITERIA Data standards in use Consumer generated data Data extraction Price of extraction...14

3 13.5 Purge & destroy DATA PROCESSING AND STORAGE LOCATION(S) DATA RESTORATION / SERVICE MIGRATION CUSTOMER RESPONSIBILITIES TECHNICAL REQUIREMENTS BROWSERS DETAILS OF ANY TRIAL SERVICE AVAILABLE ICT Greening Policy Compliance ICT Strategy Policy Compliance W3C Compliance... 16

4 1 SERVICE OVERVIEW & SOLUTION 1.1 Service Overview Cloudera Enterprise is a revolutionary data management platform that is designed specifically to address the opportunities and challenges of Big Data. Cloudera Enterprise combines Apache Hadoop with a number of other open source projects to create a single, massively scalable system where you can unite storage with an array of powerful processing and analytic frameworks - a vision we call the Enterprise Data Hub. By uniting flexible storage and processing under a single management framework and set of system resources, Cloudera delivers the versatility and agility required for modern data management - where you can ingest, store, process, explore and analyse data of any type or quantity without migrating it between multiple specialised systems. The Cloudera Enterprise Data Hub includes core Apache Hadoop functionality for flexible, scalable storage and data processing as well as several added value projects including Cloudera Impala for interactive SQL, Cloudera Search for unstructured search, Apache HBase for real-time NoSQL and Cloudera Navigator for data management (data discovery, metadata management, lineage and auditing). The platform also adds security features to Hadoop, enabling strong authentication, fine-grained authorisation and encryption. This Service Definition covers the Software Subscription offerings that can be provided by Cloudera as part of GCloud Lot 2. For information about the Cloudera Training and Services offerings, please refer to the Cloudera Service Definition for Lot 4.

5 1.2 Introduction to Cloudera Cloudera is the first and leading commercial provider of Apache Hadoop and the top contributor to the Hadoop open source community. Founded in June 2008 by leading experts on big data from Facebook, Yahoo!, Google, and Oracle. Cloudera s Chief Architect, Doug Cutting, is the original creator of Hadoop, and is on the board of the Apache Software Foundation CLOUDERA FOUNDED BY MIKE OLSON, AMR AWADALLAH 2009 CDH: FIRST COMMERCIAL APACHE HADOOP 2011 CLOUDERA REACHES 100 PRODUCTION 2012 CLOUDERA ENTERPRISE 4: THE STANDARD FOR HADOOP IN THE ENTERPRISE NOW TRANSFORMING HOW COMPANIES THINK ABOUT DATA CDH CLOUDERA MANAGER CLOUDERA ENTERPRISE 4 ASK BIGGER QUESTIONS 2009 HADOOP CREATOR DOUG CUTTING JOINS CLOUDERA 2010 CLOUDERA MANAGER: FIRST MANAGEMENT 2011 CLOUDERA UNIVERSITY EXPANDS TO CLOUDERA CONNECT Cloudera pioneered the business case for Hadoop with CDH, the world s most comprehensive, thoroughly tested and widely deployed 100% open source distribution of Apache Hadoop in both commercial and non-commercial environments. Now, the company is redefining data management with its Platform for Big Data, Cloudera Enterprise, empowering enterprises to Ask Bigger Questions and gain rich, actionable insights from all their data, to quickly and easily derive real business value that translates into competitive advantage. As the top contributor to the Apache open source community and leading educator of data professionals with the broadest array of Hadoop training and certification programs, Cloudera also offers comprehensive consulting services. Over 700 partners across a broad eco-system of hardware, software and services have teamed with Cloudera to help meet organizations big data goals. With tens of thousands of nodes under management and hundreds of customers across diverse markets, Cloudera is the category leader that has set the standard for Hadoop in the enterprise. Cloudera's goal is for CDH to serve as the industry standard for big data management in the enterprise, in order to realize this goal we must: Continue to develop a platform that is open. CDH is 100% open source Create a platform that has enterprise functionality and properties. CDH leads the industry in such functionality including the most extensive set of functionality for security, availability, recoverability and integration/extensibility.

6 Create a platform that supports a diverse ecosystem. CDH has a supporting ecosystem that is ~10X larger than the next closest distribution. Create a platform that supports an ever-broadening set of workloads. CDH has facilities for batch MapReduce workloads as well as interactive SQL and Search. Maintaining a rich commercial ecosystem of hardware, software and services providers is central to Cloudera's strategy. Today there are more than 700 companies that are Cloudera partners, a commercial ecosystem that is nearly 10X the size of the next closest competitor. Cloudera has maintained a certification program for the past 2 years where partners test that their solutions interoperate with Cloudera's platform. These certifications are valid for the life of a major CDH release and we assure compatibility from update to update. Cloudera also often develops joint roadmaps with key gold and platinum partners like SAS, Informatics, Teradata, Microstrategy and Oracle. The List of many of Cloudera partners: Cloudera has the most effective, experienced, and talented engineering team of any Big Data company. Cloudera additionally has the most committers and contributors to the open source Hadoop Ecosystem of any other Big Data company. Cloudera is ahead of all competition and intends to remain so by continuously innovating and providing value to its customers. 1.3 Cloudera Enterprise Overview Cloudera Enterprise helps you become information-driven by leveraging the best of the open source community with the enterprise capabilities you need to succeed with Apache Hadoop in your organization. Designed specifically for missioncritical environments, Cloudera Enterprise includes CDH, the world s most popular open source Hadoop-based platform, as well as advanced system management and data management tools plus dedicated support and community advocacy from our world-class team of Hadoop developers and experts. Cloudera is your partner on the path to big data. Cloudera Enterprise, with Apache Hadoop at the core, is: Unified one integrated system, bringing diverse users and application workloads to one pool of data on common infrastructure; no data movement required Secure perimeter security, authentication, granular authorization, and data protection Governed enterprise-grade data auditing, data lineage, and data discovery Managed native high-availability, fault-tolerance and self-healing storage, automated backup and disaster recovery, and advanced system and data management Open Apache-licensed open source to ensure your data and applications remain yours, and an open platform to connect with all of your existing investments in technology and skills The Cloudera Enterprise Data Hub provides: One massively scalable platform to store any amount or type of data, in its original form, for as long as desired or required Integrated with your existing infrastructure and tools Flexible to run a variety of enterprise workloads -- including batch processing, interactive SQL, enterprise search and advanced analytics Robust security, governance, data protection, and management that enterprises require With Cloudera Enterprise, today s leading organizations put their data at the center of their operations, to increase business visibility and reduce costs, while successfully managing risk and compliance requirements.

7 Cloudera Enterprise includes the following components: CDH: At the core of Cloudera Enterprise is CDH, which combines Apache Hadoop with a number of other open source projects to create a single, massively scalable system where you can unite storage with an array of powerful processing and analytic frameworks. Cloudera Manager: Cloudera Enterprise includes Cloudera Manager to help you easily deploy, manage, monitor, and diagnose issues with your cluster. Cloudera is critical for operating clusters at scale. Cloudera Support: Get the industry s best technical support for Hadoop. With Cloudera Support, you ll experience more uptime, faster issue resolution, better performance to support your mission critical applications, and faster delivery of the platform features you care about. Cloudera Enterprise also offers support for several advanced components that extend and complement the value of Apache Hadoop: Online NoSQL HBase: a distributed key-value store that helps you build real-time applications on massive tables (billions of rows, millions of columns) with fast, random access. Analytic SQL Impala: the industry s leading massively-parallel (MPP) SQL engine built for Hadoop. Search Cloudera Search lets your users query and browse data in Hadoop just they would search Google or your favorite e-commerce site. In-Memory Machine Learning and Stream Processing Apache Spark: delivers fast, in-memory analytics and realtime stream processing for Hadoop. Data Management Cloudera Navigator: provides critical enterprise data audit, lineage, and data discovery capabilities that enterprises require.

8 Cloudera Enterprise is available on a subscription basis in three editions, each designed for your specific needs. Basic Edition: Rely on superior support and advanced management for core Hadoop to run storage and batch processing in production environments. Flex Edition: Run dedicated applications built on your choice of advanced component. Data Hub Edition: Get everything you need to become information-driven, including unlimited use of every advanced component. Each edition is available with your choice of 8x5 or 24x7 support from the industry s leading team of Hadoop experts, licensed either per server, or per terabyte stored. Flex and Data Hub Editions also include open source indemnification, and an optional premium support extension for mission-critical environments. 1.4 Cloudera Enterprise Data Hub Components CDH CDH delivers everything you need for enterprise use right out of the box. By integrating Apache Hadoop with more than a dozen other critical open source projects, Cloudera has created a functionally advanced system that helps you perform endto-end Big Data workflows. The only solution with real time query & search Introduced high availability for HDFS in 2012 The most widely deployed & proven The broadest ecosystem of certified partners 100% open source & built for the enterprise Cloudera Manager As the industry s first and most sophisticated management application for Apache Hadoop, Cloudera Manager sets the standard for enterprise deployment by delivering granular visibility into and control over every part of CDH empowering operators to improve cluster performance, enhance quality of service, increase compliance and reduce administrative costs. As with any distributed computing or storage platform, deployment and ongoing administration of a Hadoop cluster can be difficult and time consuming. Deciding which components and versions to deploy based on use cases; assigning roles for nodes; effectively configuring, starting and managing services across the cluster; and performing diagnostics to optimize cluster performance require significant expertise and constant attention. Cloudera Manager is designed to make administration of CDH simple and straightforward, at any scale. With Cloudera Manager, you can easily deploy and centrally operate the complete Hadoop stack. The application automates the installation process, reducing deployment time from weeks to minutes; gives you a cluster-wide, real-time view of nodes and services running; provides a single, central console to enact configuration changes across your cluster; and incorporates a full range of reporting and diagnostic tools to help you optimize performance and utilization. Manage: Easily deploy, configure and operate clusters with centralized, intuitive administration for all services, host and workflows Monitor: Maintain a central view of all activity in the cluster through heat-maps, proactive health checks and alerts Diagnose: Easily diagnose and resolve issues with operational reports and dashboards, events, intuitive log viewing and search, audit trails and integration with Cloudera Support Integrate: Integrate Cloudera Manager with existing enterprise monitoring tools through SNMP, SMTP and a comprehensive API

9 1.4.3 Cloudera Navigator Cloudera Navigator is the only native end-to-end governance solution for Apache Hadoop-based systems. Through a single user interface, it provides visibility for administrators, data managers, data scientists, and analysts to secure, govern, and explore the large amounts of diverse data that land in Hadoop. Cloudera Navigator is part of Cloudera Enterprise s comprehensive data security and governance offering and is key to meeting compliance and regulatory requirements. Cloudera Navigator includes: Comprehensive, Unified Auditing Across Hadoop o Maintain a full audit history and track access for HDFS, Impala, Hive, HBase, and Sentry o Easily report on data access to meet regulatory requirements o Export audit information to global Security Information and Event Management (SIEM) systems to incorporate into infrastructure-wide reporting Unified, Searchable Technical and Business Metadata o Consolidate technical metadata for Hadoop files and tables o Easily track, classify, and locate data to comply with business governance and compliance rules Collect, View, and Share Lineage o Automatically collect, and view upstream and downstream column-level lineage in an easy-to-follow graph o Quickly identify the origin of a data set and its impact on downstream analysis o Export lineage to enterprise-wide lineage management systems Lifecycle Management o Define and automate complex data lifecycle activities, such as classification, retention, and encryption policies - all built on Navigator s rich business metadata foundation Comprehensive encryption and key management o Navigator encrypt provides transparent encryption for Hadoop data that is scalable and highly performant. Navigator key trustee provides a virtual safe-deposit box for managing encryption keys and other Hadoop security assets Cloudera Support Cloudera offers the industry s highest quality technical support for Hadoop. We have a dedicated team of support engineers comprised of contributors and committers for every component of CDH, our market-leading open source Apache Hadoop distribution. No one knows the Hadoop stack better or has more experience supporting large-scale clusters in production. With Cloudera Support behind you, you ll experience more uptime, faster issue resolution, better performance to support your mission critical applications, and faster delivery of the platform features you care about Dedicated team of experts with a global presence End-to-end coverage for the complete Cloudera platform - Contributors and committers for every part of CDH Tens of thousands of nodes under management across industry 8x5 or 24x7 service levels Proactive cluster optimization Regular releases Thorough documentation Rich knowledgebase Influence over open source roadmap

10 2 INFORMATION ASSURANCE Cloudera Enterprise includes components implementing perimeter security, authentication, granular authorization, and data protection as well as enterprise-grade data auditing, data lineage, and data discovery. 3 BACKUP/RESTORE AND DISASTER RECOVERY PROVISION By default, all data is replicated onto 3 servers for resilience. If backup and disaster recovery are required, this is usually provided by implementing two clusters in two separate location. Data can be replicated between the clusters using the Cloudera Backup and Disaster Recovery (BDR) facility. 4 ON-BOARDING AND OFF-BOARDING PROCESSES 4.1 On-Boarding On procurement of the service, a Cloudera Account Executive will contact the customer to arrange onboarding. This will include delivery of a license key to enable the enterprise features of the Cloudera software and onboarding of Primary Support Contact(s) from the customer organization. 4.2 Off-Boarding When the subscription ends (if it is not renewed), the license key will expire and the customer will no longer have access to the enterprise features in the Cloudera software or Cloudera Support. 5 SECURITY Cloudera Enterprise supports the following security features: Authentication via Kerberos, LDAP or Active Directory Authorisation can be controlled via Apache Sentry Auditing of services via Cloudera Navigator Encryption of data at rest via Navigator Encrypt and data in flight via SSL 6 SERVICE MANAGEMENT DETAILS 6.1 Technical Boundary Cloudera Support includes remote predictive, proactive and reactive support for Cloudera software as described in the Support Agreement. 6.2 Support Boundary Cloudera Support includes remote predictive, proactive and reactive support for Cloudera software as described in the Support Agreement. 6.3 User Authorization and Roles Authorisation of access to data can be controlled via Apache Sentry.

11 6.4 General Support details Cloudera offers the industry s highest quality technical support for Hadoop. We have a dedicated team of support engineers comprised of contributors and committers for every component of CDH, our market-leading open source Apache Hadoop distribution. No one knows the Hadoop stack better or has more experience supporting large-scale clusters in production. With Cloudera Support behind you, you ll experience more uptime, faster issue resolution, better performance to support your mission critical applications, and faster delivery of the platform features you care about Dedicated team of experts with a global presence End-to-end coverage for the complete Cloudera platform - Contributors and committers for every part of CDH Tens of thousands of nodes under management across industry 8x5 or 24x7 service levels Proactive cluster optimization Regular releases Thorough documentation Rich knowledgebase Influence over open source roadmap 7 SERVICE CONSTRAINTS 7.1 Planned Maintenance N/A Cloudera provides software and support for that software, but does not host the system or data. 7.2 Emergency Maintenance N/A Cloudera provides software and support for that software, but does not host the system or data.

12 8 SERVICE LEVELS The SLAs for Cloudera Support cases depend on the priority of the case. Priority levels and SLAs are described in the tables below: 8.1 Case Priority Definitions CASE PRIORITY CLOUDERA RESPONSIBILITIES CUSTOMER RESPONSIBILITIES DEFINITION P1 FOR 8x5 SUBSCRIPTION: Resources dedicated Monday through Friday during customer s local business hours until a resolution or workaround is in place. FOR 24x7 SUBSCRIPTION Resources dedicated 24x7 until a resolution or workaround is in place FOR 8x5 SUBSCRIPTION: Designated resources that are available Monday through Friday during customer s local business hours. Ability to provide necessary diagnostic information. FOR 24x7 SUBSCRIPTION Designated resources available 24x7 until a resolution or workaround is in place. Ability to provide necessary diagnostic information Total loss or continuous instability of functionality or inability to use a feature on a production system. Development systems do not apply here. Inability to use a feature or functionality that is currently relied upon for production functionality. P2 FOR 8x5 SUBSCRIPTION Resources available Monday through Friday during local business hours until a resolution or workaround is in place FOR 24x7 SUBSCRIPTION: Resources dedicated 24x7 until a resolution or workaround is in place FOR 8x5 SUBSCRIPTION Resources available Monday through Friday during local business hours until a resolution or workaround is in place. Ability to provide necessary diagnostic information. FOR 24x7 SUBSCRIPTION Designated resources available 24x7 until a resolution or workaround is in place. Ability to provide necessary diagnostic information Performance degraded or severely limited but not causing a total loss of functionality. Inability to deploy a feature that is not currently relied upon in a production environment. P3: Resources available Monday through Friday during local business hours until a resolution or workaround is in place Resources available Monday through Friday during local business hours until a resolution or workaround is in place. Ability to provide necessary diagnostic information. General questions. Workaround in place for Priority 1 and Priority 2 issues. P4 Solid understanding of the customer request documented in our systems for reviewed by Product Marketing Use cases for the feature request and specifics on requested functionality Feature Requests

13 8.2 Support SLAs CASE PRIORITY INITIAL RESPONSE TARGET 24x7SUBSCRIPTION UPDATE FREQUENCY TARGET 24x7 SUBSCRIPTION P1 Within 1 hour Updated every 4 hours P2 Within 2 hours Updated every business day P3 Within 8 hours Updated every 3 business days P4 Within 24 hours N/A, feature request CASE PRIORITY INITIAL RESPONSE TARGET 8x5 SUBSCRIPTION UPDATE FREQUENCY TARGET 8x5 SUBSCRIPTION P1 Within 1 business hour Updated every 4 business hours P2 Within 2 business hours Updated every business day P3 Within 8 business hours Updated every 3 business days P4 Within 2 business days N/A, feature request 8.3 Escalation Timelines CASE PRIORITY ESCALATION TIMELINE 24x7 SUBSCRIPTION ESCALATION TIMELINE 8x5 SUBSCRIPTION P1 Within 2 hours Within 2 business hours P2 Within 12 hours Within 12 business hours P3 Within 3 days Within 5 days P4 N/A N/A Business Days are defined as Monday-Friday, excluding holidays observed by Cloudera. 24x7 applies for Status Update Frequency only for P1s. For the rest of the priorities, you provide the same service irrespective of contract type. 8.4 Award of Service Credits: N/A 8.5 Payment of Service Credits: N/A

14 9 Financial recompense N/A 10 TRAINING Training is available as part of the Cloudera Professional Services and Training offering until GCloud Lot 4 (SCS). 11 INVOICING PROCESS See terms and conditions 12 TERMINATION TERMS See terms and conditions. 13 DATA EXTRACTION /REMOVAL CRITERIA 13.1 Data standards in use Cloudera Enterprise is based on the HDFS filesystem which is capable of storing any data type or file format including both structured and unstructured data Consumer generated data N/A Cloudera provides software and support for that software, but does not host the system or data Data extraction There are many ways of extracting data from Cloudera Enterprise e.g. HDFS APIs, HDFS shell commands, Apache Hue, JDBC/ODBC, Thrift/REST APIs for the various services Price of extraction N/A Cloudera provides software and support for that software, but does not host the system or data Purge & destroy N/A Cloudera provides software and support for that software, but does not host the system or data. 14 DATA PROCESSING AND STORAGE LOCATION(S) N/A Cloudera provides software and support for that software, but does not host the system or data.

15 15 DATA RESTORATION / SERVICE MIGRATION N/A Cloudera provides software and support for that software, but does not host the system or data. 16 CUSTOMER RESPONSIBILITIES See terms and conditions 17 TECHNICAL REQUIREMENTS All requirements and support versions for Cloudera Enterprise are listed in the online documentation. For Cloudera Manager this is here: pic_4_2_unique_1 And for CDH it is here: 18 BROWSERS The Cloudera Manager Admin Console, which you use to install, configure, manage, and monitor services, supports the following browsers: Mozilla Firefox 11 and higher Google Chrome Internet Explorer 9 and higher Safari 5 and higher 19 DETAILS OF ANY TRIAL SERVICE AVAILABLE A 60 day trial version of Cloudera Enterprise can be downloaded from the Cloudera website. 20 ICT Greening Policy Compliance Cloudera completely endorses the UK Government s policy to provide a cost effective and energy efficient ICT estate, which is fully exploited, with reduced environmental impacts to enable new and sustainable ways of working with our customers. We seek relationships in the delivery of services with those entities that have both ethical commitments to schemes endorsing Carbon reduction policies. 21 ICT Strategy Policy Compliance Cloudera is committed to supporting the UK Government s aspirations and objectives to improve both the image and performance of services provisioned through ICT resources. We anticipate that the types of services we offer will particularly support the development of and achievement of the Intelligent Customer Function through the provision of services that support informed decision making to maximise outcomes in the provision of services that support the Digital by Default strategy and associated policies.

16 22 W3C Compliance Cloudera is committed to continue to develop its services to support social inclusion and commits to continue to develop the provision of services that allow access to them. We have made significant investment to align our digital service offerings to best practice in delivering W3C based services. We are cognizant of the requirements for inclusiveness and our services consider these requirements as part of their delivery and where it is applicable.

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics ESSENTIALS EMC ISILON Use the industry's first and only scale-out NAS solution with native Hadoop

More information

The Future of Data Management

The Future of Data Management The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class

More information

More Data in Less Time

More Data in Less Time More Data in Less Time Leveraging Cloudera CDH as an Operational Data Store Daniel Tydecks, Systems Engineering DACH & CE Goals of an Operational Data Store Load Data Sources Traditional Architecture Operational

More information

Deploying an Operational Data Store Designed for Big Data

Deploying an Operational Data Store Designed for Big Data Deploying an Operational Data Store Designed for Big Data A fast, secure, and scalable data staging environment with no data volume or variety constraints Sponsored by: Version: 102 Table of Contents Introduction

More information

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Datenverwaltung im Wandel - Building an Enterprise Data Hub with Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees

More information

INDUSTRY BRIEF DATA CONSOLIDATION AND MULTI-TENANCY IN FINANCIAL SERVICES

INDUSTRY BRIEF DATA CONSOLIDATION AND MULTI-TENANCY IN FINANCIAL SERVICES INDUSTRY BRIEF DATA CONSOLIDATION AND MULTI-TENANCY IN FINANCIAL SERVICES Data Consolidation and Multi-Tenancy in Financial Services CLOUDERA INDUSTRY BRIEF 2 Table of Contents Introduction 3 Security

More information

Security Consultants / Security Managed Services

Security Consultants / Security Managed Services Security Consultants / Security Managed Services Service Definition Document for G-Cloudv7 Services October 2015 Table of Contents Service Overview...3 Our Approach... 3 Features... 3 Benefits... 4 ON-BOARDING

More information

The Future of Big Data SAS Automotive Roundtable Los Angeles, CA 5 March 2015 Mike Olson Chief Strategy Officer, Cofounder @mikeolson

The Future of Big Data SAS Automotive Roundtable Los Angeles, CA 5 March 2015 Mike Olson Chief Strategy Officer, Cofounder @mikeolson The Future of Big Data SAS Automotive Roundtable Los Angeles, CA 5 March 2015 Mike Olson Chief Strategy Officer, Cofounder @mikeolson 1 A New Platform for Pervasive Analytics Multiple big data opportunities

More information

Hadoop Trends and Practical Use Cases. April 2014

Hadoop Trends and Practical Use Cases. April 2014 Hadoop Trends and Practical Use Cases John Howey Cloudera jhowey@cloudera.com Kevin Lewis Cloudera klewis@cloudera.com April 2014 1 Agenda Hadoop Overview Latest Trends in Hadoop Enterprise Ready Beyond

More information

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes Highly competitive enterprises are increasingly finding ways to maximize and accelerate

More information

The Future of Data Management with Hadoop and the Enterprise Data Hub

The Future of Data Management with Hadoop and the Enterprise Data Hub The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah Cofounder & CTO, Cloudera, Inc. Twitter: @awadallah 1 2 Cloudera Snapshot Founded 2008, by former employees of Employees

More information

Data Governance in the Hadoop Data Lake. Michael Lang May 2015

Data Governance in the Hadoop Data Lake. Michael Lang May 2015 Data Governance in the Hadoop Data Lake Michael Lang May 2015 Introduction Product Manager for Teradata Loom Joined Teradata as part of acquisition of Revelytix, original developer of Loom VP of Sales

More information

Dell In-Memory Appliance for Cloudera Enterprise

Dell In-Memory Appliance for Cloudera Enterprise Dell In-Memory Appliance for Cloudera Enterprise Hadoop Overview, Customer Evolution and Dell In-Memory Product Details Author: Armando Acosta Hadoop Product Manager/Subject Matter Expert Armando_Acosta@Dell.com/

More information

CDH AND BUSINESS CONTINUITY:

CDH AND BUSINESS CONTINUITY: WHITE PAPER CDH AND BUSINESS CONTINUITY: An overview of the availability, data protection and disaster recovery features in Hadoop Abstract Using the sophisticated built-in capabilities of CDH for tunable

More information

Hadoop Ecosystem B Y R A H I M A.

Hadoop Ecosystem B Y R A H I M A. Hadoop Ecosystem B Y R A H I M A. History of Hadoop Hadoop was created by Doug Cutting, the creator of Apache Lucene, the widely used text search library. Hadoop has its origins in Apache Nutch, an open

More information

Data Governance in the Hadoop Data Lake. Kiran Kamreddy May 2015

Data Governance in the Hadoop Data Lake. Kiran Kamreddy May 2015 Data Governance in the Hadoop Data Lake Kiran Kamreddy May 2015 One Data Lake: Many Definitions A centralized repository of raw data into which many data-producing streams flow and from which downstream

More information

IBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems

IBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems IBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems Proactively address regulatory compliance requirements and protect sensitive data in real time Highlights Monitor and audit data activity

More information

How to avoid building a data swamp

How to avoid building a data swamp How to avoid building a data swamp Case studies in Hadoop data management and governance Mark Donsky, Product Management, Cloudera Naren Korenu, Engineering, Cloudera 1 Abstract DELETE How can you make

More information

The Enterprise Data Hub and The Modern Information Architecture

The Enterprise Data Hub and The Modern Information Architecture The Enterprise Data Hub and The Modern Information Architecture Dr. Amr Awadallah CTO & Co-Founder, Cloudera Twitter: @awadallah 1 2013 Cloudera, Inc. All rights reserved. Cloudera Overview The Leader

More information

Fighting Cyber Fraud with Hadoop. Niel Dunnage Senior Solutions Architect

Fighting Cyber Fraud with Hadoop. Niel Dunnage Senior Solutions Architect Fighting Cyber Fraud with Hadoop Niel Dunnage Senior Solutions Architect 1 Summary Big Data is an increasingly powerful enterprise asset and this talk will explore the relationship between big data and

More information

Data Warehouse as a Service. Lot 2 - Platform as a Service. Version: 1.1, Issue Date: 05/02/2014. Classification: Open

Data Warehouse as a Service. Lot 2 - Platform as a Service. Version: 1.1, Issue Date: 05/02/2014. Classification: Open Data Warehouse as a Service Version: 1.1, Issue Date: 05/02/2014 Classification: Open Classification: Open ii MDS Technologies Ltd 2014. Other than for the sole purpose of evaluating this Response, no

More information

Interactive data analytics drive insights

Interactive data analytics drive insights Big data Interactive data analytics drive insights Daniel Davis/Invodo/S&P. Screen images courtesy of Landmark Software and Services By Armando Acosta and Joey Jablonski The Apache Hadoop Big data has

More information

Enterprise IT is complex. Today, IT infrastructure spans the physical, the virtual and applications, and crosses public, private and hybrid clouds.

Enterprise IT is complex. Today, IT infrastructure spans the physical, the virtual and applications, and crosses public, private and hybrid clouds. ENTERPRISE MONITORING & LIFECYCLE MANAGEMENT Unify IT Operations Enterprise IT is complex. Today, IT infrastructure spans the physical, the virtual and applications, and crosses public, private and hybrid

More information

Cloudera Enterprise Data Hub in Telecom:

Cloudera Enterprise Data Hub in Telecom: Cloudera Enterprise Data Hub in Telecom: Three Customer Case Studies Version: 103 Table of Contents Introduction 3 Cloudera Enterprise Data Hub for Telcos 4 Cloudera Enterprise Data Hub in Telecom: Customer

More information

How To Handle Big Data With A Data Scientist

How To Handle Big Data With A Data Scientist III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution

More information

Luncheon Webinar Series May 13, 2013

Luncheon Webinar Series May 13, 2013 Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration

More information

// your essential partner CLOUD

// your essential partner CLOUD Benefit from business continuity with real-time replication of applications and data to a secure container in the cloud, which can be called into action within minutes. Protecting and ensuring the recoverability

More information

WHITEPAPER. A Technical Perspective on the Talena Data Availability Management Solution

WHITEPAPER. A Technical Perspective on the Talena Data Availability Management Solution WHITEPAPER A Technical Perspective on the Talena Data Availability Management Solution BIG DATA TECHNOLOGY LANDSCAPE Over the past decade, the emergence of social media, mobile, and cloud technologies

More information

Securing Your Enterprise Hadoop Ecosystem Comprehensive Security for the Enterprise with Cloudera

Securing Your Enterprise Hadoop Ecosystem Comprehensive Security for the Enterprise with Cloudera Securing Your Enterprise Hadoop Ecosystem Comprehensive Security for the Enterprise with Cloudera Version: 103 Table of Contents Introduction 3 Importance of Security 3 Growing Pains 3 Security Requirements

More information

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Important Notice 2010-2015 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, Impala, and

More information

G-CLOUD FRAMEWORK SERVICE DEFINITION. Kofax Model Office Bundle Proposal ISSUE 1

G-CLOUD FRAMEWORK SERVICE DEFINITION. Kofax Model Office Bundle Proposal ISSUE 1 G-CLOUD FRAMEWORK SERVICE DEFINITION Kofax Model Office Bundle Proposal ISSUE 1 Sept 2013 Table of Contents 1 SERVICE OVERVIEW & SOLUTION... 2 2 INFORMATION ASSURANCE... 3 3 BACKUP/RESTORE AND DISASTER

More information

Big Data Management and Security

Big Data Management and Security Big Data Management and Security Audit Concerns and Business Risks Tami Frankenfield Sr. Director, Analytics and Enterprise Data Mercury Insurance What is Big Data? Velocity + Volume + Variety = Value

More information

PAAS Public Sector Managed Services

PAAS Public Sector Managed Services Meritec Limited Meritec House, Acorn Business Park, Skipton, North Yorkshire, BD23 2UE 0845 3451155 servicepoint@meritec.co.uk www.meritec.co.uk Registered In England & Wales No. 3224622 Table of Contents

More information

Oracle Big Data Building A Big Data Management System

Oracle Big Data Building A Big Data Management System Oracle Big Building A Big Management System Copyright 2015, Oracle and/or its affiliates. All rights reserved. Effi Psychogiou ECEMEA Big Product Director May, 2015 Safe Harbor Statement The following

More information

G-Cloud 6 SERVICE DEFINITION

G-Cloud 6 SERVICE DEFINITION ORACLE CORPORATION UK LTD ( Oracle ) G-Cloud 6 SERVICE DEFINITION Date: [ 29 / 11] 2014 v. 1 This is Oracle s G-Cloud 6 Service Definition for the following service(s): Oracle Business Intelligence Cloud

More information

Apache Hadoop in the Enterprise. Dr. Amr Awadallah, CTO/Founder @awadallah, aaa@cloudera.com

Apache Hadoop in the Enterprise. Dr. Amr Awadallah, CTO/Founder @awadallah, aaa@cloudera.com Apache Hadoop in the Enterprise Dr. Amr Awadallah, CTO/Founder @awadallah, aaa@cloudera.com Cloudera The Leader in Big Data Management Powered by Apache Hadoop The Leading Open Source Distribution of Apache

More information

Vistara Lifecycle Management

Vistara Lifecycle Management Vistara Lifecycle Management Solution Brief Unify IT Operations Enterprise IT is complex. Today, IT infrastructure spans the physical, the virtual and applications, and crosses public, private and hybrid

More information

Audit Management. service definition document

Audit Management. service definition document Audit Management service definition document Contents Introduction... 3 Service Description... 3 Features and Benefits... 4 Architecture... 5 Service Delivery... 6 Service Provisioning Time... 7 Service

More information

Elastic Application Platform for Market Data Real-Time Analytics. for E-Commerce

Elastic Application Platform for Market Data Real-Time Analytics. for E-Commerce Elastic Application Platform for Market Data Real-Time Analytics Can you deliver real-time pricing, on high-speed market data, for real-time critical for E-Commerce decisions? Market Data Analytics applications

More information

Upcoming Announcements

Upcoming Announcements Enterprise Hadoop Enterprise Hadoop Jeff Markham Technical Director, APAC jmarkham@hortonworks.com Page 1 Upcoming Announcements April 2 Hortonworks Platform 2.1 A continued focus on innovation within

More information

Fighting Cyber Fraud with Hadoop. Niel Dunnage Senior Solutions Architect

Fighting Cyber Fraud with Hadoop. Niel Dunnage Senior Solutions Architect Fighting Cyber Fraud with Hadoop Niel Dunnage Senior Solutions Architect 1 Summary Big Data is an increasingly powerful enterprise asset with many potential user cases in this case we ll explore the relationship

More information

<Insert Picture Here> Big Data

<Insert Picture Here> Big Data Big Data Kevin Kalmbach Principal Sales Consultant, Public Sector Engineered Systems Program Agenda What is Big Data and why it is important? What is your Big

More information

G-Cloud Service Definition Canopy Big Data proof of concept Service SCS

G-Cloud Service Definition Canopy Big Data proof of concept Service SCS G-Cloud Service Definition Canopy Big Data proof of concept Service SCS Canopy Big Data proof of concept Service SCS Canopy Big Data Proof of Concept (PoC) Service is a consulting service that helps the

More information

WHAT S NEW IN SAS 9.4

WHAT S NEW IN SAS 9.4 WHAT S NEW IN SAS 9.4 PLATFORM, HPA & SAS GRID COMPUTING MICHAEL GODDARD CHIEF ARCHITECT SAS INSTITUTE, NEW ZEALAND SAS 9.4 WHAT S NEW IN THE PLATFORM Platform update SAS Grid Computing update Hadoop support

More information

IBM QRadar as a Service

IBM QRadar as a Service Government Efficiency through Innovative Reform IBM QRadar as a Service Service Definition Copyright IBM Corporation 2014 Table of Contents IBM Cloud Overview... 2 IBM/Sentinel PaaS... 2 QRadar... 2 Major

More information

WHITE PAPER LOWER COSTS, INCREASE PRODUCTIVITY, AND ACCELERATE VALUE, WITH ENTERPRISE- READY HADOOP

WHITE PAPER LOWER COSTS, INCREASE PRODUCTIVITY, AND ACCELERATE VALUE, WITH ENTERPRISE- READY HADOOP WHITE PAPER LOWER COSTS, INCREASE PRODUCTIVITY, AND ACCELERATE VALUE, WITH ENTERPRISE- READY HADOOP CLOUDERA WHITE PAPER 2 Table of Contents Introduction 3 Hadoop's Role in the Big Data Challenge 3 Cloudera:

More information

Enterprise-grade Hadoop: The Building Blocks

Enterprise-grade Hadoop: The Building Blocks Enterprise-grade Hadoop: The Building Blocks An Ovum white paper for MapR Publication Date: 24 Sep 2014 Author name Summary Catalyst Hadoop was initially developed for trusted environments that did not

More information

Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data

Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data Introduction to Hadoop HDFS and Ecosystems ANSHUL MITTAL Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data Topics The goal of this presentation is to give

More information

Driving Growth in Insurance With a Big Data Architecture

Driving Growth in Insurance With a Big Data Architecture Driving Growth in Insurance With a Big Data Architecture The SAS and Cloudera Advantage Version: 103 Table of Contents Overview 3 Current Data Challenges for Insurers 3 Unlocking the Power of Big Data

More information

Accelerating Enterprise Big Data Success. Tim Stevens, VP of Business and Corporate Development Cloudera

Accelerating Enterprise Big Data Success. Tim Stevens, VP of Business and Corporate Development Cloudera Accelerating Enterprise Big Data Success Tim Stevens, VP of Business and Corporate Development Cloudera 1 Big Opportunity: Extract value from data Revenue Growth x = 50 Billion 35 ZB Cost Savings Margin

More information

TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP

TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP Pythian White Paper TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP ABSTRACT As companies increasingly rely on big data to steer decisions, they also find themselves looking for ways to simplify

More information

Neocol E-Discovery Consulting Services

Neocol E-Discovery Consulting Services Neocol E-Discovery Consulting Services Service Definition Neocol Reference: 1.0 Version: 1.0 Date: 1 March 2013 1. Service Definition 1.1. Service Overview The E-Discovery Consulting Services address needs

More information

Databricks. A Primer

Databricks. A Primer Databricks A Primer Who is Databricks? Databricks vision is to empower anyone to easily build and deploy advanced analytics solutions. The company was founded by the team who created Apache Spark, a powerful

More information

Modern IT Operations Management. Why a New Approach is Required, and How Boundary Delivers

Modern IT Operations Management. Why a New Approach is Required, and How Boundary Delivers Modern IT Operations Management Why a New Approach is Required, and How Boundary Delivers TABLE OF CONTENTS EXECUTIVE SUMMARY 3 INTRODUCTION: CHANGING NATURE OF IT 3 WHY TRADITIONAL APPROACHES ARE FAILING

More information

Securing Your Enterprise Hadoop Ecosystem Comprehensive Security for the Enterprise with Cloudera

Securing Your Enterprise Hadoop Ecosystem Comprehensive Security for the Enterprise with Cloudera Securing Your Enterprise Hadoop Ecosystem Comprehensive Security for the Enterprise with Cloudera Version: 102 Table of Contents Introduction 3 Importance of Security 3 Growing Pains 3 Security Requirements

More information

Protecting Big Data Data Protection Solutions for the Business Data Lake

Protecting Big Data Data Protection Solutions for the Business Data Lake White Paper Protecting Big Data Data Protection Solutions for the Business Data Lake Abstract Big Data use cases are maturing and customers are using Big Data to improve top and bottom line revenues. With

More information

Big Data must become a first class citizen in the enterprise

Big Data must become a first class citizen in the enterprise Big Data must become a first class citizen in the enterprise An Ovum white paper for Cloudera Publication Date: 14 January 2014 Author: Tony Baer SUMMARY Catalyst Ovum view Big Data analytics have caught

More information

IBM Tivoli Storage Manager Suite for Unified Recovery

IBM Tivoli Storage Manager Suite for Unified Recovery Benefit from Business continuity with real-time replication of applications and data to a secure container in the cloud, which can be called into action within minutes Why do we choose to partner with

More information

Service Definition Nine23 MDM

Service Definition Nine23 MDM Service Definition Nine23 MDM G-Cloud iv Contents 1 Service Nine23 Mobile Device Management System.....4 1.1 Overview Nine23 MDM...... 4 1.2 Open Standards. 5 1.3 User requirements.....5 1.3.1 Client Browser....5

More information

Hadoop in the Hybrid Cloud

Hadoop in the Hybrid Cloud Presented by Hortonworks and Microsoft Introduction An increasing number of enterprises are either currently using or are planning to use cloud deployment models to expand their IT infrastructure. Big

More information

Hadoop & Spark Using Amazon EMR

Hadoop & Spark Using Amazon EMR Hadoop & Spark Using Amazon EMR Michael Hanisch, AWS Solutions Architecture 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda Why did we build Amazon EMR? What is Amazon EMR?

More information

GPS G-Cloud Lot 4: Oracle Business Intelligence Cloud Consultancy Service Definition

GPS G-Cloud Lot 4: Oracle Business Intelligence Cloud Consultancy Service Definition GPS G-Cloud Lot 4: Contents 1 Introduction... 3 2 Service... 4 2.1 Cloud Consultancy Overview... 4 2.2 Information assurance... 5 2.3 Backup/Restore and Disaster Recovery... 6 2.4 On-boarding and Off-boarding...

More information

Constructing a Data Lake: Hadoop and Oracle Database United!

Constructing a Data Lake: Hadoop and Oracle Database United! Constructing a Data Lake: Hadoop and Oracle Database United! Sharon Sophia Stephen Big Data PreSales Consultant February 21, 2015 Safe Harbor The following is intended to outline our general product direction.

More information

Powerful Duo: MapR Big Data Analytics with Cisco ACI Network Switches

Powerful Duo: MapR Big Data Analytics with Cisco ACI Network Switches Powerful Duo: MapR Big Data Analytics with Cisco ACI Network Switches Introduction For companies that want to quickly gain insights into or opportunities from big data - the dramatic volume growth in corporate

More information

EMC Federation Big Data Solutions. Copyright 2015 EMC Corporation. All rights reserved.

EMC Federation Big Data Solutions. Copyright 2015 EMC Corporation. All rights reserved. EMC Federation Big Data Solutions 1 Introduction to data analytics Federation offering 2 Traditional Analytics! Traditional type of data analysis, sometimes called Business Intelligence! Type of analytics

More information

Service Definition Document

Service Definition Document Service Definition Document QinetiQ Secure Cloud Protective Monitoring Service (AWARE) QinetiQ Secure Cloud Protective Monitoring Service (DETER) Secure Multi-Tenant Protective Monitoring Service (AWARE)

More information

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics In Organizations Mark Vervuurt Cluster Data Science & Analytics AGENDA 1. Yellow Elephant 2. Data Ingestion & Complex Event Processing 3. SQL on Hadoop 4. NoSQL 5. InMemory 6. Data Science & Machine Learning

More information

Information Architecture

Information Architecture The Bloor Group Actian and The Big Data Information Architecture WHITE PAPER The Actian Big Data Information Architecture Actian and The Big Data Information Architecture Originally founded in 2005 to

More information

The Big Data Ecosystem at LinkedIn Roshan Sumbaly, Jay Kreps, and Sam Shah LinkedIn

The Big Data Ecosystem at LinkedIn Roshan Sumbaly, Jay Kreps, and Sam Shah LinkedIn The Big Data Ecosystem at LinkedIn Roshan Sumbaly, Jay Kreps, and Sam Shah LinkedIn Presented by :- Ishank Kumar Aakash Patel Vishnu Dev Yadav CONTENT Abstract Introduction Related work The Ecosystem Ingress

More information

Cloudera in the Public Cloud

Cloudera in the Public Cloud Cloudera in the Public Cloud Deployment Options for the Enterprise Data Hub Version: Q414-102 Table of Contents Executive Summary 3 The Case for Public Cloud 5 Public Cloud vs On-Premise 6 Public Cloud

More information

G-Cloud Service Definition. Atos SI Oracle CRM and CX Services

G-Cloud Service Definition. Atos SI Oracle CRM and CX Services G-Cloud Service Definition Atos SI Oracle CRM and CX Services Atos SI Oracle CRM and CX Services SCS Atos provides a range of expert Customer Relationship Management (CRM) and Customer Experience (CX)

More information

GPG13 Protective Monitoring. Service Definition

GPG13 Protective Monitoring. Service Definition GPG13 Protective Monitoring Service Definition Issue Number V1.3 Document Date 27 November 2014 Author: D.M.Woodcock Classification UNCLASSIFIED Version G-Cloud 6 2014 Copyright Assuria Limited. All rights

More information

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Important Notice 2010-2016 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, Impala, and

More information

Software as a Service (SaaS) Online HR

Software as a Service (SaaS) Online HR Software as a Service (SaaS) Online HR Contents Service Definition... 3 An overview of the G-Cloud Service... 3 Key Service Attributes... 4 Information assurance... 4 Details of the level of backup/restore

More information

Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam sastry.vedantam@oracle.com

Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam sastry.vedantam@oracle.com Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam sastry.vedantam@oracle.com Agenda The rise of Big Data & Hadoop MySQL in the Big Data Lifecycle MySQL Solutions for Big Data Q&A

More information

Apache Hadoop: Past, Present, and Future

Apache Hadoop: Past, Present, and Future The 4 th China Cloud Computing Conference May 25 th, 2012. Apache Hadoop: Past, Present, and Future Dr. Amr Awadallah Founder, Chief Technical Officer aaa@cloudera.com, twitter: @awadallah Hadoop Past

More information

Apache Hadoop: The Big Data Refinery

Apache Hadoop: The Big Data Refinery Architecting the Future of Big Data Whitepaper Apache Hadoop: The Big Data Refinery Introduction Big data has become an extremely popular term, due to the well-documented explosion in the amount of data

More information

Spektrix Service Definition

Spektrix Service Definition Spektrix Service Definition An overview of the G-Cloud Service Spektrix is a cloud-based ticketing and marketing software package provided as Software as a Service. It is designed for arts and entertainment

More information

Oracle Big Data SQL Technical Update

Oracle Big Data SQL Technical Update Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical

More information

Apache Sentry. Prasad Mujumdar prasadm@apache.org prasadm@cloudera.com

Apache Sentry. Prasad Mujumdar prasadm@apache.org prasadm@cloudera.com Apache Sentry Prasad Mujumdar prasadm@apache.org prasadm@cloudera.com Agenda Various aspects of data security Apache Sentry for authorization Key concepts of Apache Sentry Sentry features Sentry architecture

More information

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook Hadoop Ecosystem Overview CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook Agenda Introduce Hadoop projects to prepare you for your group work Intimate detail will be provided in future

More information

WHITE PAPER. Hadoop and HDFS: Storage for Next Generation Data Management. Version: Q414-102

WHITE PAPER. Hadoop and HDFS: Storage for Next Generation Data Management. Version: Q414-102 Storage for Next Generation Data Management Version: Q414-102 Table of Content Storage for the Modern Enterprise 3 The Challenges of Big Data 5 Data at the Center of the Enterprise 6 The Internals of HDFS

More information

AtScale Intelligence Platform

AtScale Intelligence Platform AtScale Intelligence Platform PUT THE POWER OF HADOOP IN THE HANDS OF BUSINESS USERS. Connect your BI tools directly to Hadoop without compromising scale, performance, or control. TURN HADOOP INTO A HIGH-PERFORMANCE

More information

Cloud-based Infrastructure and Application Support Service Definition

Cloud-based Infrastructure and Application Support Service Definition +44 (0) 20 3603 7830 hello@equalexperts.com www.equalexperts.com 30 Brock Street London, NW1 3FG Cloud-based Infrastructure and Application Support Service Definition Overview We provide 24/7 support to

More information

Integrated windows authentication for customers based on Probation GSI network

Integrated windows authentication for customers based on Probation GSI network Product Overview Victims Tracker (VT) is a software application, which was developed by London Probation Trust (LPT) to effectively manage the engagement / contact with victims of crime and the management

More information

Business white paper. environments. The top 5 challenges and solutions for backup and recovery

Business white paper. environments. The top 5 challenges and solutions for backup and recovery Business white paper Protecting missioncritical application environments The top 5 challenges and solutions for backup and recovery Table of contents 3 Executive summary 3 Key facts about mission-critical

More information

OPEN MODERN DATA ARCHITECTURE FOR FINANCIAL SERVICES RISK MANAGEMENT

OPEN MODERN DATA ARCHITECTURE FOR FINANCIAL SERVICES RISK MANAGEMENT WHITEPAPER OPEN MODERN DATA ARCHITECTURE FOR FINANCIAL SERVICES RISK MANAGEMENT A top-tier global bank s end-of-day risk analysis jobs didn t complete in time for the next start of trading day. To solve

More information

Evaluation criteria for Google Apps backup

Evaluation criteria for Google Apps backup Evaluation criteria for Google Apps backup CHECKLIST Backupify provides a truly independent cloud backup service to give you complete control and ownership of your data. Powerful search has always been

More information

Ensure PCI DSS compliance for your Hadoop environment. A Hortonworks White Paper October 2015

Ensure PCI DSS compliance for your Hadoop environment. A Hortonworks White Paper October 2015 Ensure PCI DSS compliance for your Hadoop environment A Hortonworks White Paper October 2015 2 Contents Overview Why PCI matters to your business Building support for PCI compliance into your Hadoop environment

More information

UDiMan. Introduction. Benefits: Name: UDiMan Identity Management service. Service Type: Software as a Service (SaaS Lot 3)

UDiMan. Introduction. Benefits: Name: UDiMan Identity Management service. Service Type: Software as a Service (SaaS Lot 3) UDiMan Name: UDiMan Identity Management service Service Type: Software as a Service (SaaS Lot 3) Introduction UDiMan is an Enterprise Identity Management solution supporting mission critical authentication

More information

ScienceLogic vs. Open Source IT Monitoring

ScienceLogic vs. Open Source IT Monitoring ScienceLogic vs. Open Source IT Monitoring Next Generation Monitoring or Open Source Software? The table below compares ScienceLogic with currently available open source network management solutions across

More information

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani Technical Architect - Big Data Syntel Agenda Welcome to the Zoo! Evolution Timeline Traditional BI/DW Architecture Where Hadoop Fits In 2 Welcome to

More information

Databricks. A Primer

Databricks. A Primer Databricks A Primer Who is Databricks? Databricks was founded by the team behind Apache Spark, the most active open source project in the big data ecosystem today. Our mission at Databricks is to dramatically

More information

Amazon Relational Database Service (RDS)

Amazon Relational Database Service (RDS) Amazon Relational Database Service (RDS) G-Cloud Service 1 1.An overview of the G-Cloud Service Arcus Global are approved to sell to the UK Public Sector as official Amazon Web Services resellers. Amazon

More information

TE's Analytics on Hadoop and SAP HANA Using SAP Vora

TE's Analytics on Hadoop and SAP HANA Using SAP Vora TE's Analytics on Hadoop and SAP HANA Using SAP Vora Naveen Narra Senior Manager TE Connectivity Santha Kumar Rajendran Enterprise Data Architect TE Balaji Krishna - Director, SAP HANA Product Mgmt. -

More information

White Paper: What You Need To Know About Hadoop

White Paper: What You Need To Know About Hadoop CTOlabs.com White Paper: What You Need To Know About Hadoop June 2011 A White Paper providing succinct information for the enterprise technologist. Inside: What is Hadoop, really? Issues the Hadoop stack

More information

SAS BIG DATA SOLUTIONS ON AWS SAS FORUM ESPAÑA, OCTOBER 16 TH, 2014 IAN MEYERS SOLUTIONS ARCHITECT / AMAZON WEB SERVICES

SAS BIG DATA SOLUTIONS ON AWS SAS FORUM ESPAÑA, OCTOBER 16 TH, 2014 IAN MEYERS SOLUTIONS ARCHITECT / AMAZON WEB SERVICES SAS BIG DATA SOLUTIONS ON AWS SAS FORUM ESPAÑA, OCTOBER 16 TH, 2014 IAN MEYERS SOLUTIONS ARCHITECT / AMAZON WEB SERVICES AWS GLOBAL INFRASTRUCTURE 10 Regions 25 Availability Zones 51 Edge locations WHAT

More information

Cloudera Backup and Disaster Recovery

Cloudera Backup and Disaster Recovery Cloudera Backup and Disaster Recovery Important Note: Cloudera Manager 4 and CDH 4 have reached End of Maintenance (EOM) on August 9, 2015. Cloudera will not support or provide patches for any of the Cloudera

More information

Ubuntu and Hadoop: the perfect match

Ubuntu and Hadoop: the perfect match WHITE PAPER Ubuntu and Hadoop: the perfect match February 2012 Copyright Canonical 2012 www.canonical.com Executive introduction In many fields of IT, there are always stand-out technologies. This is definitely

More information

MS Analytics as a Service

MS Analytics as a Service G-Cloud 7 Service Definition Version 1 Cloud Services 6 th October 2015 Table of Contents Version 1 Company Profile... 2 Overview... 4 Service Description... 4 Analytics consulting services:... 4 Analytics

More information