Enterprise-grade Hadoop: The Building Blocks

Size: px
Start display at page:

Download "Enterprise-grade Hadoop: The Building Blocks"

Transcription

1 Enterprise-grade Hadoop: The Building Blocks An Ovum white paper for MapR Publication Date: 24 Sep 2014 Author name

2 Summary Catalyst Hadoop was initially developed for trusted environments that did not require bulletproof security, running highly specialized analytics, and accessible only by a select cadre of elite programmers at big Internet companies. But as mainstream enterprises implement Hadoop, there is the expectation that the platform will deliver the capabilities that are expected of enterprise data platforms. Progress in filling Hadoop s gaps regarding enterprise-grade functionality has been significant; for instance, in the 2.x version, the open source community has made important advancements in opening the platform to multiple workloads and addressing some single point of failure issues. On the horizon, there are numerous open source projects aimed at addressing gaps in role-based access control, data security, and other areas. However, as an emerging platform, commercial distributions are differentiating; organizations should closely scrutinize their platform supplier s strategy for delivering an enterprisegrade platform. Ovum view Ovum believes that for Hadoop (and other Big Data technologies) to gain enterprise adoption, it must become a "first class citizen" in the enterprise. To do so, Hadoop must be capable of behaving like an enterprise platform. If Hadoop is to become a platform on which enterprises store system of record data, and rely on for competitive analytics insights and operational applications or decisions, it must deliver: Predictable performance -- Deliver consistent performance/availability/reliability to meet business SLAs. Security Provides the same degree of granular security addressing access rights, privacy, and protection unauthorized actions as enterprise databases. Data protection Ensure the same degree (e.g., backup and recovery, privacy, access, obfuscation, and auditing of usage) that is offered by enterprise databases. Data governance and stewardship -- Support the policy-driven lifecycle management of data, like any enterprise database. As a platform, Hadoop is making excellent progress toward meeting these requirements. MapR has been proactive in charting a dual-pronged strategy, supporting critical mass open source projects targeting security and manageability, while extending unique capabilities that address high availability and fault tolerance, data protection, multi-tenancy/resource-management, and performance consistency. Hadoop must become a first class citizen in the enterprise Hadoop originated with Internet companies that had -- at the time -- unique computing and analytic problems. As such, these companies jointly developed the platform in the open source community based on work performed at Google, Yahoo, Facebook, LinkedIn and other organizations. Early 2014 Ovum. All rights reserved. Unauthorized reproduction prohibited. Page 2

3 adopters typically implemented Hadoop as a special, often standalone platform, and run by special teams. The rules of engagement are changing as enterprises adopt Hadoop. The standalone, SWAT team model of Hadoop deployment will not be sustainable in the broader enterprise. Instead, IT departments deploying Hadoop on behalf of multiple business groups must integrate it into the data processing fabric, covered by the same policies regarding access, data protection and retention, resource management, and security. With Hadoop increasingly being utilized as an operational platform for both decision support and data-driven applications, SLA compliance regarding performance and uptime become extremely relevant. Hadoop must become a first class citizen in the enterprise. It must become part of the enterprise data fabric because it will operate alongside existing OLTP, data warehousing, and other decision support data stores. The technology must be accessible to the people and skills that already form the IT organization. Hadoop must map to existing data center infrastructures, policies, and practices that are already applied to data platforms. To become a first class citizen in the enterprise, Hadoop must become enterprise-ready for maintaining security, delivering consistent performance and high availability, providing the management capabilities to ensure sound operation, protecting data, and ensuring data governance. The Five Building Blocks for Enterprise Hadoop 1. Security Hadoop was not originally designed for security. Like the Internet, Hadoop was originally designed for use in trusted environments where there was no perceived need for enterprise-grade security features. Because Hadoop clusters were originally deployed behind firewalls with few having the knowledge to access them, the main security concern wasn t hacking; instead, it was facilitating the pooling of disjointed clusters. That resulted in support of Kerberos for token-based authentication to remote clusters. The original core Hadoop platform also made provision for controlling data access through a coarse-grained approach involving simple user permissions at the file system and directory levels. Third-party tools could enforce protection at the perimeter through firewalls and packages that guard against malware and intrusions, but collectively, all of these measures were just the first step for any data platform that is supposed to be enterprise-grade. An enterprise-grade solution includes full authentication, authorization, and accounting ( AAA ) measures that ensure that that users are who they claim, they can only access data for which they are authorized, and all actions are logged to support administrative audits. At minimum, this requires a unified, single-sign-on capability for all Hadoop modules that can be linked to existing enterprise directories (e.g., LDAP, Active Directory), with authorization for role-based access to specific categories of data. Fault-tolerant authentication must be supported from the client to all nodes of the cluster. 2. Performance and High Availability Enterprise data stores are expected to perform and be available on a consistent, predictable basis. That encompasses high performance, fast response times, and high uptime, and requires features such as automated, dynamic load balancing, rolling updates, and immediate failover. Hadoop s 2014 Ovum. All rights reserved. Unauthorized reproduction prohibited. Page 3

4 MapReduce heritage until recently limited the platform to batch mode operations. As such, expectations for performance and conformance to service level agreement (SLA) commitments were modest. However, as Hadoop accommodates new forms of workloads such as interactive SQL query, real-time stream processing, database operations, and search, it will be utilized increasingly as an operational platform; this is where SLA compliance; uptime and availability; operational efficiency; and managed recovery become especially critical. This places high importance on eliminating single points of failure (SPOF), especially with the NameNode, which the Hadoop community addressed with v 2.0, and that MapR has supported in a more distributed way from the outset. This also places high importance on having predictable performance, both for interactive query and reporting workloads. Furthermore, the purpose of running real-time streaming for operational applications or decision support will be undermined if processing is unexpectedly interrupted. 3. Manageability Enterprise-grade data platforms are expected to be reliable. This requires powerful management capabilities that provide full visibility of operations and granular management of resources and recovery. Although not a high priority for early adopters who utilized Hadoop primarily as a batch analytics platform, manageability grows essential as enterprises deploy it in an operational role and across multiple business units. They entails providing complete visibility, down to node level, of all running services and resource consumption across all clusters. Making operations transparent is an important first step towards meeting the service levels that users expect. With visibility comes the ability to manage resources granularly, isolating resources down to service and node level, along with the ability to dynamically shift loads according to demand and priority. It also requires the ability to maintain availability through capabilities such as: Enabling organizations to take advantage of the latest versions of Hadoop through rolling updates that avoid taking entire clusters offline, and allowing the core Hadoop platform to be upgraded separately from the Apache ecosystem components; Building and running standard workflows for system stops, restarts, and reconfiguration; Granular multi-tenancy across groups or applications which allows for resource isolation and scheduling, resource quotas for data storage and processing, security and, and reporting/auditing of system utilization for ongoing optimization; and Policy-based disaster recovery that prioritizes mirroring of operations by importance or criticality. Manageability becomes essential as Hadoop evolves to a multi-purpose platform, capable of running multiple types of workloads side by side across different portions of the cluster a capability that is partially opened up by Hadoop 2.x s YARN resource management framework. This is a critical development for Hadoop to evolve into becoming a truly shared resource that can be utilized by multiple business units concurrently. Additionally, as Hadoop usage grows to support multiple business units or applications as a shared service from a single cluster, the need for a true multitenant environment become more pronounced Ovum. All rights reserved. Unauthorized reproduction prohibited. Page 4

5 4. Data protection Enterprise Hadoop platforms are expected to support either within the platform, or through enabling technology that third parties leverage provisions to protect privacy and authorize access to specific categories of data. Although not a major issue in Hadoop s early days, when it was utilized as an batch processing platform, as new operational workloads emerge, provisions must be made to ensure that the data always remains sacrosanct. Data must be fully recoverable; features such as disaster recovery, snapshots, and rollbacks will become critical. Because no system can be made 100% available, provisions must be made for snapshotting capabilities that pinpoint the status of all operations on Hadoop files or HBase and Hive tables at specific points in time. Additionally, within the cluster, the platform should have the capability to intelligently replicate data across multiple nodes deployed across different racks, and for providing selective mirroring onto remote clusters that pinpoints the most sensitive data. And there should be straightforward automation to replicate data to remote clusters to protect against geographical disasters. Additionally, the platform must provide enabling technology to support complete or selective obscuring of data as dictated by enterprise policy or external regulation. That encompasses data encryption (for data at rest or on the wire), data masking, and data redaction. This includes protection of data in movement (on the wire) and at rest. These capabilities may not necessarily be provided by the Hadoop platform provider; but Hadoop must enable selective application of such measures. 5. Data governance While data retained in Hadoop will be more varied than data stored in traditional enterprise transaction databases and data warehouses, it is not immune from enterprise policies regarding data retention and lifecycle management, including full data lineage tracking (e.g., sources of data, how the data was transformed or consumed, and by whom) to generate audit trails. The challenge is compounded by the extended breadth and depth of data sources that will be stored in Hadoop compared to a traditional enterprise data warehouse. With its more economical storage, it becomes more practical to store longer timeframes of data in Hadoop; for instance, instead of storing 1 3 years of customer data, an enterprise could keep 7 10 years of data live. Likewise, because Hadoop (unlike traditional data warehouses) can accommodate variably structured data, the sources of data may become more diverse. That makes data governance a broader challenge for Hadoop, although with most organizations just beginning their implementations, that may not be at the top of their task list. But, with more and more varied data stored in Hadoop, it will become extremely critical to enforce strict role-based access controls by data set, while carefully tracking lineage to document data utilization. With more varied sources of data, it will become important to assign confidence levels regarding the validity and cleanliness of data. With extended periods of data stored, data retention policies will have to be enforced where applicable. Ovum does not necessarily expect Hadoop platform providers to specialize in delivering complete portfolios of data governance tools, but the underlying platform must be capable of enforcing those controls at the necessary level of granularity Ovum. All rights reserved. Unauthorized reproduction prohibited. Page 5

6 MapR s strategy for supporting enterprise-grade Hadoop Supporting and accelerating Apache Hadoop Hadoop is not a single monolithic platform, but a collection of open source projects that commercial distribution providers test, integrate, package and support. The MapR Data Platform is built on a distributed computing architecture that was designed from the outset for high availability, reliability, and manageability. MapR s strategy is to support all major Apache Hadoop open source projects, and support industrydefined APIs for areas where MapR has written differentiated implementations, like HDFS and HBase. MapR supports a broad variety of Apache Hadoop and other open source software projects and in many cases, offers choices of competing open source projects in areas such as interactive SQL query. At the core of the MapR Data Platform is the MapR File System (MapR-FS). It provides an NFS and POSIX-compliant file system that replaces HDFS and supports consistent snapshots, mirroring, and random read/writes and provides a NAS-like interface for enterprise data sources. With the MapR Enterprise Database Edition of the distribution, MapR provides MapR-DB, an integrated in-hadoop NoSQL database that provides a high-performance alternative to HBase, while maintaining HBase API compatibility. MapR also supports native Apache HBase. Enterprise capabilities enabled by the MapR platform The Hadoop community has made important advancements with the 2.x generation with regard to the ability to run multiple workloads, take snapshots, and provide a degree of redundancy with the NameNode, which was formerly a single point of failure. MapR follows a dual strategy for supporting a variety of Apache projects that are addressing gaps in the Hadoop platform for manageability, security, and data protection, while leveraging its unique architecture to provide some capabilities that are beyond the capabilities of HDFS. NFS compatibility and POSIX compliance This provide several advantages. Industry-standard NFS compatibility enables existing enterprise applications to access Hadoop data as if they were accessing a NAS; it also allows existing enterprise data sources to be ingested into Hadoop without need for special connectors or utilities such as Flume or Sqoop. POSIX compliance allows the random read/write capabilities that are taken for granted with enterprise databases. Platform Resiliency MapR approaches this requirement in several ways. Its architecture distributes NameNode metadata across all worker nodes in the cluster. While Hadoop 2.x has added a secondary backup NameNode, MapR s architecture offers a different approach that distributes metadata in shards across the cluster, co-residing with the data with which it is associated. This capability is available out-of-the-box, requiring no additional configuration or hardware, and allows instant recovery, with files and tables available rapidly after node failures or cluster restarts. And, if nodes fail while MapReduce batch jobs are running, MapR lets the jobs run to completion, rather than requiring a complete restart from scratch. Snapshots are also supported in Hadoop 2.x, but they are inconsistent if any writes are being 2014 Ovum. All rights reserved. Unauthorized reproduction prohibited. Page 6

7 performed on files; for MapR, snapshots are consistent as you would expect in an enterprise storage system, capturing the status of all files at a specific point in time, even if they are open. Mirroring is configurable; users can designate the volumes and directories to mirror, rather than covering the entire cluster by default. Security This is a work in progress for the Hadoop platform in general. The community has responded with a variety of projects, while established third party tooling addresses some of the gaps. This is an area where MapR is adopting a dual strategy, addressing key customer requirements in the core platform, and implementing security projects from Apache for additional layers of protection. Authentication has been built directly on top of the core platform, with MapR supporting both Kerberos as well as simplified username/password based authentication to integrate into environments that haven t adopted Kerberos. Authorization is available at both a file-level, with permissions bits, and on a column and column-family level for tables stored in MapR-DB, controlled by Boolean access control expressions (ACEs). Wire-level encryption is also supported for all types of communication between nodes in the cluster. To achieve more granular project-specific authorization for projects like Hive and Impala, MapR will also support Sentry. Data protection This is also a work in progress for all Hadoop players. MapR currently supports wire level encryption for data in motion, but currently relies on third parties for encryption, key management, and masking for data at rest. For data replication, it ensures that at least one replica is deployed on a separate rack. MapR s snapshotting capability can pinpoint file status at specific points of time, even while files are open. And for mirroring, MapR provides granularity to choose data by specific volumes or directories, providing a more selective alternative to covering the entire cluster. Manageability and Multi-tenancy Most commercial Hadoop distributions provide node level monitoring and management. In addition to this, the MapR Control System (MCS) provides detailed management of jobs and data, the latter controlled through a unique concept called a volume. A volume is a logical partition of the file system that can be associated with a rich data management policy. Using MCS, users can modify several attributes of a volume, including quota, snapshot and mirror policy and schedule, administrative permissions, and data placement policy. Combined with YARN or other resource managers such as Mesos, this provide fine-grained resource and workload management for multi-tenant environments. MapR takes a few steps further with data placement control and job placement control. These capabilities ensure that the multiple user groups in a MapR cluster can run jobs in segregated portions of the cluster to minimize resource contention. Combined with integrated security, organizations can deploy a MapR cluster with multiple distinct user groups and data sets, a configuration that requires separate clusters on other Hadoop distributions. Additionally, the MapR Control System provides flexibility, supporting access for a GUI, command level interface, or through third party tools via RESTful APIs Ovum. All rights reserved. Unauthorized reproduction prohibited. Page 7

8 Appendix Author Tony Baer, Principal Analyst, Ovum IT Information Management Ovum Consulting We hope that this analysis will help you make informed and imaginative business decisions. If you have further requirements, Ovum s consulting team may be able to help you. For more information about Ovum s consulting capabilities, please contact us directly at consulting@ovum.com. Copyright notice and disclaimer The contents of this product are protected by international copyright laws, database rights and other intellectual property rights. The owner of these rights is Informa Telecoms and Media Limited, our affiliates or other third party licensors. All product and company names and logos contained within or appearing on this product are the trademarks, service marks or trading names of their respective owners, including Informa Telecoms and Media Limited. This product may not be copied, reproduced, distributed or transmitted in any form or by any means without the prior permission of Informa Telecoms and Media Limited. Whilst reasonable efforts have been made to ensure that the information and content of this product was correct as at the date of first publication, neither Informa Telecoms and Media Limited nor any person engaged or employed by Informa Telecoms and Media Limited accepts any liability for any errors, omissions or other inaccuracies. Readers should independently verify any facts and figures as no liability can be accepted in this regard - readers assume full responsibility and risk accordingly for their use of such information and content. Any views and/or opinions expressed in this product by individual authors or contributors are their personal views and/or opinions and do not necessarily reflect the views and/or opinions of Informa Telecoms and Media Limited Ovum. All rights reserved. Unauthorized reproduction prohibited. Page 8

9 CONTACT US INTERNATIONAL OFFICES Beijing Dubai Hong Kong Hyderabad Johannesburg London Melbourne New York San Francisco Sao Paulo Tokyo 2014 Ovum. All rights reserved. Unauthorized reproduction prohibited. Page 9

On the Radar: Tamr. Applying machine learning to integrating Big Data. Publication Date: Sept. 2014 Product code: IT0014-002934.

On the Radar: Tamr. Applying machine learning to integrating Big Data. Publication Date: Sept. 2014 Product code: IT0014-002934. Applying machine learning to integrating Big Data Publication Date: Sept. 2014 Product code: IT0014-002934 Tony Baer Summary Catalyst Traditional data integration approaches may not scale for Big Data.

More information

Big Data must become a first class citizen in the enterprise

Big Data must become a first class citizen in the enterprise Big Data must become a first class citizen in the enterprise An Ovum white paper for Cloudera Publication Date: 14 January 2014 Author: Tony Baer SUMMARY Catalyst Ovum view Big Data analytics have caught

More information

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics ESSENTIALS EMC ISILON Use the industry's first and only scale-out NAS solution with native Hadoop

More information

Addressing Enterprise Needs with a Software Defined Network Platform

Addressing Enterprise Needs with a Software Defined Network Platform Addressing Enterprise Needs with a Software Defined Network Platform Dynamic, customizable approach meets customer demand Date: December 2015 Author: Mike Sapien Ovum view Enterprise customers have virtualized

More information

On the Radar: CipherCloud

On the Radar: CipherCloud Cloud access security delivered on enterprise gateways Publication Date: 18 Feb 2015 Product code: IT0022-000305 Rik Turner Summary Catalyst CipherCloud develops cloud visibility and security technology

More information

Re-architecting Legacy Systems is the Keystone for Transformation

Re-architecting Legacy Systems is the Keystone for Transformation Re-architecting Legacy Systems is the Keystone for Transformation Legacy modernization lays the groundwork for the modern enterprise An Ovum White Paper Contents Executive summary... Introduction... Key

More information

SWOT Assessment: BeyondTrust Privileged Identity Management Portfolio

SWOT Assessment: BeyondTrust Privileged Identity Management Portfolio SWOT Assessment: BeyondTrust Privileged Identity Management Portfolio Analyzing the strengths, weaknesses, opportunities, and threats Publication Date: 11 Jun 2015 Product code: IT0022-000387 Andrew Kellett

More information

How To Use Syncplicity Panorama On A Mobile Device

How To Use Syncplicity Panorama On A Mobile Device On the Radar: Syncplicity Panorama New mobile content access tools for modern business work styles Publication Date: 11 Mar 2015 Product code: IT0021-000064 Richard Edwards Summary Catalyst The typical

More information

Hybrid WAN Services emerging as a viable network option

Hybrid WAN Services emerging as a viable network option Hybrid WAN Services emerging as a viable network option Customers now going beyond MPLS-based services Date: December 2015 Author: Mike Sapien Summary In a nutshell Business customers have relied on MPLS-based

More information

Public Sector Enterprises and Cloud Computing: Realizing Efficiencies

Public Sector Enterprises and Cloud Computing: Realizing Efficiencies Public Sector Enterprises and Cloud Computing: Realizing Efficiencies Summary Catalyst Cloud technology, and its suitability for public services, continues to be a subject that polarizes CIOs. For some,

More information

Financial services perspectives on the role and real impact of cloud

Financial services perspectives on the role and real impact of cloud Financial services perspectives on the role and real impact of cloud Executive Summary Ovum has recently concluded an independent and in-depth survey of 400 senior CIOs within financial services institutions

More information

Big Data Management and Security

Big Data Management and Security Big Data Management and Security Audit Concerns and Business Risks Tami Frankenfield Sr. Director, Analytics and Enterprise Data Mercury Insurance What is Big Data? Velocity + Volume + Variety = Value

More information

On the Radar: Pulse Secure

On the Radar: Pulse Secure Secure access management for corporate and personal endpoints on company networks Publication Date: 17 Jul 2015 Product code: IT0022-000431 Rik Turner Summary Catalyst Pulse Secure is a developer of secure

More information

Case Study: Vitamix. Improving strategic business integration using IT service management practices and technology

Case Study: Vitamix. Improving strategic business integration using IT service management practices and technology Improving strategic business integration using IT service management practices and technology Publication Date: 17 Sep 2014 Product code: IT0022-000180 Adam Holtby Summary Catalyst For Vitamix, a key driver

More information

Ovum Decision Matrix: Selecting an Enterprise File Sync and Share Product, 2014 15

Ovum Decision Matrix: Selecting an Enterprise File Sync and Share Product, 2014 15 Ovum Decision Matrix: Selecting an Enterprise File Sync and Share Product, 2014 15 Excerpt prepared for Egnyte, Inc. Publication Date: 28 Aug 2014 Product code: IT0021-000018 Richard Edwards Summary Catalyst

More information

On the Radar: Alation harnesses crowdsourcing and machine learning to speed data access

On the Radar: Alation harnesses crowdsourcing and machine learning to speed data access On the Radar: Alation harnesses crowdsourcing and machine learning to speed data access Summary Catalyst As organizations widen their net and analyze more data sources, it becomes all too easy for business

More information

How To Rank Customer Analytics Vendors

How To Rank Customer Analytics Vendors Ovum Decision Matrix: Selecting a Customer Analytics Solution for Telcos, 2015 16 Publication Date: 10 Sep 2015 Product code: IT0012-000135 Adaora Okeleke Summary Catalyst Telcos quest for a competitive

More information

SWOT Assessment: BMC Remedy v9

SWOT Assessment: BMC Remedy v9 SWOT Assessment: BMC Remedy v9 Analyzing the strengths, weaknesses, opportunities, and threats Publication Date: 17 Aug 2015 Product code: IT0022-000489 Adam Holtby Summary Catalyst BMC Software is an

More information

Staying agile with Big Data

Staying agile with Big Data An Ovum white paper for Red Hat Publication Date: 09 Sep 2014 Tony Baer Summary Catalyst Like any major technology project, organizations implementing Big Data projects face challenges with aligning business

More information

Data Security in Hadoop

Data Security in Hadoop Data Security in Hadoop Eric Mizell Director, Solution Engineering Page 1 What is Data Security? Data Security for Hadoop allows you to administer a singular policy for authentication of users, authorize

More information

SWOT Assessment: Alfresco, Alfresco One, v5.0

SWOT Assessment: Alfresco, Alfresco One, v5.0 SWOT Assessment: Alfresco, Alfresco One, v5.0 Analyzing the strengths, weaknesses, opportunities, and threats Publication Date: May 5 th, 2015 Product code: IT0014-003012 Sue Clarke Summary Catalyst When

More information

CDH AND BUSINESS CONTINUITY:

CDH AND BUSINESS CONTINUITY: WHITE PAPER CDH AND BUSINESS CONTINUITY: An overview of the availability, data protection and disaster recovery features in Hadoop Abstract Using the sophisticated built-in capabilities of CDH for tunable

More information

Winning with Emerging CRM Channels. An Ovum White Paper

Winning with Emerging CRM Channels. An Ovum White Paper Winning with Emerging CRM Channels An Ovum White Paper Introduction If there has been one constant over the past five years, it is the shift in how consumers interact not just with each other, but how

More information

RPO represents the data differential between the source cluster and the replicas.

RPO represents the data differential between the source cluster and the replicas. Technical brief Introduction Disaster recovery (DR) is the science of returning a system to operating status after a site-wide disaster. DR enables business continuity for significant data center failures

More information

HP s revitalized workforce optimization suite is worth a fresh look

HP s revitalized workforce optimization suite is worth a fresh look HP s revitalized workforce optimization suite is worth a fresh look Publication Date: 27 Jul 2015 Product code: IT0020-000139 Keith Dawson Ovum view Summary When contact center buyers look to acquire workforce

More information

LMS and Student Success at Greenville College: A Case Study

LMS and Student Success at Greenville College: A Case Study LMS and Student Success at Greenville College: A Case Study Overcoming hurdles to improve student retention Publication Date: 23 May 2014 Product code: IT0008-000200 Navneet Johal SUMMARY Catalyst Confusion

More information

Financial Institutions and the cloud: moving from BAU to business transformation

Financial Institutions and the cloud: moving from BAU to business transformation Financial Institutions and the cloud: moving from BAU to business transformation Summary Catalyst The role of cloud technology among banks and insurers has been hotly debated over the last 5 years, creating

More information

Cisco Unified Data Center Solutions for MapR: Deliver Automated, High-Performance Hadoop Workloads

Cisco Unified Data Center Solutions for MapR: Deliver Automated, High-Performance Hadoop Workloads Solution Overview Cisco Unified Data Center Solutions for MapR: Deliver Automated, High-Performance Hadoop Workloads What You Will Learn MapR Hadoop clusters on Cisco Unified Computing System (Cisco UCS

More information

INDUSTRY BRIEF DATA CONSOLIDATION AND MULTI-TENANCY IN FINANCIAL SERVICES

INDUSTRY BRIEF DATA CONSOLIDATION AND MULTI-TENANCY IN FINANCIAL SERVICES INDUSTRY BRIEF DATA CONSOLIDATION AND MULTI-TENANCY IN FINANCIAL SERVICES Data Consolidation and Multi-Tenancy in Financial Services CLOUDERA INDUSTRY BRIEF 2 Table of Contents Introduction 3 Security

More information

2015 Trends to Watch: Higher Education

2015 Trends to Watch: Higher Education 2015 Trends to Watch: Higher Education Leveraging IT to benefit the institutional mission Publication Date: 05 Nov 2014 Product code: IT0008-000217 Navneet Johal Summary Catalyst The higher education industry

More information

Web Application Firewalls: The TCO Question

Web Application Firewalls: The TCO Question Web Application Firewalls: The TCO Question Ovum looks into total cost of ownership for WAFs Rik Turner Summary Catalyst Ovum has carried out a series of interviews with companies in North America, Europe,

More information

Cloudera Enterprise Data Hub. GCloud Service Definition Lot 3: Software as a Service

Cloudera Enterprise Data Hub. GCloud Service Definition Lot 3: Software as a Service Cloudera Enterprise Data Hub GCloud Service Definition Lot 3: Software as a Service December 2014 1 SERVICE OVERVIEW & SOLUTION... 4 1.1 Service Overview... 4 1.2 Introduction to Cloudera... 5 1.3 Cloudera

More information

High Availability on MapR

High Availability on MapR Technical brief Introduction High availability (HA) is the ability of a system to remain up and running despite unforeseen failures, avoiding unplanned downtime or service disruption*. HA is a critical

More information

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Important Notice 2010-2015 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, Impala, and

More information

Hadoop Ecosystem B Y R A H I M A.

Hadoop Ecosystem B Y R A H I M A. Hadoop Ecosystem B Y R A H I M A. History of Hadoop Hadoop was created by Doug Cutting, the creator of Apache Lucene, the widely used text search library. Hadoop has its origins in Apache Nutch, an open

More information

Enterprise Content Management: The Suite Perspective

Enterprise Content Management: The Suite Perspective Enterprise Content Management: The Suite Perspective Publication Date: 04 Dec 2015 Product code: IT0014-003079 Sue Clarke Summary Catalyst The Ovum Decision Matrix: Selecting an Enterprise Content Management

More information

Deploying Hadoop with Manager

Deploying Hadoop with Manager Deploying Hadoop with Manager SUSE Big Data Made Easier Peter Linnell / Sales Engineer plinnell@suse.com Alejandro Bonilla / Sales Engineer abonilla@suse.com 2 Hadoop Core Components 3 Typical Hadoop Distribution

More information

Move Data from Oracle to Hadoop and Gain New Business Insights

Move Data from Oracle to Hadoop and Gain New Business Insights Move Data from Oracle to Hadoop and Gain New Business Insights Written by Lenka Vanek, senior director of engineering, Dell Software Abstract Today, the majority of data for transaction processing resides

More information

Hadoop & Spark Using Amazon EMR

Hadoop & Spark Using Amazon EMR Hadoop & Spark Using Amazon EMR Michael Hanisch, AWS Solutions Architecture 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda Why did we build Amazon EMR? What is Amazon EMR?

More information

WHITEPAPER. A Technical Perspective on the Talena Data Availability Management Solution

WHITEPAPER. A Technical Perspective on the Talena Data Availability Management Solution WHITEPAPER A Technical Perspective on the Talena Data Availability Management Solution BIG DATA TECHNOLOGY LANDSCAPE Over the past decade, the emergence of social media, mobile, and cloud technologies

More information

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Important Notice 2010-2016 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, Impala, and

More information

IBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems

IBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems IBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems Proactively address regulatory compliance requirements and protect sensitive data in real time Highlights Monitor and audit data activity

More information

On the Radar: ForgeRock

On the Radar: ForgeRock Identity management for B2C and the Internet of Things Publication Date: 03 Dec 2015 Product code: IT0022-000500 Rik Turner Summary Catalyst ForgeRock develops identity and access management (IAM) technology

More information

2015 Global Payments Insight: Bill Pay Services. With big change comes big opportunity

2015 Global Payments Insight: Bill Pay Services. With big change comes big opportunity 2015 Global Payments Insight: Bill Pay Services With big change comes big opportunity Catalyst Payments are at a crossroads The payments market is changing. From cash to checks, to charge and credit cards,

More information

Realising possibilities in the cloud: The need for a trusted broker

Realising possibilities in the cloud: The need for a trusted broker Realising possibilities in the cloud: The need for a trusted broker Sponsored by BT and Cisco Camille Mendler Summary Catalyst This report draws on a custom study of the cloud experiences and plans of

More information

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE Hadoop Storage-as-a-Service ABSTRACT This White Paper illustrates how EMC Elastic Cloud Storage (ECS ) can be used to streamline the Hadoop data analytics

More information

Optimized for the Industrial Internet: GE s Industrial Data Lake Platform

Optimized for the Industrial Internet: GE s Industrial Data Lake Platform Optimized for the Industrial Internet: GE s Industrial Lake Platform Agenda The Opportunity The Solution The Challenges The Results Solutions for Industrial Internet, deep domain expertise 2 GESoftware.com

More information

Intel HPC Distribution for Apache Hadoop* Software including Intel Enterprise Edition for Lustre* Software. SC13, November, 2013

Intel HPC Distribution for Apache Hadoop* Software including Intel Enterprise Edition for Lustre* Software. SC13, November, 2013 Intel HPC Distribution for Apache Hadoop* Software including Intel Enterprise Edition for Lustre* Software SC13, November, 2013 Agenda Abstract Opportunity: HPC Adoption of Big Data Analytics on Apache

More information

Making analytics a first-class healthcare citizen: lessons from Oracle customers

Making analytics a first-class healthcare citizen: lessons from Oracle customers Making analytics a first-class healthcare citizen: lessons from Oracle customers Publication Date: 21 Nov 2014 Product code: IT0011-000335 Charlotte Davies Ovum view Summary Technology is being increasingly

More information

Apache Sentry. Prasad Mujumdar prasadm@apache.org prasadm@cloudera.com

Apache Sentry. Prasad Mujumdar prasadm@apache.org prasadm@cloudera.com Apache Sentry Prasad Mujumdar prasadm@apache.org prasadm@cloudera.com Agenda Various aspects of data security Apache Sentry for authorization Key concepts of Apache Sentry Sentry features Sentry architecture

More information

Rethinking Cloud Content Collaboration in Financial Services

Rethinking Cloud Content Collaboration in Financial Services Rethinking Cloud Content Collaboration in Financial Services Executive Summary The financial services sector generally prefers to take a risk-averse approach to new technology trends, with the need for

More information

How to Choose Between Hadoop, NoSQL and RDBMS

How to Choose Between Hadoop, NoSQL and RDBMS How to Choose Between Hadoop, NoSQL and RDBMS Keywords: Jean-Pierre Dijcks Oracle Redwood City, CA, USA Big Data, Hadoop, NoSQL Database, Relational Database, SQL, Security, Performance Introduction A

More information

The Future of Data Management

The Future of Data Management The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class

More information

Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010

Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010 Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010 Better Together Writer: Bill Baer, Technical Product Manager, SharePoint Product Group Technical Reviewers: Steve Peschka,

More information

On the Radar: JReport

On the Radar: JReport Embedded reporting and analytics Publication Date: April 30 th, 2015 Product code: IT0014-003010 Surya Mukherjee Summary Catalyst Jinfonet Software, through its reporting and dashboarding applications,

More information

Powerful Duo: MapR Big Data Analytics with Cisco ACI Network Switches

Powerful Duo: MapR Big Data Analytics with Cisco ACI Network Switches Powerful Duo: MapR Big Data Analytics with Cisco ACI Network Switches Introduction For companies that want to quickly gain insights into or opportunities from big data - the dramatic volume growth in corporate

More information

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes Highly competitive enterprises are increasingly finding ways to maximize and accelerate

More information

<Insert Picture Here> Big Data

<Insert Picture Here> Big Data Big Data Kevin Kalmbach Principal Sales Consultant, Public Sector Engineered Systems Program Agenda What is Big Data and why it is important? What is your Big

More information

More Data in Less Time

More Data in Less Time More Data in Less Time Leveraging Cloudera CDH as an Operational Data Store Daniel Tydecks, Systems Engineering DACH & CE Goals of an Operational Data Store Load Data Sources Traditional Architecture Operational

More information

... ... PEPPERDATA OVERVIEW AND DIFFERENTIATORS ... ... ... ... ...

... ... PEPPERDATA OVERVIEW AND DIFFERENTIATORS ... ... ... ... ... ..................................... WHITEPAPER PEPPERDATA OVERVIEW AND DIFFERENTIATORS INTRODUCTION Prospective customers will often pose the question, How is Pepperdata different from tools like Ganglia,

More information

Upcoming Announcements

Upcoming Announcements Enterprise Hadoop Enterprise Hadoop Jeff Markham Technical Director, APAC jmarkham@hortonworks.com Page 1 Upcoming Announcements April 2 Hortonworks Platform 2.1 A continued focus on innovation within

More information

On the Radar: Apperian MAM

On the Radar: Apperian MAM Mobile application management and enterprise app store Publication Date: 12 May 2015 Product code: IT0021-000082 Richard Absalom Summary Catalyst There is a massive opportunity for enterprises to develop,

More information

MULTITENANCY AND THE ENTERPRISE DATA HUB:

MULTITENANCY AND THE ENTERPRISE DATA HUB: MULTITENANCY AND THE ENTERPRISE DATA HUB: Version: Q414-105 Table of Content Introduction 3 Business Objectives for Multitenant Environments 3 Standard Isolation Models of an EDH 4 Elements of a Multitenant

More information

BIG DATA TRENDS AND TECHNOLOGIES

BIG DATA TRENDS AND TECHNOLOGIES BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using onhand database management tools.

More information

Why enterprise data archiving is critical in a changing landscape

Why enterprise data archiving is critical in a changing landscape Why enterprise data archiving is critical in a changing landscape Ovum white paper for Informatica SUMMARY Catalyst Ovum view The most successful enterprises manage data as strategic asset. They have complete

More information

The Future of Payments 2015: Financial Institutions. The Payments Value Chain is Driven by Customers

The Future of Payments 2015: Financial Institutions. The Payments Value Chain is Driven by Customers The Future of Payments 2015: Financial Institutions The Payments Value Chain is Driven by Customers 1 Catalyst Payments Are at a Crossroads The payments market is changing. From cash to checks, to charge

More information

Hadoop IST 734 SS CHUNG

Hadoop IST 734 SS CHUNG Hadoop IST 734 SS CHUNG Introduction What is Big Data?? Bulk Amount Unstructured Lots of Applications which need to handle huge amount of data (in terms of 500+ TB per day) If a regular machine need to

More information

GoodData. Platform Overview

GoodData. Platform Overview GoodData Platform Overview GoodData Platform: 2 3 The GoodData Platform GoodData Platform GoodData has helped more than users make sense of their data with advanced business analytics. It s open Thanks

More information

EMC s Enterprise Hadoop Solution. By Julie Lockner, Senior Analyst, and Terri McClure, Senior Analyst

EMC s Enterprise Hadoop Solution. By Julie Lockner, Senior Analyst, and Terri McClure, Senior Analyst White Paper EMC s Enterprise Hadoop Solution Isilon Scale-out NAS and Greenplum HD By Julie Lockner, Senior Analyst, and Terri McClure, Senior Analyst February 2012 This ESG White Paper was commissioned

More information

Virtualizing Apache Hadoop. June, 2012

Virtualizing Apache Hadoop. June, 2012 June, 2012 Table of Contents EXECUTIVE SUMMARY... 3 INTRODUCTION... 3 VIRTUALIZING APACHE HADOOP... 4 INTRODUCTION TO VSPHERE TM... 4 USE CASES AND ADVANTAGES OF VIRTUALIZING HADOOP... 4 MYTHS ABOUT RUNNING

More information

ORACLE COHERENCE 12CR2

ORACLE COHERENCE 12CR2 ORACLE COHERENCE 12CR2 KEY FEATURES AND BENEFITS ORACLE COHERENCE IS THE #1 IN-MEMORY DATA GRID. KEY FEATURES Fault-tolerant in-memory distributed data caching and processing Persistence for fast recovery

More information

SWOT Assessment: CoreMedia, CoreMedia Platform

SWOT Assessment: CoreMedia, CoreMedia Platform SWOT Assessment: CoreMedia, CoreMedia Platform Analyzing the strengths, weaknesses, opportunities, and threats Publication Date: 12 May 2016 Product code: IT0014-003122 Sue Clarke Summary Catalyst Organizations

More information

HDFS Users Guide. Table of contents

HDFS Users Guide. Table of contents Table of contents 1 Purpose...2 2 Overview...2 3 Prerequisites...3 4 Web Interface...3 5 Shell Commands... 3 5.1 DFSAdmin Command...4 6 Secondary NameNode...4 7 Checkpoint Node...5 8 Backup Node...6 9

More information

How To Get Value From Data In An Enterprise Business

How To Get Value From Data In An Enterprise Business Thriving in the Age of Big Data Analytics and Self-Service The new shape of BI Tom Pringle, Surya Mukherjee & Tony Baer Table of contents Executive Summary... 3 The new age of analytics and Oracle... 3

More information

Overview. Big Data in Apache Hadoop. - HDFS - MapReduce in Hadoop - YARN. https://hadoop.apache.org. Big Data Management and Analytics

Overview. Big Data in Apache Hadoop. - HDFS - MapReduce in Hadoop - YARN. https://hadoop.apache.org. Big Data Management and Analytics Overview Big Data in Apache Hadoop - HDFS - MapReduce in Hadoop - YARN https://hadoop.apache.org 138 Apache Hadoop - Historical Background - 2003: Google publishes its cluster architecture & DFS (GFS)

More information

How To Understand The Implications Of Outsourced Testing

How To Understand The Implications Of Outsourced Testing Ovum Decision Matrix: Selecting an Outsourced Testing Service Provider, 2014 2015 Author: Thomas Reuner Summary Catalyst The emergence of comprehensive outsourced testing of software applications, in which

More information

Apache Hadoop. Alexandru Costan

Apache Hadoop. Alexandru Costan 1 Apache Hadoop Alexandru Costan Big Data Landscape No one-size-fits-all solution: SQL, NoSQL, MapReduce, No standard, except Hadoop 2 Outline What is Hadoop? Who uses it? Architecture HDFS MapReduce Open

More information

Comparing the Hadoop Distributed File System (HDFS) with the Cassandra File System (CFS)

Comparing the Hadoop Distributed File System (HDFS) with the Cassandra File System (CFS) Comparing the Hadoop Distributed File System (HDFS) with the Cassandra File System (CFS) White Paper BY DATASTAX CORPORATION August 2013 1 Table of Contents Abstract 3 Introduction 3 Overview of HDFS 4

More information

WHITE PAPER. Hadoop and HDFS: Storage for Next Generation Data Management. Version: Q414-102

WHITE PAPER. Hadoop and HDFS: Storage for Next Generation Data Management. Version: Q414-102 Storage for Next Generation Data Management Version: Q414-102 Table of Content Storage for the Modern Enterprise 3 The Challenges of Big Data 5 Data at the Center of the Enterprise 6 The Internals of HDFS

More information

Data movement for globally deployed Big Data Hadoop architectures

Data movement for globally deployed Big Data Hadoop architectures Data movement for globally deployed Big Data Hadoop architectures Scott Rudenstein VP Technical Services November 2015 WANdisco Background WANdisco: Wide Area Network Distributed Computing " Enterprise

More information

Oracle Big Data Fundamentals Ed 1 NEW

Oracle Big Data Fundamentals Ed 1 NEW Oracle University Contact Us: +90 212 329 6779 Oracle Big Data Fundamentals Ed 1 NEW Duration: 5 Days What you will learn In the Oracle Big Data Fundamentals course, learn to use Oracle's Integrated Big

More information

Non-Stop Hadoop Paul Scott-Murphy VP Field Techincal Service, APJ. Cloudera World Japan November 2014

Non-Stop Hadoop Paul Scott-Murphy VP Field Techincal Service, APJ. Cloudera World Japan November 2014 Non-Stop Hadoop Paul Scott-Murphy VP Field Techincal Service, APJ Cloudera World Japan November 2014 WANdisco Background WANdisco: Wide Area Network Distributed Computing Enterprise ready, high availability

More information

Datameer Big Data Governance

Datameer Big Data Governance TECHNICAL BRIEF Datameer Big Data Governance Bringing open-architected and forward-compatible governance controls to Hadoop analytics As big data moves toward greater mainstream adoption, its compliance

More information

Self-service BI for big data applications using Apache Drill

Self-service BI for big data applications using Apache Drill Self-service BI for big data applications using Apache Drill 2015 MapR Technologies 2015 MapR Technologies 1 Management - MCS MapR Data Platform for Hadoop and NoSQL APACHE HADOOP AND OSS ECOSYSTEM Batch

More information

Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities

Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities Technology Insight Paper Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities By John Webster February 2015 Enabling you to make the best technology decisions Enabling

More information

Protecting Big Data Data Protection Solutions for the Business Data Lake

Protecting Big Data Data Protection Solutions for the Business Data Lake White Paper Protecting Big Data Data Protection Solutions for the Business Data Lake Abstract Big Data use cases are maturing and customers are using Big Data to improve top and bottom line revenues. With

More information

Online Transaction Processing in SQL Server 2008

Online Transaction Processing in SQL Server 2008 Online Transaction Processing in SQL Server 2008 White Paper Published: August 2007 Updated: July 2008 Summary: Microsoft SQL Server 2008 provides a database platform that is optimized for today s applications,

More information

owncloud Architecture Overview

owncloud Architecture Overview owncloud Architecture Overview Time to get control back Employees are using cloud-based services to share sensitive company data with vendors, customers, partners and each other. They are syncing data

More information

Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data

Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data Introduction to Hadoop HDFS and Ecosystems ANSHUL MITTAL Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data Topics The goal of this presentation is to give

More information

CitusDB Architecture for Real-Time Big Data

CitusDB Architecture for Real-Time Big Data CitusDB Architecture for Real-Time Big Data CitusDB Highlights Empowers real-time Big Data using PostgreSQL Scales out PostgreSQL to support up to hundreds of terabytes of data Fast parallel processing

More information

Data Center Automation: Market Landscape and Maturity Model

Data Center Automation: Market Landscape and Maturity Model Data Center Automation: Market Landscape and Maturity Model Assessing the organizational readiness and market in data center automation Publication Date: 16 Dec 2015 Product code: IT0022-000569 Roy Illsley

More information

EMC ISILON OneFS OPERATING SYSTEM Powering scale-out storage for the new world of Big Data in the enterprise

EMC ISILON OneFS OPERATING SYSTEM Powering scale-out storage for the new world of Big Data in the enterprise EMC ISILON OneFS OPERATING SYSTEM Powering scale-out storage for the new world of Big Data in the enterprise ESSENTIALS Easy-to-use, single volume, single file system architecture Highly scalable with

More information

Securing Your Enterprise Hadoop Ecosystem Comprehensive Security for the Enterprise with Cloudera

Securing Your Enterprise Hadoop Ecosystem Comprehensive Security for the Enterprise with Cloudera Securing Your Enterprise Hadoop Ecosystem Comprehensive Security for the Enterprise with Cloudera Version: 103 Table of Contents Introduction 3 Importance of Security 3 Growing Pains 3 Security Requirements

More information

PEPPERDATA IN MULTI-TENANT ENVIRONMENTS

PEPPERDATA IN MULTI-TENANT ENVIRONMENTS ..................................... PEPPERDATA IN MULTI-TENANT ENVIRONMENTS technical whitepaper June 2015 SUMMARY OF WHAT S WRITTEN IN THIS DOCUMENT If you are short on time and don t want to read the

More information

Big Data With Hadoop

Big Data With Hadoop With Saurabh Singh singh.903@osu.edu The Ohio State University February 11, 2016 Overview 1 2 3 Requirements Ecosystem Resilient Distributed Datasets (RDDs) Example Code vs Mapreduce 4 5 Source: [Tutorials

More information

Solving performance and data protection problems with active-active Hadoop SOLUTIONS BRIEF

Solving performance and data protection problems with active-active Hadoop SOLUTIONS BRIEF Solving performance and data protection problems with active-active Hadoop SOLUTIONS BRIEF Solving performance and data protection problems with active-active Hadoop Many Hadoop deployments are not realizing

More information

What is a Petabyte? Gain Big or Lose Big; Measuring the Operational Risks of Big Data. Agenda

What is a Petabyte? Gain Big or Lose Big; Measuring the Operational Risks of Big Data. Agenda April - April - Gain Big or Lose Big; Measuring the Operational Risks of Big Data YouTube video here http://www.youtube.com/watch?v=o7uzbcwstu April, 0 Steve Woolley, Sr. Manager Business Continuity Dennis

More information

Self-service BI for big data applications using Apache Drill

Self-service BI for big data applications using Apache Drill Self-service BI for big data applications using Apache Drill 2015 MapR Technologies 2015 MapR Technologies 1 Data Is Doubling Every Two Years Unstructured data will account for more than 80% of the data

More information

How to Hadoop Without the Worry: Protecting Big Data at Scale

How to Hadoop Without the Worry: Protecting Big Data at Scale How to Hadoop Without the Worry: Protecting Big Data at Scale SESSION ID: CDS-W06 Davi Ottenheimer Senior Director of Trust EMC Corporation @daviottenheimer Big Data Trust. Redefined Transparency Relevance

More information