White Paper: Enhancing Functionality and Security of Enterprise Data Holdings

Save this PDF as:
 WORD  PNG  TXT  JPG

Size: px
Start display at page:

Download "White Paper: Enhancing Functionality and Security of Enterprise Data Holdings"

Transcription

1 White Paper: Enhancing Functionality and Security of Enterprise Data Holdings Examining New Mission- Enabling Design Patterns Made Possible by the Cloudera- Intel Partnership Inside: Improving Return on Investment Enhancing Support to Enterprise Missions Dramatically Improving Data Security 1

2 About This Paper The partnership between Cloudera and Intel Corporation announced in 2014 enables a closer coordination of engineering and development activities over the full stack of technologies supporting modern enterprise analytic solutions. This paper, produced by three chief technology officers with extensive experience in fielding data solutions into government agencies, reviews some key developments resulting from this new engineering work and provides design considerations for enterprises seeking to modernize in ways that economically enhance functionality and security. The Hadoop-Based Enterprise Data Hub The open source Apache Hadoop framework has become the foundation of mission-focused data modernization activities throughout industry and government. This emerging platform has widespread community adoption and a large development community focused on continual improvement, with Cloudera leading the community through investing in new open source capabilities designed to meet enterprise mission needs. Cloudera s leadership in this data management platform provides a full spectrum of capability to execute analytic workloads over data holdings and establishes the foundations for the next generation in data management, the enterprise data hub (EDH). The functionality, security, and economies of an EDH are making this the preferred approach to modern data challenges. Now new capabilities in the chip, storage, and network products made by Intel are making exciting contributions to this EDH approach, including faster data processing, complete data encryption, and compression of all data. Intel and Cloudera Working Together In 2014 Intel and Cloudera jointly announced a cooperation that saw Intel making investments in Cloudera and also an intent to work with Cloudera to integrate Intel contributions into core Apache Hadoop projects and Cloudera s CDH. Together, Intel and Cloudera continue to drive innovation through open- source technologies with a focus on security, performance, governance, and mission- focused applications. Cloudera is working closely with Intel to ensure open source solutions like Hadoop make the best use of Intel server technologies. By empowering architects with hardware and software designed to work together, missions can be economically served with more actionable. Cloudera and Enterprise Data Management Cloudera is known for leveraging the best of the open source community to meet the needs of high performance and always-available data solutions. CDH and Cloudera 2

3 Enterprise form a data management platform that can store, process, and analyze all enterprise data. This enables the establishment of an enterprise data hub (EDH): a comprehensive platform that stores and retains data, in any state, form, or volume. The EDH enhances the ability of enterprises to correlate and fuse information. It also integrates with existing infrastructure and tools, and empowers a wide variety of workloads. An EDH from Cloudera provides the enterprise-required features of enhanced security, data provenance, auditing, access control, and cluster performance optimization necessary to bring big data computing to the entire organization. The attractiveness of this solution comes from its ability to meet mission needs in economical and efficient ways and an open design that ensures future missions can be supported without forklift upgrades. Characteristics of the platform include: Unified Approach: This platform provides one integrated system, bringing diverse users and application types to one pool of data and a common infrastructure. Critical data and derivative insights can be shared easily and are no longer locked away in data silos of distinct systems; older data frameworks required different databases for different types of data work, while an EDH hosts multiple application types. Streamlined Design: The EDH approach means no data movement is required. Organizations can store any amount or type of data, in original form, for as long as desired. Enable easy replay or rework of analytical processes and accelerate modeling and transformation pipelines Enhanced Security and Governance: The platform is built to enhance perimeter security, authentication, authorization, data lineage and data protection. Security is integrated with existing enterprise solutions. This is a compliance-ready solution, supportive of audit requirements involving data (including FISMA). Managed Environment: High availability, fault tolerance, self-healing storage, and automated backup and disaster recovery are built into the platform. Open Design: Open source to ensure data and applications remain free from vendor lock-in, and open architecture to ensure connections with all existing technology capabilities and future innovation. 3

4 Intel and The World Of Data Processing Servers built with Intel CPU s, 10/40 GbE networking, and storage technologies provide operational efficiencies that can improve the performance of most any solution. Several new features however, are especially important for data processing and analytics solutions. Intel processors enable faster computation and the ability to process more data elements at once via ongoing advances in CPU microarchitecture and core count. Additionally, Intel also includes technologies to increase data throughput and bandwidth with reduced latency. Furthermore, to address security concerns and regulatory requirements for Hadoop environments, Intel also supports the acceleration of encryption/decryption by offloading advanced encryption from software to the processor silicon. Intel Xeon Processor E5 v3 Family Support to 10/40 GbE networking Intel technologies for solid state drive (SSD) storage and high-speed (10GbE and 40GbE) networking communications systems further optimize the overall throughput of the system. Taken together, these technologies optimize the total infrastructure performance from the CPU to the edge of the network, resulting in dramatic performance results in Hadoop environments. These improvements have been demonstrated in benchmark tests. For example, in tests documented at hadoop.intel.com a 97% reduction in processing time was shown in terabyte sort evaluations. Elastic scaling to adopt to fluctuating workload ensures high performance Performance and I/O enhancements High bandwidth/low latency of input and output Rapid encryption and decryption to encourage pervasive data protection Error- correcting code (ECC) memory Analytics focused instruction set (SSE4) A Full-Stack Solution Focused on Performance and Security The review above of the powerful capabilities of an EDH and the increasing performance and new capabilities of Intel s technology lets us address the next logical question: What is now possible when solutions are engineered to take advantage of both of these capabilities? The most important outcomes of the well-engineered hardware and software EDH are mission-enabling outcomes. We capture some of these outcomes below in the categories of Computing Convergence, Systems Rationalization, and Accelerated Processing. Computing Convergence Fielding your EDH on a well-engineered hardware/software stack allows more workloads to be executed over shared data. Computing convergence also means processing and analytics can both be optimized for all these workloads. This convergence has many mission-focused implications: 4

5 Decision-makers in any organization can be provided with true 360-degree views on topics of interest since all relevant data, operational and historical, can be rapidly found, correlated, and visualized. Law enforcement organizations can reduce time to discovery in criminal investigations and discover new information relevant to cases, since data is now more easily correlated and different types of workloads can be conducted over all the data. Intelligence organizations can more quickly correlate adversary activity and make more reasoned assessments of adversary intentions since multiple sources of intelligence can now be analyzed at the same time. Intelligence can be done on activities and actors in their entirety versus just isolated events. Financial analysts and acquisition professionals can run their analytical queries faster and execute rapid iterative model building and testing within the same system. Cybersecurity analysis, fraud detection, and other solutions that look for anomalies can process all relevant data at once resulting in higher accuracy and fewer false positives. The advantage of full historical data analysis can be achieved. Sampling is focused on statistical needs rather than as a result of system constraints; analyzing all relevant data at once is now possible. Systems Rationalization Since an enterprise data hub approach means separate systems can be reduced (with multi-workloads possible over single repositories of all data), management time and financial resources can be optimized. This systems rationalization has many missionfocused impacts: Mission agility is improved since it is easier to find the right data (and the data is instantly available since it does not need to be retrieved/loaded/indexed). Data of all types (including imagery, video, text) can be used all at once. Agency agility is enhanced due to new ability of developers to field new solutions faster (since new solutions are built over a known pool of all the data and developers can use any number of computing tools and paradigms to find the best mix and fit for any conceivable workload). Any agency that has multiple data storage systems (i.e. all agencies) can significantly reduce the cost and management overhead of multiple separate systems. Accelerated Processing The combined benefits from optimized CPU performance, improved storage throughput, and reduced network infrastructure latency of the Intel Architecture result in performance benefits that allow users to transform the ways in which they operate. Some mission impacts: 5

6 Organizations can make more use of incoming data faster since it can be processed as it arrives and is stored. Incoming sensor data, for example, can be correlated and processed in real time to support ongoing operations. Higher performance provides improved time-to-action, enhancing support to agency missions when users get results at speeds that appear to be instantaneous, they can now ask iterative queries of very responsive systems. Faster processing means more data can be stored in secure and accessible ways utilizing encryption and compression. Among the many benefits of this approach is the ability to always have access to the data needed without having to retrieve data from archives or other data stores. Additional Considerations on Cost The fact that the storage and computing capabilities include increased security and performance as well as significant cost savings make fielding solutions based on this combined architecture particularly virtuous. Calculation of agency return on investment and savings will vary depending on your design and mission needs, but here are some factors that will play into your calculation: Significant storage savings are common, especially when compared legacy designs. Use of proven reference architectures result in faster fielding of a working solution, translating to lower costs to field. The reduction of cost to store and access data as a feature of an enterprise data hub means more data is available there is no need to leave any data behind. The time to retrieve (via backup storage including tape) is eliminated. Reduced time required to analyze data translates to reduced costs for manpower and, by extension, training, thereby allowing existing manpower to focus on interpretation and mission-specific outcomes. Reduction of silos of data results in savings over maintaining multiple systems, which translates to lower costs for storage, processors, and time to move and analyze data separately. 6

7 Strategic Considerations For Data Modernization Projects The capabilities of Cloudera s enterprise data hub and the well-engineered enhancements of Intel s server, networking, and storage infrastructure technologies lead to several design options that planners should consider early. Our recommendations for planners include: Understand and focus on current use cases: Our review of workloads above should help planners identify and clarify the most important/prioritized use cases as design goals for your project. Determining the prioritized data flows for the first use cases will help ensure success on a project is demonstrated early. Ensure the design focuses on output: Identify the analytical queries and algorithms required to generate desired outputs. This will enable the capturing of the advanced analytics requirements and interactive query needs the system must meet and ultimately dictate the rollout of a converged computing strategy. Assess your business rules for operating over the data and interacting with the solution. Agencies now have the capability to encrypt 100% of data and to assign and control access and audit it in new ways. And the applications and solutions that run over this data can also have access controlled by this end-toend solution. Plan for future expansion of use cases: First successes will be measured based on how well they meet current agency needs. But the power of a wellengineered enterprise data hub is that it can support many new use cases and future workloads. The key action in planning for expansion is to listen to the challenges faced by mission owners, and be prepared to iteratively incorporate into the solution new workloads and new data flows provided by them. Consider the full design: Consider compute, networking, data storage, and the software framework together as the data platform. Ensure to ask for design help. Repeatable patterns from other enterprises are available for reference. Engineers from Cloudera, Intel, and their partners can help refine and turn functional reference architecture into a technical design that will rapidly bring new functionality to the agency mission. Concluding Thoughts By utilizing the embedded capabilities of the Intel hardware ecosystem harnessed to the Cloudera enterprise data hub, agencies can realize efficiencies in performance, throughput, and, most importantly, security, in all areas of the architecture. This approach enables a high performance data hub with advanced means of serving business and mission objectives. 7

8 More Reading For more federal technology and policy issues visit: CTOvision.com- A blog for enterprise technologists with a special focus on Big Data. CTOlabs.com - A reference for research and reporting on all IT issues. J.mp/ctonews - Sign up for the government technology newsletters including the Government Big Data Weekly. About the Authors Bob Flores is a co-founder and partner at Cognitio. Bob spent 31 years at the Central Intelligence Agency. While at CIA, Bob held various positions in the Directorate of Intelligence, Directorate of Support, and the National Clandestine Service. He was the agency s Chief Technology Officer. Bob serves on numerous government and industry advisory boards Bob Gourley is a co-founder of Cognitio and editor and chief of CTOvision.com He is a former federal CTO. His career included service in operational intelligence centers around the globe where his focus was operational all source intelligence analysis. He was the first director of intelligence at DoD s Joint Task Force for Computer Network Defense, served as director of technology for a division of Northrop Grumman and spent three years as the CTO of the Defense Intelligence Agency. Bob serves on numerous government and industry advisory boards. Roger Hockenberry is a co-founder and partner and CEO at Cognitio. Following a two-decade career in industry first as a technology consultant and later as a management consultant and Managing Partner at Gartner Roger spent four-years in government service in the intelligence community where he was charged with driving the realization of the vision he had helped craft as a consultant. For More Information If you have questions or would like to discuss this report, please contact me. As an advocate for better IT use in enterprises I am committed to keeping this dialogue up open on technologies, processes and best practices that will keep us all continually improving our capabilities and ability to support organizational missions. Contact: Bob Gourley CTOlabs.com 8

White Paper: SAS and Apache Hadoop For Government. Inside: Unlocking Higher Value From Business Analytics to Further the Mission

White Paper: SAS and Apache Hadoop For Government. Inside: Unlocking Higher Value From Business Analytics to Further the Mission White Paper: SAS and Apache Hadoop For Government Unlocking Higher Value From Business Analytics to Further the Mission Inside: Using SAS and Hadoop Together Design Considerations for Your SAS and Hadoop

More information

Three Open Blueprints For Big Data Success

Three Open Blueprints For Big Data Success White Paper: Three Open Blueprints For Big Data Success Featuring Pentaho s Open Data Integration Platform Inside: Leverage open framework and open source Kickstart your efforts with repeatable blueprints

More information

The Enterprise Data Hub and The Modern Information Architecture

The Enterprise Data Hub and The Modern Information Architecture The Enterprise Data Hub and The Modern Information Architecture Dr. Amr Awadallah CTO & Co-Founder, Cloudera Twitter: @awadallah 1 2013 Cloudera, Inc. All rights reserved. Cloudera Overview The Leader

More information

White Paper: Evaluating Big Data Analytical Capabilities For Government Use

White Paper: Evaluating Big Data Analytical Capabilities For Government Use CTOlabs.com White Paper: Evaluating Big Data Analytical Capabilities For Government Use March 2012 A White Paper providing context and guidance you can use Inside: The Big Data Tool Landscape Big Data

More information

White Paper: Datameer s User-Focused Big Data Solutions

White Paper: Datameer s User-Focused Big Data Solutions CTOlabs.com White Paper: Datameer s User-Focused Big Data Solutions May 2012 A White Paper providing context and guidance you can use Inside: Overview of the Big Data Framework Datameer s Approach Consideration

More information

Dell* In-Memory Appliance for Cloudera* Enterprise

Dell* In-Memory Appliance for Cloudera* Enterprise Built with Intel Dell* In-Memory Appliance for Cloudera* Enterprise Find out what faster big data analytics can do for your business The need for speed in all things related to big data is an enormous

More information

White Paper: Leveraging Web Intelligence to Enhance Cyber Security

White Paper: Leveraging Web Intelligence to Enhance Cyber Security White Paper: Leveraging Web Intelligence to Enhance Cyber Security October 2013 Inside: New context on Web Intelligence The need for external data in enterprise context Making better use of web intelligence

More information

More Data in Less Time

More Data in Less Time More Data in Less Time Leveraging Cloudera CDH as an Operational Data Store Daniel Tydecks, Systems Engineering DACH & CE Goals of an Operational Data Store Load Data Sources Traditional Architecture Operational

More information

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Datenverwaltung im Wandel - Building an Enterprise Data Hub with Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees

More information

Accelerating Enterprise Big Data Success. Tim Stevens, VP of Business and Corporate Development Cloudera

Accelerating Enterprise Big Data Success. Tim Stevens, VP of Business and Corporate Development Cloudera Accelerating Enterprise Big Data Success Tim Stevens, VP of Business and Corporate Development Cloudera 1 Big Opportunity: Extract value from data Revenue Growth x = 50 Billion 35 ZB Cost Savings Margin

More information

The Future of Data Management

The Future of Data Management The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class

More information

BIG DATA THE NEW OPPORTUNITY

BIG DATA THE NEW OPPORTUNITY Feature Biswajit Mohapatra is an IBM Certified Consultant and a global integrated delivery leader for IBM s AMS business application modernization (BAM) practice. He is IBM India s competency head for

More information

EMC PERSPECTIVE. The Private Cloud for Healthcare Enables Coordinated Patient Care

EMC PERSPECTIVE. The Private Cloud for Healthcare Enables Coordinated Patient Care EMC PERSPECTIVE The Private Cloud for Healthcare Enables Coordinated Patient Care Table of Contents A paradigm shift for Healthcare IT...................................................... 3 Cloud computing

More information

HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW

HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW 757 Maleta Lane, Suite 201 Castle Rock, CO 80108 Brett Weninger, Managing Director brett.weninger@adurant.com Dave Smelker, Managing Principal dave.smelker@adurant.com

More information

EMC s Enterprise Hadoop Solution. By Julie Lockner, Senior Analyst, and Terri McClure, Senior Analyst

EMC s Enterprise Hadoop Solution. By Julie Lockner, Senior Analyst, and Terri McClure, Senior Analyst White Paper EMC s Enterprise Hadoop Solution Isilon Scale-out NAS and Greenplum HD By Julie Lockner, Senior Analyst, and Terri McClure, Senior Analyst February 2012 This ESG White Paper was commissioned

More information

Cloudera in the Public Cloud

Cloudera in the Public Cloud Cloudera in the Public Cloud Deployment Options for the Enterprise Data Hub Version: Q414-102 Table of Contents Executive Summary 3 The Case for Public Cloud 5 Public Cloud vs On-Premise 6 Public Cloud

More information

Deploying an Operational Data Store Designed for Big Data

Deploying an Operational Data Store Designed for Big Data Deploying an Operational Data Store Designed for Big Data A fast, secure, and scalable data staging environment with no data volume or variety constraints Sponsored by: Version: 102 Table of Contents Introduction

More information

Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack

Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack HIGHLIGHTS Real-Time Results Elasticsearch on Cisco UCS enables a deeper

More information

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time SCALEOUT SOFTWARE How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time by Dr. William Bain and Dr. Mikhail Sobolev, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T wenty-first

More information

The Road to Convergence

The Road to Convergence A UBM TECHWEB WHITE PAPER SEPTEMBER 2012 The Road to Convergence Six keys to getting there with the most confidence and the least risk. Brought to you by The Road to Convergence Six keys to getting there

More information

SPEED Over of financial market survey participants will increase infrastructure spend from 1% to 6%.* RELIABILITY

SPEED Over of financial market survey participants will increase infrastructure spend from 1% to 6%.* RELIABILITY CDW FINANCIAL SERVICES WE GET IT Scale Up for Speed, Efficiency and Reliability INCREASE EFFICIENCY FROM CDW FINANCIAL SERVICES SPEED Over of financial market survey participants will increase infrastructure

More information

White Paper: What You Need To Know About Hadoop

White Paper: What You Need To Know About Hadoop CTOlabs.com White Paper: What You Need To Know About Hadoop June 2011 A White Paper providing succinct information for the enterprise technologist. Inside: What is Hadoop, really? Issues the Hadoop stack

More information

Interactive data analytics drive insights

Interactive data analytics drive insights Big data Interactive data analytics drive insights Daniel Davis/Invodo/S&P. Screen images courtesy of Landmark Software and Services By Armando Acosta and Joey Jablonski The Apache Hadoop Big data has

More information

Big Data Challenges and Opportunities

Big Data Challenges and Opportunities Big Data Challenges and Opportunities Ira A. (Gus) Hunt Chief Technology Officer Our Mission We are the nation's first line of defense. We accomplish what others cannot accomplish and go where others cannot

More information

HadoopTM Analytics DDN

HadoopTM Analytics DDN DDN Solution Brief Accelerate> HadoopTM Analytics with the SFA Big Data Platform Organizations that need to extract value from all data can leverage the award winning SFA platform to really accelerate

More information

Addressing Risk Data Aggregation and Risk Reporting Ben Sharma, CEO. Big Data Everywhere Conference, NYC November 2015

Addressing Risk Data Aggregation and Risk Reporting Ben Sharma, CEO. Big Data Everywhere Conference, NYC November 2015 Addressing Risk Data Aggregation and Risk Reporting Ben Sharma, CEO Big Data Everywhere Conference, NYC November 2015 Agenda 1. Challenges with Risk Data Aggregation and Risk Reporting (RDARR) 2. How a

More information

INDUSTRY BRIEF DATA CONSOLIDATION AND MULTI-TENANCY IN FINANCIAL SERVICES

INDUSTRY BRIEF DATA CONSOLIDATION AND MULTI-TENANCY IN FINANCIAL SERVICES INDUSTRY BRIEF DATA CONSOLIDATION AND MULTI-TENANCY IN FINANCIAL SERVICES Data Consolidation and Multi-Tenancy in Financial Services CLOUDERA INDUSTRY BRIEF 2 Table of Contents Introduction 3 Security

More information

Driving Growth in Insurance With a Big Data Architecture

Driving Growth in Insurance With a Big Data Architecture Driving Growth in Insurance With a Big Data Architecture The SAS and Cloudera Advantage Version: 103 Table of Contents Overview 3 Current Data Challenges for Insurers 3 Unlocking the Power of Big Data

More information

The Open Cloud Near-Term Infrastructure Trends in Cloud Computing

The Open Cloud Near-Term Infrastructure Trends in Cloud Computing The Open Cloud Near-Term Infrastructure Trends in Cloud Computing Markus Leberecht BELNET Networking Conference 25-Oct-2012 1 Growth & IT Challenges Drive Need for Cloud Computing IT Pros Growth IT Challenges

More information

Microsoft Analytics Platform System. Solution Brief

Microsoft Analytics Platform System. Solution Brief Microsoft Analytics Platform System Solution Brief Contents 4 Introduction 4 Microsoft Analytics Platform System 5 Enterprise-ready Big Data 7 Next-generation performance at scale 10 Engineered for optimal

More information

Enterprise Workloads on the IBM X6 Portfolio: Driving Business Advantages

Enterprise Workloads on the IBM X6 Portfolio: Driving Business Advantages WHITE PAPER Enterprise Workloads on the IBM X6 Portfolio: Driving Business Advantages Sponsored by: IBM Jed Scaramella January 2014 EXECUTIVE SUMMARY Enterprise information technology (IT) leaders are

More information

Optimizing Storage for Better TCO in Oracle Environments. Part 1: Management INFOSTOR. Executive Brief

Optimizing Storage for Better TCO in Oracle Environments. Part 1: Management INFOSTOR. Executive Brief Optimizing Storage for Better TCO in Oracle Environments INFOSTOR Executive Brief a QuinStreet Excutive Brief. 2012 To the casual observer, and even to business decision makers who don t work in information

More information

Luncheon Webinar Series May 13, 2013

Luncheon Webinar Series May 13, 2013 Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration

More information

The evolution of data archiving

The evolution of data archiving The evolution of data archiving 1 1 How archiving needs to change for the modern enterprise Today s enterprises are buried by data, and this problem is being exacerbated by the unfettered growth of unstructured

More information

Breaking News! Big Data is Solved. What Is In-Memory Computing and What Does It Mean to U.S. Leaders? EXECUTIVE WHITE PAPER

Breaking News! Big Data is Solved. What Is In-Memory Computing and What Does It Mean to U.S. Leaders? EXECUTIVE WHITE PAPER Breaking News! Big Data is Solved. What Is In-Memory Computing and What Does It Mean to U.S. Leaders? EXECUTIVE WHITE PAPER There is a revolution happening in information technology, and it s not just

More information

Virtualization s Evolution

Virtualization s Evolution Virtualization s Evolution Expect more from your IT solutions. Virtualization s Evolution In 2009, most Quebec businesses no longer question the relevancy of virtualizing their infrastructure. Rather,

More information

Your Data, Any Place, Any Time. Microsoft SQL Server 2008 provides a trusted, productive, and intelligent data platform that enables you to:

Your Data, Any Place, Any Time. Microsoft SQL Server 2008 provides a trusted, productive, and intelligent data platform that enables you to: Your Data, Any Place, Any Time. Microsoft SQL Server 2008 provides a trusted, productive, and intelligent data platform that enables you to: Run your most demanding mission-critical applications. Reduce

More information

What Is In-Memory Computing and What Does It Mean to U.S. Leaders? EXECUTIVE WHITE PAPER

What Is In-Memory Computing and What Does It Mean to U.S. Leaders? EXECUTIVE WHITE PAPER What Is In-Memory Computing and What Does It Mean to U.S. Leaders? EXECUTIVE WHITE PAPER A NEW PARADIGM IN INFORMATION TECHNOLOGY There is a revolution happening in information technology, and it s not

More information

The Future of Data Management with Hadoop and the Enterprise Data Hub

The Future of Data Management with Hadoop and the Enterprise Data Hub The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah Cofounder & CTO, Cloudera, Inc. Twitter: @awadallah 1 2 Cloudera Snapshot Founded 2008, by former employees of Employees

More information

Cisco Unified Data Center: The Foundation for Private Cloud Infrastructure

Cisco Unified Data Center: The Foundation for Private Cloud Infrastructure White Paper Cisco Unified Data Center: The Foundation for Private Cloud Infrastructure Providing Agile and Efficient Service Delivery for Sustainable Business Advantage What You Will Learn Enterprises

More information

Cloud Computing and Cyber Defense

Cloud Computing and Cyber Defense Cloud Computing and Cyber Defense A White Paper provided to the National Security Council and Homeland Security Council as input to the White House Review of Communications and Information Infrastructure.

More information

Your Data, Any Place, Any Time.

Your Data, Any Place, Any Time. Your Data, Any Place, Any Time. Microsoft SQL Server 2008 provides a trusted, productive, and intelligent data platform that enables you to: Run your most demanding mission-critical applications. Reduce

More information

Empowering Analysts With Big Data

Empowering Analysts With Big Data White Paper: Empowering Analysts With Big Data Inside: Balancing your approach to Big Data Criteria for evaluating your enterprise approach Tips for getting started 1 Four Years of Research Into Big Data

More information

Windows Embedded Security and Surveillance Solutions

Windows Embedded Security and Surveillance Solutions Windows Embedded Security and Surveillance Solutions Windows Embedded 2010 Page 1 Copyright The information contained in this document represents the current view of Microsoft Corporation on the issues

More information

MarkLogic Enterprise Data Layer

MarkLogic Enterprise Data Layer MarkLogic Enterprise Data Layer MarkLogic Enterprise Data Layer MarkLogic Enterprise Data Layer September 2011 September 2011 September 2011 Table of Contents Executive Summary... 3 An Enterprise Data

More information

Why DBMSs Matter More than Ever in the Big Data Era

Why DBMSs Matter More than Ever in the Big Data Era E-PAPER FEBRUARY 2014 Why DBMSs Matter More than Ever in the Big Data Era Having the right database infrastructure can make or break big data analytics projects. TW_1401138 Big data has become big news

More information

Infrastructure Matters: POWER8 vs. Xeon x86

Infrastructure Matters: POWER8 vs. Xeon x86 Advisory Infrastructure Matters: POWER8 vs. Xeon x86 Executive Summary This report compares IBM s new POWER8-based scale-out Power System to Intel E5 v2 x86- based scale-out systems. A follow-on report

More information

INCREASING EFFICIENCY WITH EASY AND COMPREHENSIVE STORAGE MANAGEMENT

INCREASING EFFICIENCY WITH EASY AND COMPREHENSIVE STORAGE MANAGEMENT INCREASING EFFICIENCY WITH EASY AND COMPREHENSIVE STORAGE MANAGEMENT UNPRECEDENTED OBSERVABILITY, COST-SAVING PERFORMANCE ACCELERATION, AND SUPERIOR DATA PROTECTION KEY FEATURES Unprecedented observability

More information

SQLstream Blaze and Apache Storm A BENCHMARK COMPARISON

SQLstream Blaze and Apache Storm A BENCHMARK COMPARISON SQLstream Blaze and Apache Storm A BENCHMARK COMPARISON 2 The V of Big Data Velocity means both how fast data is being produced and how fast the data must be processed to meet demand. Gartner The emergence

More information

Business-centric Storage FUJITSU Hyperscale Storage System ETERNUS CD10000

Business-centric Storage FUJITSU Hyperscale Storage System ETERNUS CD10000 Business-centric Storage FUJITSU Hyperscale Storage System ETERNUS CD10000 Clear the way for new business opportunities. Unlock the power of data. Overcoming storage limitations Unpredictable data growth

More information

WHITE PAPER. Hadoop and HDFS: Storage for Next Generation Data Management. Version: Q414-102

WHITE PAPER. Hadoop and HDFS: Storage for Next Generation Data Management. Version: Q414-102 Storage for Next Generation Data Management Version: Q414-102 Table of Content Storage for the Modern Enterprise 3 The Challenges of Big Data 5 Data at the Center of the Enterprise 6 The Internals of HDFS

More information

Maximizing Hadoop Performance and Storage Capacity with AltraHD TM

Maximizing Hadoop Performance and Storage Capacity with AltraHD TM Maximizing Hadoop Performance and Storage Capacity with AltraHD TM Executive Summary The explosion of internet data, driven in large part by the growth of more and more powerful mobile devices, has created

More information

SCALABLE FILE SHARING AND DATA MANAGEMENT FOR INTERNET OF THINGS

SCALABLE FILE SHARING AND DATA MANAGEMENT FOR INTERNET OF THINGS Sean Lee Solution Architect, SDI, IBM Systems SCALABLE FILE SHARING AND DATA MANAGEMENT FOR INTERNET OF THINGS Agenda Converging Technology Forces New Generation Applications Data Management Challenges

More information

In-Database Analytics

In-Database Analytics Embedding Analytics in Decision Management Systems In-database analytics offer a powerful tool for embedding advanced analytics in a critical component of IT infrastructure. James Taylor CEO CONTENTS Introducing

More information

IBM Netezza High Capacity Appliance

IBM Netezza High Capacity Appliance IBM Netezza High Capacity Appliance Petascale Data Archival, Analysis and Disaster Recovery Solutions IBM Netezza High Capacity Appliance Highlights: Allows querying and analysis of deep archival data

More information

Overview: X5 Generation Database Machines

Overview: X5 Generation Database Machines Overview: X5 Generation Database Machines Spend Less by Doing More Spend Less by Paying Less Rob Kolb Exadata X5-2 Exadata X4-8 SuperCluster T5-8 SuperCluster M6-32 Big Memory Machine Oracle Exadata Database

More information

Big Data and Natural Language: Extracting Insight From Text

Big Data and Natural Language: Extracting Insight From Text An Oracle White Paper October 2012 Big Data and Natural Language: Extracting Insight From Text Table of Contents Executive Overview... 3 Introduction... 3 Oracle Big Data Appliance... 4 Synthesys... 5

More information

Networking Modernize. Open Your Network to Innovation

Networking Modernize. Open Your Network to Innovation Networking Modernize. Open Your Network to Innovation In a world where real-time information is critical, there s just no room for unnecessary complexity. Did you know? Dell Networking Active Fabric solutions

More information

Exploiting Tier 0 Storage and Virtualization to Maximize Storage Performance

Exploiting Tier 0 Storage and Virtualization to Maximize Storage Performance White Paper Exploiting Tier 0 Storage and Virtualization to Maximize Storage Performance Solutions address the high expense of Tier 0 Storage and how to enhance application performance with a three-step

More information

TOP 5 REASONS WHY FINANCIAL SERVICES FIRMS SHOULD CONSIDER SDN NOW

TOP 5 REASONS WHY FINANCIAL SERVICES FIRMS SHOULD CONSIDER SDN NOW TOP 5 REASONS WHY FINANCIAL SERVICES FIRMS SHOULD CONSIDER SDN NOW Abstract Software-defined networking, or SDN, is a relatively new technology that is already having a major impact on companies in the

More information

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved.

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved. Mike Maxey Senior Director Product Marketing Greenplum A Division of EMC 1 Greenplum Becomes the Foundation of EMC s Big Data Analytics (July 2010) E M C A C Q U I R E S G R E E N P L U M For three years,

More information

Data Warehousing in the Cloud

Data Warehousing in the Cloud TDWI RESEARCH TDWI CHECKLIST REPORT Data Warehousing in the Cloud By David Loshin Sponsored by: tdwi.org JULY 2015 TDWI CHECKLIST REPORT Data Warehousing in the Cloud By David Loshin TABLE OF CONTENTS

More information

TBR. IBM x86 Servers in the Cloud: Serving the Cloud. February 2012

TBR. IBM x86 Servers in the Cloud: Serving the Cloud. February 2012 IBM x86 Servers in the Cloud: Serving the Cloud February 2012 TBR T ECH N O LO G Y B U SI N ES S RES EAR CH, I N C. 1 IBM System x Cloud White Paper February 2012 2012 Technology Business Research Inc.

More information

Simplified Management With Hitachi Command Suite. By Hitachi Data Systems

Simplified Management With Hitachi Command Suite. By Hitachi Data Systems Simplified Management With Hitachi Command Suite By Hitachi Data Systems April 2015 Contents Executive Summary... 2 Introduction... 3 Hitachi Command Suite v8: Key Highlights... 4 Global Storage Virtualization

More information

Understanding the Value of In-Memory in the IT Landscape

Understanding the Value of In-Memory in the IT Landscape February 2012 Understing the Value of In-Memory in Sponsored by QlikView Contents The Many Faces of In-Memory 1 The Meaning of In-Memory 2 The Data Analysis Value Chain Your Goals 3 Mapping Vendors to

More information

Cray: Enabling Real-Time Discovery in Big Data

Cray: Enabling Real-Time Discovery in Big Data Cray: Enabling Real-Time Discovery in Big Data Discovery is the process of gaining valuable insights into the world around us by recognizing previously unknown relationships between occurrences, objects

More information

CDH AND BUSINESS CONTINUITY:

CDH AND BUSINESS CONTINUITY: WHITE PAPER CDH AND BUSINESS CONTINUITY: An overview of the availability, data protection and disaster recovery features in Hadoop Abstract Using the sophisticated built-in capabilities of CDH for tunable

More information

Analytics With Hadoop. SAS and Cloudera Starter Services: Visual Analytics and Visual Statistics

Analytics With Hadoop. SAS and Cloudera Starter Services: Visual Analytics and Visual Statistics Analytics With Hadoop SAS and Cloudera Starter Services: Visual Analytics and Visual Statistics Everything You Need to Get Started on Your First Hadoop Project SAS and Cloudera have identified the essential

More information

SCI Briefing: A Review of the New Hitachi Unified Storage and Hitachi NAS Platform 4000 Series. Silverton Consulting, Inc.

SCI Briefing: A Review of the New Hitachi Unified Storage and Hitachi NAS Platform 4000 Series. Silverton Consulting, Inc. SCI Briefing: A Review of the New Hitachi Unified Storage and Hitachi NAS Platform 4000 Series Silverton Consulting, Inc. StorInt Briefing Written by: Ray Lucchesi, President and Founder Published: July,

More information

WHITE PAPER LOWER COSTS, INCREASE PRODUCTIVITY, AND ACCELERATE VALUE, WITH ENTERPRISE- READY HADOOP

WHITE PAPER LOWER COSTS, INCREASE PRODUCTIVITY, AND ACCELERATE VALUE, WITH ENTERPRISE- READY HADOOP WHITE PAPER LOWER COSTS, INCREASE PRODUCTIVITY, AND ACCELERATE VALUE, WITH ENTERPRISE- READY HADOOP CLOUDERA WHITE PAPER 2 Table of Contents Introduction 3 Hadoop's Role in the Big Data Challenge 3 Cloudera:

More information

Cisco Data Preparation

Cisco Data Preparation Data Sheet Cisco Data Preparation Unleash your business analysts to develop the insights that drive better business outcomes, sooner, from all your data. As self-service business intelligence (BI) and

More information

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Important Notice 2010-2015 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, Impala, and

More information

The Virtualization Practice

The Virtualization Practice The Virtualization Practice White Paper: Managing Applications in Docker Containers Bernd Harzog Analyst Virtualization and Cloud Performance Management October 2014 Abstract Docker has captured the attention

More information

Maximum performance, minimal risk for data warehousing

Maximum performance, minimal risk for data warehousing SYSTEM X SERVERS SOLUTION BRIEF Maximum performance, minimal risk for data warehousing Microsoft Data Warehouse Fast Track for SQL Server 2014 on System x3850 X6 (95TB) The rapid growth of technology has

More information

Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010

Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010 Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010 Better Together Writer: Bill Baer, Technical Product Manager, SharePoint Product Group Technical Reviewers: Steve Peschka,

More information

Using In-Memory Computing to Simplify Big Data Analytics

Using In-Memory Computing to Simplify Big Data Analytics SCALEOUT SOFTWARE Using In-Memory Computing to Simplify Big Data Analytics by Dr. William Bain, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T he big data revolution is upon us, fed

More information

GRIDS IN DATA WAREHOUSING

GRIDS IN DATA WAREHOUSING GRIDS IN DATA WAREHOUSING By Madhu Zode Oct 2008 Page 1 of 6 ABSTRACT The main characteristic of any data warehouse is its ability to hold huge volume of data while still offering the good query performance.

More information

Oracle Big Data Building A Big Data Management System

Oracle Big Data Building A Big Data Management System Oracle Big Building A Big Management System Copyright 2015, Oracle and/or its affiliates. All rights reserved. Effi Psychogiou ECEMEA Big Product Director May, 2015 Safe Harbor Statement The following

More information

Microsoft Private Cloud Fast Track

Microsoft Private Cloud Fast Track Microsoft Private Cloud Fast Track Microsoft Private Cloud Fast Track is a reference architecture designed to help build private clouds by combining Microsoft software with Nutanix technology to decrease

More information

Driving IBM BigInsights Performance Over GPFS Using InfiniBand+RDMA

Driving IBM BigInsights Performance Over GPFS Using InfiniBand+RDMA WHITE PAPER April 2014 Driving IBM BigInsights Performance Over GPFS Using InfiniBand+RDMA Executive Summary...1 Background...2 File Systems Architecture...2 Network Architecture...3 IBM BigInsights...5

More information

Business Usage Monitoring for Teradata

Business Usage Monitoring for Teradata Managing Big Analytic Data Business Usage Monitoring for Teradata Increasing Operational Efficiency and Reducing Data Management Costs How to Increase Operational Efficiency and Reduce Data Management

More information

Why Oracle Database Runs Best on Oracle Servers and Storage. Optimize the Performance of the World s #1 Enterprise Database.

Why Oracle Database Runs Best on Oracle Servers and Storage. Optimize the Performance of the World s #1 Enterprise Database. Why Oracle Database Runs Best on Oracle Servers and Storage Optimize the Performance of the World s #1 Enterprise Database. 2 Contents 4 Engineered to Work Together 6 Oracle Optimized Solutions 10 Lower

More information

INTRODUCTION ADVANTAGES OF RUNNING ORACLE 11G ON WINDOWS. Edward Whalen, Performance Tuning Corporation

INTRODUCTION ADVANTAGES OF RUNNING ORACLE 11G ON WINDOWS. Edward Whalen, Performance Tuning Corporation ADVANTAGES OF RUNNING ORACLE11G ON MICROSOFT WINDOWS SERVER X64 Edward Whalen, Performance Tuning Corporation INTRODUCTION Microsoft Windows has long been an ideal platform for the Oracle database server.

More information

WRITTEN TESTIMONY OF NICKLOUS COMBS CHIEF TECHNOLOGY OFFICER, EMC FEDERAL ON CLOUD COMPUTING: BENEFITS AND RISKS MOVING FEDERAL IT INTO THE CLOUD

WRITTEN TESTIMONY OF NICKLOUS COMBS CHIEF TECHNOLOGY OFFICER, EMC FEDERAL ON CLOUD COMPUTING: BENEFITS AND RISKS MOVING FEDERAL IT INTO THE CLOUD WRITTEN TESTIMONY OF NICKLOUS COMBS CHIEF TECHNOLOGY OFFICER, EMC FEDERAL ON CLOUD COMPUTING: BENEFITS AND RISKS MOVING FEDERAL IT INTO THE CLOUD BEFORE THE COMMITTEE ON OVERSIGHT AND GOVERNMENT REFORM

More information

Integrated email archiving: streamlining compliance and discovery through content and business process management

Integrated email archiving: streamlining compliance and discovery through content and business process management Make better decisions, faster March 2008 Integrated email archiving: streamlining compliance and discovery through content and business process management 2 Table of Contents Executive summary.........

More information

A Look at the New Converged Data Center

A Look at the New Converged Data Center Organizations around the world are choosing to move from traditional physical data centers to virtual infrastructure, affecting every layer in the data center stack. This change will not only yield a scalable

More information

Demystifying Deduplication for Backup with the Dell DR4000

Demystifying Deduplication for Backup with the Dell DR4000 Demystifying Deduplication for Backup with the Dell DR4000 This Dell Technical White Paper explains how deduplication with the DR4000 can help your organization save time, space, and money. John Bassett

More information

Survey Report: Federal Government Data Deduplication Strategies Worth Duplicating

Survey Report: Federal Government Data Deduplication Strategies Worth Duplicating Survey Report: Federal Government Data Deduplication Strategies Worth Duplicating April 2011 Survey of U.S. government enterprise technologists, architects and systems integrators identifies current strategies,

More information

Real-Time Big Data Analytics SAP HANA with the Intel Distribution for Apache Hadoop software

Real-Time Big Data Analytics SAP HANA with the Intel Distribution for Apache Hadoop software Real-Time Big Data Analytics with the Intel Distribution for Apache Hadoop software Executive Summary is already helping businesses extract value out of Big Data by enabling real-time analysis of diverse

More information

Data Virtualization Overview

Data Virtualization Overview Data Virtualization Overview Take Big Advantage of Your Data "Using a data virtualization technique is: number one, much quicker time to market; number two, much more cost effective; and three, gives us

More information

Business opportunities from IOT and Big Data. Joachim Aertebjerg Director Enterprise Solution Sales Intel EMEA

Business opportunities from IOT and Big Data. Joachim Aertebjerg Director Enterprise Solution Sales Intel EMEA Business opportunities from IOT and Big Data Joachim Aertebjerg Director Enterprise Solution Sales Intel EMEA HOW INTEL IS TRANSFORMING COMPUTING? Smarter Devices Applications of Big Data Compute for Internet

More information

Introducing the Singlechip Cloud Computer

Introducing the Singlechip Cloud Computer Introducing the Singlechip Cloud Computer Exploring the Future of Many-core Processors White Paper Intel Labs Jim Held Intel Fellow, Intel Labs Director, Tera-scale Computing Research Sean Koehl Technology

More information

OmniCube. SimpliVity OmniCube and Multi Federation ROBO Reference Architecture. White Paper. Authors: Bob Gropman

OmniCube. SimpliVity OmniCube and Multi Federation ROBO Reference Architecture. White Paper. Authors: Bob Gropman OmniCube SimpliVity OmniCube and Multi Federation ROBO Reference Architecture White Paper Authors: Bob Gropman Date: April 13, 2015 SimpliVity and OmniCube are trademarks of SimpliVity Corporation. All

More information

can you effectively plan for the migration and management of systems and applications on Vblock Platforms?

can you effectively plan for the migration and management of systems and applications on Vblock Platforms? SOLUTION BRIEF CA Capacity Management and Reporting Suite for Vblock Platforms can you effectively plan for the migration and management of systems and applications on Vblock Platforms? agility made possible

More information

Solving the Big Data Intention-Deployment Gap

Solving the Big Data Intention-Deployment Gap Whitepaper Solving the Big Data Intention-Deployment Gap Big Data is on virtually every enterprise s to-do list these days. Recognizing both its potential and competitive advantage, companies are aligning

More information

A Comprehensive Cloud Management Platform with Vblock Systems and Cisco Intelligent Automation for Cloud

A Comprehensive Cloud Management Platform with Vblock Systems and Cisco Intelligent Automation for Cloud WHITE PAPER A Comprehensive Cloud Management Platform with Vblock Systems and Cisco Intelligent Automation for Cloud Abstract Data center consolidation and virtualization have set the stage for cloud computing.

More information

Advancing Towards the Future of Cloud Computing: Intel Open Cloud Vision

Advancing Towards the Future of Cloud Computing: Intel Open Cloud Vision Advancing Towards the Future of Cloud Computing: Intel Open Cloud Vision Nikos G. Panagiotidis Market Development Manager Cisco Connect Athens, 23/4/2013 Growth & IT Challenges Drive Need for Cloud Computing

More information

Delivering Quality Service with IBM Service Management

Delivering Quality Service with IBM Service Management Delivering Quality Service with IBM Service Milos Nikolic, Global Technology Services Manager Dragan Jeremic, Technical Sales Specialist for Tivoli December 10th 2008 Innovation is the Process, Success

More information

Silverton Consulting, Inc. StorInt Briefing Introduction Enterprise challenges

Silverton Consulting, Inc. StorInt Briefing Introduction Enterprise challenges Silverton Consulting, Inc. StorInt Briefing Introduction In today s enterprise, IT staff often work under an unrelenting flood of data and a much more demanding business environment. To survive these challenges,

More information

Big data management with IBM General Parallel File System

Big data management with IBM General Parallel File System Big data management with IBM General Parallel File System Optimize storage management and boost your return on investment Highlights Handles the explosive growth of structured and unstructured data Offers

More information