and Deeper Insight solution brief Open Source on Intel

Size: px
Start display at page:

Download "and Deeper Insight solution brief Open Source on Intel"

Transcription

1 Open Source on Intel harness apache hadoop* for faster performance and Deeper Insight solution brief Do More with Apache Hadoop* on Intel Architecture Simplify Development With Intel reference architectures, tuning guides, and best practice recommendations Boost Performance Through simpler and more effective performance tuning Deliver Higher Value With advanced server technologies and highly optimized software Stay on Track With rapid innovation on a proven platform How important is big data to the future success of businesses around the world? According to the McKinsey Global Institute, a retailer using big data to the full could increase its operating margin by more than 60 percent. 1 The competitive advantages can be enormous, and they aren t just for retailers. Big data solutions are being used for everything from fraud detection and genetic research to IT infrastructure optimization and social network analysis and we re still in the earliest stages of the big data revolution. As big data analytics moves into the mainstream, business success will increasingly depend on the ability to store, process, and analyze massive volumes of structured and unstructured data in near real time. But traditional data warehouse technologies can t handle the growing flood of diverse, fast-moving data or the rapid change in requirements that businesses face today. A far more flexible, scalable, and open infrastructure is required. Together with the open-source community and the big data ecosystem, Intel is helping businesses overcome these obstacles so they can harness big data to make smarter business decisions. Intel software engineers make extensive contributions to open-source projects that span the breadth of the big data solution stack, including Linux*, Java*, Hadoop*, HBase*, and many others. We also collaborate with leading software vendors and global IT organizations to optimize and deploy their big data solutions on Intel architecture; work with academia to foster technology advancements; and invest in commercial ventures to nurture critical new technologies and solutions. These efforts help to accelerate open-source innovation across the big data ecosystem. They also help to ensure you and your customers get the highest possible value from big data solutions built on Intel architecture. S According to IDC, more than half a billion dollars in venture capital has been invested in big data technology, and the market is growing at a compound annual rate (CAGR) of 40 percent. 2

2 SPEED INNOVATIVE BIG DATA SOLUTIONS TO MARKET with Less Effort As the demand for big data solutions has grown, Apache Hadoop has quickly become one of the preferred platforms for storing and processing large volumes of structured and unstructured data. Businesses can deploy this opensource software framework across a small number of Intel Xeon processor-based servers to get started with big data analytics quickly and at remarkably low cost. They can then gradually scale their Apache Hadoop cluster to hundreds or even thousands of nodes to enable sub-second query response times across multiple petabytes of data. There are many factors to consider when designing, provisioning, and tuning Apache Hadoop solutions, and the decisions you make can have a direct impact on the depth, breadth, and timeliness of the insights your customers can glean from their fast-growing data sets. Intel is collaborating with the Apache Hadoop community to enable system administrators to squeeze the maximum performance out of their Apache Hadoop clusters with minimum complexity. As a member of the open-source community, we have made extensive contributions to provide you with resources that will help you deliver Apache Hadoop solutions that are optimized for Intel architecture more quickly and with less effort. Go from planning to production in just weeks. You can design and implement Apache Hadoop solutions in less time and with greater confidence using Intel reference architectures, tuning guides, and best practice recommendations. Detailed technical recommendations provide you with a solid starting point for designing your own best-fit solutions. Deliver higher returns through faster analytics. Identify and resolve performance issues that would be intractable using traditional software tools. Intel developed the HiTune performance analyzer and the HiBench benchmark suite to cut through the complexity of performance tuning for Apache Hadoop and now makes them freely available as open-source software tools. What is Apache Hadoop*? Apache Hadoop is an open-source software framework that enables distributed storage and processing of massive volumes of structured and unstructured data. It has already become a key competitive differentiator for some of today s most successful companies, enabling them to extract valuable insights from up to hundreds of petabytes of data in near real time. 2

3 OPTIMIZE HADOOP CLUSTERS AND WORKLOADS FOR FASTER ANALYTICS HiTune Performance Analyzer A key advantage of Apache Hadoop is that it s easier to deploy and use than a traditional data warehouse. Yet optimizing Apache Hadoop clusters and workloads for high performance can be challenging due to the complex interactions among hardware and software in a distributed environment. Intel developed HiTune to address this challenge, providing developers with simple tools to develop highly scalable applications. This scalable, lightweight, and extensible performance analyzer can help you deliver higher performing Apache Hadoop clusters and applications to your customers. It can also help your customers get higher value throughout the life of their cluster. the dynamic interactions between different tasks and stages, and can quickly pinpoint performance bottlenecks, application hotspots, and hardware problems that slow performance. Simplify and accelerate performance tuning. HiTune provides detailed analysis and visualizations, has negligible performance impact on running applications, and requires no modifications to source code. Intel engineers have used it extensively and have realized performance gains as high as six times, in many cases through relatively simple hardware or software adjustments. S 60 % It only took three years for Apache Hadoop* to advance from a pilot project to large-scale commercial distributions. That healthy growth rate continues, pointing toward mainstream adoption throughout the industry and the emergence of a thriving ecosystem of hardware and software vendors. IDC predicts the market for software related to Hadoop will grow at 60 percent a year, reaching $812.8 million by Typical Apache Hadoop queries are written using an intuitive, high-level, data-flow model. This is great for programmers, because all the messy details of data partitioning, task distribution, load balancing, fault tolerance, and node communications are handled by the Apache Hadoop runtime environment. However, hiding that low-level complexity makes performance tuning a daunting challenge. Engineers may have little or no insight into the low-level interactions between hardware and software that are so critical for understanding and optimizing performance. They typically must rely on trial and error, which is not only time-consuming, but also often results in less-than-optimal performance. HiTune monitors the key performance metrics on each server in an Apache Hadoop cluster, then aggregates and correlates these low-level indicators with the highlevel data flow model. Engineers gain deep insight into Scale analyses across thousands of servers. HiTune can be used to analyze applications with tens of thousands of simultaneous processes running across thousands of servers in production environments. The HiTune analysis engine runs as a Apache Hadoop job to enable fast analysis of large amounts of performance data through massively parallel execution. There is no need to analyze just part of an application running on part of a cluster. Engineers can gather and analyze complete information to obtain more useful insights. Get higher value over time. Intel will continue to extend and optimize HiTune for Apache Hadoop and other distributed, big data solutions. HiTune has already been used at Intel to tune and optimize performance for Apache Hive, an open-source data warehouse built on top of Apache Hadoop. The tuning expertise you develop today will deliver even higher value in the future. discover 3

4 QUANTIFY PERFORMANCE TO VERIFY VALUE HiBench Benchmark Suite Optimizing and verifying performance for Apache Hadoop clusters will become increasingly important as the market grows and customers begin using big data insights in near-real time to improve revenue flows, profitability, and operational efficiency. With the HiBench benchmark suite, you can measure, validate, and compare performance for Apache Hadoop clusters accurately and consistently across diverse workloads to provide your customers with better information and greater confidence. HiBench provides convenient access to 10 Apache Hadoop workloads that are simple to use and have been extended, configured, and customized to reflect typical deployments. You can measure performance for specific, common tasks, such as sorting and word counting, or for more comprehensive real-world applications, such as web searching, machine learning, and data analytics. The different workloads have different characteristics, so you can establish test matrices that reflect the resource demands of specific environments. Intel will continue to extend and improve HiBench and is also working with leading vendors and standards bodies to develop industry-standard performance benchmarks for Apache Hadoop. Once these benchmarks are established, you ll have an even better foundation for understanding architectural issues and for measuring and verifying the performance of your Apache Hadoop solutions. Get the Technical Information You Need System administrators can squeeze the maximum performance out of their Apache Hadoop* clusters with minimum complexity using a wealth of tools and resources. White Paper: Optimizing Hadoop Deployments Reference Architecture: Intel Cloud Builders Guide to Apache Hadoop Hadoop Performance Best Practices HiTune: Dataflow-Based Performance Analysis for Big Data Cloud HiBench: A Representative and Comprehensive Hadoop Benchmark Suite 4

5 power BUILD ON A PROVEN FOUNDATION Reference Architectures, Tuning Guides, Best Practice Recommendations Designing fully-optimized Apache Hadoop clusters requires a deep understanding of the entire solution stack. You could spend months exploring the characteristics of Apache Hadoop workloads and how they interact with the underlying hardware and software. Or you could take advantage of the expertise Intel has developed through years of research and collaboration with companies that are now running some of the largest and most successful Apache Hadoop implementations in the world, including Google, Yahoo!, and several leading telecommunications and financial services companies. Intel engineers have distilled this expertise into reference architectures, tuning guides, and best practice recommendations you can use as a starting point for designing and deploying your Apache Hadoop clusters. With clear guidance that extends all the way from hardware specifications through the complete software stack, you can design, build, and configure best-fit solutions more quickly and at lower cost. You can also choose from a number of leading Apache Hadoop distributions, all of which are highly optimized for Intel Xeon processors. Intel works with Cloudera, Hortonworks, IBM, and other commercial distributors to help ensure you get the best possible performance on Intel architecture using software that has been extended, hardened, and tested for production-readiness in enterprise environments. Deliver Higher Value through Intel Technologies Businesses face new challenges as they work to store and mine growing data volumes and distribute real-time insights to people and processes throughout their organizations. Performance, security, compliance, and reliability become increasingly important especially when open-source-based solutions are used to support revenue-producing transactions. Intel helps you meet these demands more easily and at lower cost by integrating forward-looking technologies into Intel Xeon processors and working with the software ecosystem to ensure optimized support. Higher performance. Built-in technologies such as Intel Turbo Boost Technology and Intel Hyper-Threading Technology help to deliver higher performance and superior scalability for massively parallel, data-intensive applications, such as Apache Hadoop. Mission-critical reliability. Advanced reliability, availability, and serviceability (RAS) features protect systems and data more effectively by detecting and correcting errors throughout the server platform, automatically recovering from a wide range of faults, and making it easier for IT organizations to predict, identify, and resolve problems without downtime. create Stronger security. Integrated security technologies provide a better foundation for protecting systems and data. For example, Intel Advanced Encryption Standard New Instructions (Intel AES-NI) provides integrated support for high-speed, low-overhead encryption. Your customers can use it to improve security and compliance without slowing performance all the way from their data centers to their mobile clients. Intel also works extensively with vendors to optimize and enhance traditional relational database and analytics solutions. Innovative new technologies, such as in-memory analytics, in-database analytics, and columnar data structures take advantage of key advancements in Intel Xeon processors to enable significant gains in performance and scalability. Building your Apache Hadoop solutions on the same server platform helps to ensure you and your customers have a common foundation for optimizing performance, reliability, and security across complex and widely distributed analytics environments. 5

6 Stay on Track as the Pace Accelerates Big data solutions are poised to transform the competitive landscape across many industries, and keeping pace with ongoing developments will be both essential and challenging. Intel can help you stay on track. We work directly with academic researchers, the open-source community, hardware and software vendors, cloud providers, and standards bodies throughout the world to help advance open-source innovation. We then build on these efforts by integrating key technologies into next-generation Intel Xeon processors and working to ensure optimized support throughout the solution stack and vendor community so your customers get leading performance and functionality with each new server they add to their Apache Hadoop clusters. Fostering open-source innovation As a member of the open-source community, Intel works upstream to help ensure that critical new capabilities are widely diffused throughout the big data ecosystem. Intel is one of the leading contributors to the Linux kernel and Java open-source software. We are now poised to make substantial contributions to Hadoop, HBase, R, Cassandra*, and many other big data projects to help you get better performance, increased functionality, and higher value in future distributions. Furthering technology breakthroughs through academic research Intel Labs has injected USD 140 million over five years to fuel research through the rollout of global academic centers Intel Science and Technology Centers (ISTCs) and Intel Collaborative Research Institutes (ICRIs) that bring together top professionals and researchers in strategic areas of computing. Research is conducted using an open-source model to facilitate collaboration, and results are shared with the educational community and the technology industry. Researchers at the ISTC for Big Data are working to produce new data management systems and compute architectures that together can help users process data that exceeds the scale, rate, or sophistication of data processing that existing systems provide. The center will also demonstrate the effectiveness of these solutions on real-world applications in science, engineering, and medicine. The new Intel Science and Technology Center (ISTC) of Big Data is located at the Computer Science and Artificial Intelligence Laboratory (CSAIL) at the Massachusetts Institute of Technology (MIT). Advancing industry standards Businesses need standards-based solutions they can deploy with confidence to solve real-world challenges. As technical advisor to the Open Data Center Alliance (ODCA) and its new Data Services workgroup, Intel maintains a connection with more than 300 global IT organizations. Intel is also a founding underwriter of the International Institute for Analytics (IIA). These and many other engagements help Intel shape research and development efforts to ensure that future technologies deliver high value. Our goal is to innovate and guide the work of the Intel Science and Technology Center for Big Data across multiple fields from medical to media to extract meaning from large amounts of data. Justin Rattner, Chief Technology Officer, Intel 6

7 Spurring the development of pivotal technologies The most groundbreaking innovations sometimes come from small startup companies that have big ideas. Intel Capital identifies and invests in these companies to help them thrive and to increase their impact on the big data ecosystem. A prime example is Revolution Analytics, a company that helps enterprise customers get higher value from R, an open-source statistics language that has exploded in popularity to become one of the programming languages of choice for many data analysts. There is no doubt in my mind that as trends like big data and open source continue to converge, all of that will be taking place on high-performance Intel architecture in the cloud.. Zack Urlocker, Chief Operating Officer, Zendesk and Board Member, Revolution Analytics Readying the Linux* Kernel for the Era of Big Data As businesses look to harness big data to make smarter business decisions, scalability and performance of the Linux kernel only becomes more critical. With each successive release of the kernel and its own platform hardware Intel aims to ensure its ongoing scalability to take best advantage of the compute capabilities of Intel architecture-based servers. Together with other members of the Linux community, Intel initiated the Linux Kernel Performance Project, to continuously monitor kernel performance, evaluating every dot release with key workloads. Beyond contributions to the scalability of Linux, Intel has helped improve Linux power efficiency, graphics operations, wired and wireless networking, and firmware and platform integration. Innovate Iceland s Advania Thor Data Center, powered by 288 Intel Xeon processorbased clusters (each featuring 3,456 compute cores), houses the world s first zero-emissions supercomputer, drawing power from 100-percent renewable resources, including the geothermal plant shown here. Intel strongly supports the Apache Hadoop*/NoSQL ecosystem and other open-source projects that make facilities such as the Advania Thor Data Center possible. 7

8 Intel takes pride in being a long-standing member of the open-source community. We believe in open source development as a means to create rich business opportunities, advance promising technologies, and bring together top talent from diverse fields to solve computing challenges. Our contributions to the community include reliable hardware architectures, professional development tools, work on essential open-source components, collaboration and co-engineering with leading companies, investment in academic research and commercial businesses, and helping to build a thriving ecosystem around open source. spark tools and resources customer solutions academic research OPEN SOURCE on Intel Linux contributions building blocks commercial ecosystem industry standards get started now Apache Hadoop is transforming the way companies store and use data by delivering powerful new capabilities on a distributed architecture that is far more scalable, flexible, and affordable than traditional data warehouse platforms. Forward-thinking companies are gaining firstmover advantages by mining insight from massive data sets and fast-moving data streams that would otherwise be impossible to analyze. Many others are following in their footsteps, leading to rapid market growth for big data products and solutions. Intel offers tools, resources, and platforms that can help you get innovative big data solutions to market faster and with less effort and deliver higher value to your customers both now and in the future. The big data revolution is underway. Join us. 1 Source: Big data: The next frontier for innovation, competition and productivity, by James Manyika, Michael Chui, Brad Brown, Jacques Bughin, Richard Dobbs, Charles Roxburgh, and Angela Hung Byers, McKinsey Global Institute, May, innovation/big_data_the_next_frontier_for_innovation 2 Source: IDC Press Release: IDC Releases First Worldwide Big Data Technology and Services Market Forecast, Shows Big Data as the Next Essential Capability and a Foundation for the Intelligent Economy, March 7, ontainerid=prus Hadoop software market to hit $812.8 million in 2016, ZDNet, Information in this document is provided in connection with Intel products. No license, express or implied, by estoppel or otherwise, to any intellectual property rights is granted by this document. Except as provided in Intel s terms and conditions of sale for such products, Intel assumes no liability whatsoever, and Intel disclaims any express or implied warranty, relating to sale and/or use of Intel products including liability or warranties relating to fitness for a particular purpose, merchantability, or infringement of any patent, copyright or other intellectual property right. Unless otherwise agreed in writing by Intel, the Intel products are not designed nor intended for any application in which the failure of the Intel product could create a situation where personal injury or death may occur. Copyright 2012 Intel Corporation. All rights reserved. Intel, Xeon, and the Intel logo are trademarks of Intel Corporation in the U.S. and other countries. *Other names and brands may be claimed as the property of others. 1012/NR/PRW/PDF US

Fast, Low-Overhead Encryption for Apache Hadoop*

Fast, Low-Overhead Encryption for Apache Hadoop* Fast, Low-Overhead Encryption for Apache Hadoop* Solution Brief Intel Xeon Processors Intel Advanced Encryption Standard New Instructions (Intel AES-NI) The Intel Distribution for Apache Hadoop* software

More information

Real-Time Big Data Analytics SAP HANA with the Intel Distribution for Apache Hadoop software

Real-Time Big Data Analytics SAP HANA with the Intel Distribution for Apache Hadoop software Real-Time Big Data Analytics with the Intel Distribution for Apache Hadoop software Executive Summary is already helping businesses extract value out of Big Data by enabling real-time analysis of diverse

More information

Interactive data analytics drive insights

Interactive data analytics drive insights Big data Interactive data analytics drive insights Daniel Davis/Invodo/S&P. Screen images courtesy of Landmark Software and Services By Armando Acosta and Joey Jablonski The Apache Hadoop Big data has

More information

Intel Platform and Big Data: Making big data work for you.

Intel Platform and Big Data: Making big data work for you. Intel Platform and Big Data: Making big data work for you. 1 From data comes insight New technologies are enabling enterprises to transform opportunity into reality by turning big data into actionable

More information

Dell* In-Memory Appliance for Cloudera* Enterprise

Dell* In-Memory Appliance for Cloudera* Enterprise Built with Intel Dell* In-Memory Appliance for Cloudera* Enterprise Find out what faster big data analytics can do for your business The need for speed in all things related to big data is an enormous

More information

HPC & Big Data THE TIME HAS COME FOR A SCALABLE FRAMEWORK

HPC & Big Data THE TIME HAS COME FOR A SCALABLE FRAMEWORK HPC & Big Data THE TIME HAS COME FOR A SCALABLE FRAMEWORK Barry Davis, General Manager, High Performance Fabrics Operation Data Center Group, Intel Corporation Legal Disclaimer Today s presentations contain

More information

IBM PureFlex System. The infrastructure system with integrated expertise

IBM PureFlex System. The infrastructure system with integrated expertise IBM PureFlex System The infrastructure system with integrated expertise 2 IBM PureFlex System IT is moving to the strategic center of business Over the last 100 years information technology has moved from

More information

Intel Cloud Builder Guide: Cloud Design and Deployment on Intel Platforms

Intel Cloud Builder Guide: Cloud Design and Deployment on Intel Platforms EXECUTIVE SUMMARY Intel Cloud Builder Guide Intel Xeon Processor-based Servers Red Hat* Cloud Foundations Intel Cloud Builder Guide: Cloud Design and Deployment on Intel Platforms Red Hat* Cloud Foundations

More information

In-Memory Analytics for Big Data

In-Memory Analytics for Big Data In-Memory Analytics for Big Data Game-changing technology for faster, better insights WHITE PAPER SAS White Paper Table of Contents Introduction: A New Breed of Analytics... 1 SAS In-Memory Overview...

More information

IBM System x reference architecture solutions for big data

IBM System x reference architecture solutions for big data IBM System x reference architecture solutions for big data Easy-to-implement hardware, software and services for analyzing data at rest and data in motion Highlights Accelerates time-to-value with scalable,

More information

The power of collaboration: Accenture capabilities + Dell solutions

The power of collaboration: Accenture capabilities + Dell solutions The power of collaboration: Accenture capabilities + Dell solutions IT must run like a business grow with efficiency, deliver results, and deliver long-term strategic value. As technology changes accelerate

More information

Hur hanterar vi utmaningar inom området - Big Data. Jan Östling Enterprise Technologies Intel Corporation, NER

Hur hanterar vi utmaningar inom området - Big Data. Jan Östling Enterprise Technologies Intel Corporation, NER Hur hanterar vi utmaningar inom området - Big Data Jan Östling Enterprise Technologies Intel Corporation, NER Legal Disclaimers All products, computer systems, dates, and figures specified are preliminary

More information

WELCOME TO THE WORLD OF BIG DATA. NEW WORLD PROBLEMS, NEW WORLD SOLUTIONS

WELCOME TO THE WORLD OF BIG DATA. NEW WORLD PROBLEMS, NEW WORLD SOLUTIONS WELCOME TO THE WORLD OF BIG DATA. NEW WORLD PROBLEMS, NEW WORLD SOLUTIONS TECHNOLOGY by Zachary Zeus Data in our world has been exploding. According to IBM research, 90% of today s data was created in

More information

Intel, Cisco, and Red Hat deliver a proven solution that reduces risk. Advance Your Cloud Strategy with OpenStack

Intel, Cisco, and Red Hat deliver a proven solution that reduces risk. Advance Your Cloud Strategy with OpenStack Technology Overview Simplify OpenStack * Cloud Deployment Intel, Cisco, and Red Hat deliver a proven solution that reduces risk According to a global survey of 3,643 enterprise executives responsible for

More information

1 Performance Moves to the Forefront for Data Warehouse Initiatives. 2 Real-Time Data Gets Real

1 Performance Moves to the Forefront for Data Warehouse Initiatives. 2 Real-Time Data Gets Real Top 10 Data Warehouse Trends for 2013 What are the most compelling trends in storage and data warehousing that motivate IT leaders to undertake new initiatives? Which ideas, solutions, and technologies

More information

IBM Analytics. Just the facts: Four critical concepts for planning the logical data warehouse

IBM Analytics. Just the facts: Four critical concepts for planning the logical data warehouse IBM Analytics Just the facts: Four critical concepts for planning the logical data warehouse 1 2 3 4 5 6 Introduction Complexity Speed is businessfriendly Cost reduction is crucial Analytics: The key to

More information

Big Data and Natural Language: Extracting Insight From Text

Big Data and Natural Language: Extracting Insight From Text An Oracle White Paper October 2012 Big Data and Natural Language: Extracting Insight From Text Table of Contents Executive Overview... 3 Introduction... 3 Oracle Big Data Appliance... 4 Synthesys... 5

More information

Accelerating Business Intelligence with Large-Scale System Memory

Accelerating Business Intelligence with Large-Scale System Memory Accelerating Business Intelligence with Large-Scale System Memory A Proof of Concept by Intel, Samsung, and SAP Executive Summary Real-time business intelligence (BI) plays a vital role in driving competitiveness

More information

High Performance Computing and Big Data: The coming wave.

High Performance Computing and Big Data: The coming wave. High Performance Computing and Big Data: The coming wave. 1 In science and engineering, in order to compete, you must compute Today, the toughest challenges, and greatest opportunities, require computation

More information

Accelerating Business Intelligence with Large-Scale System Memory

Accelerating Business Intelligence with Large-Scale System Memory Accelerating Business Intelligence with Large-Scale System Memory A Proof of Concept by Intel, Samsung, and SAP Executive Summary Real-time business intelligence (BI) plays a vital role in driving competitiveness

More information

SOLUTION BRIEF BIG DATA MANAGEMENT. How Can You Streamline Big Data Management?

SOLUTION BRIEF BIG DATA MANAGEMENT. How Can You Streamline Big Data Management? SOLUTION BRIEF BIG DATA MANAGEMENT How Can You Streamline Big Data Management? Today, organizations are capitalizing on the promises of big data analytics to innovate and solve problems faster. Big Data

More information

Tap into Big Data at the Speed of Business

Tap into Big Data at the Speed of Business SAP Brief SAP Technology SAP Sybase IQ Objectives Tap into Big Data at the Speed of Business A simpler, more affordable approach to Big Data analytics A simpler, more affordable approach to Big Data analytics

More information

Ubuntu and Hadoop: the perfect match

Ubuntu and Hadoop: the perfect match WHITE PAPER Ubuntu and Hadoop: the perfect match February 2012 Copyright Canonical 2012 www.canonical.com Executive introduction In many fields of IT, there are always stand-out technologies. This is definitely

More information

Intel Cloud Builder Guide to Cloud Design and Deployment on Intel Platforms

Intel Cloud Builder Guide to Cloud Design and Deployment on Intel Platforms Intel Cloud Builder Guide to Cloud Design and Deployment on Intel Platforms Ubuntu* Enterprise Cloud Executive Summary Intel Cloud Builder Guide Intel Xeon Processor Ubuntu* Enteprise Cloud Canonical*

More information

Intel and Qihoo 360 Internet Portal Datacenter - Big Data Storage Optimization Case Study

Intel and Qihoo 360 Internet Portal Datacenter - Big Data Storage Optimization Case Study Intel and Qihoo 360 Internet Portal Datacenter - Big Data Storage Optimization Case Study The adoption of cloud computing creates many challenges and opportunities in big data management and storage. To

More information

SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform

SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform David Lawler, Oracle Senior Vice President, Product Management and Strategy Paul Kent, SAS Vice President, Big Data What

More information

McAfee and SAP HANA. Executive Summary. Real-Time Business Requires Real-Time Security

McAfee and SAP HANA. Executive Summary. Real-Time Business Requires Real-Time Security White Paper SAP HANA Intel Xeon Processor E7 Family Enterprise-class Security McAfee and SAP HANA Real-time, data-driven business with enterprise-class security Executive Summary There s an old saying:

More information

Big Data Performance Growth on the Rise

Big Data Performance Growth on the Rise Impact of Big Data growth On Transparent Computing Michael A. Greene Intel Vice President, Software and Services Group, General Manager, System Technologies and Optimization 1 Transparent Computing (TC)

More information

Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities

Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities Technology Insight Paper Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities By John Webster February 2015 Enabling you to make the best technology decisions Enabling

More information

Make the Most of Big Data to Drive Innovation Through Reseach

Make the Most of Big Data to Drive Innovation Through Reseach White Paper Make the Most of Big Data to Drive Innovation Through Reseach Bob Burwell, NetApp November 2012 WP-7172 Abstract Monumental data growth is a fact of life in research universities. The ability

More information

Cisco Data Preparation

Cisco Data Preparation Data Sheet Cisco Data Preparation Unleash your business analysts to develop the insights that drive better business outcomes, sooner, from all your data. As self-service business intelligence (BI) and

More information

Delivering new insights and value to consumer products companies through big data

Delivering new insights and value to consumer products companies through big data IBM Software White Paper Consumer Products Delivering new insights and value to consumer products companies through big data 2 Delivering new insights and value to consumer products companies through big

More information

Navigating the Big Data infrastructure layer Helena Schwenk

Navigating the Big Data infrastructure layer Helena Schwenk mwd a d v i s o r s Navigating the Big Data infrastructure layer Helena Schwenk A special report prepared for Actuate May 2013 This report is the second in a series of four and focuses principally on explaining

More information

SAP Makes Big Data Real Real Time. Real Results.

SAP Makes Big Data Real Real Time. Real Results. SAP Makes Big Data Real Real Time. Real Results. MAKE BIG DATA REAL WITH SAP SOLUTIONS: ACCELERATE. APPLY. ACHIEVE Accelerate, Apply, and Achieve Big Results from Your Big Data Big Data represents an opportunity

More information

An Oracle White Paper November 2010. Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics

An Oracle White Paper November 2010. Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics An Oracle White Paper November 2010 Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics 1 Introduction New applications such as web searches, recommendation engines,

More information

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances INSIGHT Oracle's All- Out Assault on the Big Data Market: Offering Hadoop, R, Cubes, and Scalable IMDB in Familiar Packages Carl W. Olofson IDC OPINION Global Headquarters: 5 Speen Street Framingham, MA

More information

A financial software company

A financial software company A financial software company Projecting USD10 million revenue lift with the IBM Netezza data warehouse appliance Overview The need A financial software company sought to analyze customer engagements to

More information

IBM Software Hadoop in the cloud

IBM Software Hadoop in the cloud IBM Software Hadoop in the cloud Leverage big data analytics easily and cost-effectively with IBM InfoSphere 1 2 3 4 5 Introduction Cloud and analytics: The new growth engine Enhancing Hadoop in the cloud

More information

CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data

CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data Research Report CA Technologies Big Data Infrastructure Management Executive Summary CA Technologies recently exhibited new technology innovations, marking its entry into the Big Data marketplace with

More information

Extending the Power of Analytics with a Proven Data Warehousing. Solution

Extending the Power of Analytics with a Proven Data Warehousing. Solution SAP Brief SAP s for Small Businesses and Midsize Companies SAP IQ, Edge Edition Objectives Extending the Power of Analytics with a Proven Data Warehousing Uncover deep insights and reach new heights Uncover

More information

How To Make Data Streaming A Real Time Intelligence

How To Make Data Streaming A Real Time Intelligence REAL-TIME OPERATIONAL INTELLIGENCE Competitive advantage from unstructured, high-velocity log and machine Big Data 2 SQLstream: Our s-streaming products unlock the value of high-velocity unstructured log

More information

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica

More information

IBM DB2 Near-Line Storage Solution for SAP NetWeaver BW

IBM DB2 Near-Line Storage Solution for SAP NetWeaver BW IBM DB2 Near-Line Storage Solution for SAP NetWeaver BW A high-performance solution based on IBM DB2 with BLU Acceleration Highlights Help reduce costs by moving infrequently used to cost-effective systems

More information

An Oracle White Paper May 2012. Oracle Database Cloud Service

An Oracle White Paper May 2012. Oracle Database Cloud Service An Oracle White Paper May 2012 Oracle Database Cloud Service Executive Overview The Oracle Database Cloud Service provides a unique combination of the simplicity and ease of use promised by Cloud computing

More information

Apache Hadoop: The Big Data Refinery

Apache Hadoop: The Big Data Refinery Architecting the Future of Big Data Whitepaper Apache Hadoop: The Big Data Refinery Introduction Big data has become an extremely popular term, due to the well-documented explosion in the amount of data

More information

The ROI from Optimizing Software Performance with Intel Parallel Studio XE

The ROI from Optimizing Software Performance with Intel Parallel Studio XE The ROI from Optimizing Software Performance with Intel Parallel Studio XE Intel Parallel Studio XE delivers ROI solutions to development organizations. This comprehensive tool offering for the entire

More information

Unisys ClearPath Forward Fabric Based Platform to Power the Weather Enterprise

Unisys ClearPath Forward Fabric Based Platform to Power the Weather Enterprise Unisys ClearPath Forward Fabric Based Platform to Power the Weather Enterprise Introducing Unisys All in One software based weather platform designed to reduce server space, streamline operations, consolidate

More information

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014

Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014 Forecast of Big Data Trends Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014 Big Data transforms Business 2 Data created every minute Source http://mashable.com/2012/06/22/data-created-every-minute/

More information

An Oracle White Paper June 2012. High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database

An Oracle White Paper June 2012. High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database An Oracle White Paper June 2012 High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database Executive Overview... 1 Introduction... 1 Oracle Loader for Hadoop... 2 Oracle Direct

More information

Fiserv. Saving USD8 million in five years and helping banks improve business outcomes using IBM technology. Overview. IBM Software Smarter Computing

Fiserv. Saving USD8 million in five years and helping banks improve business outcomes using IBM technology. Overview. IBM Software Smarter Computing Fiserv Saving USD8 million in five years and helping banks improve business outcomes using IBM technology Overview The need Small and midsize banks and credit unions seek to attract, retain and grow profitable

More information

Simplify IT and Reduce TCO: Oracle s End-to-End, Integrated Infrastructure for SAP Data Centers

Simplify IT and Reduce TCO: Oracle s End-to-End, Integrated Infrastructure for SAP Data Centers Simplify IT and Reduce TCO: Oracle s End-to-End, Integrated Infrastructure for SAP Data Centers Over time, IT infrastructures have become increasingly complex and costly to manage and operate. Oracle s

More information

Intel Network Builders: Lanner and Intel Building the Best Network Security Platforms

Intel Network Builders: Lanner and Intel Building the Best Network Security Platforms Solution Brief Intel Xeon Processors Lanner Intel Network Builders: Lanner and Intel Building the Best Network Security Platforms Internet usage continues to rapidly expand and evolve, and with it network

More information

IBM Software Cloud service delivery and management

IBM Software Cloud service delivery and management IBM Software Cloud service delivery and management Rethink IT. Reinvent business. 2 Cloud service delivery and management Virtually unparalleled change and complexity On this increasingly instrumented,

More information

Big data management with IBM General Parallel File System

Big data management with IBM General Parallel File System Big data management with IBM General Parallel File System Optimize storage management and boost your return on investment Highlights Handles the explosive growth of structured and unstructured data Offers

More information

Big Data Services From Hitachi Data Systems

Big Data Services From Hitachi Data Systems SOLUTION PROFILE Big Data Services From Hitachi Data Systems Create Strategy, Implement and Manage a Solution for Big Data for Your Organization Big Data Consulting Services and Big Data Transition Services

More information

Optimizing Data Centers for Big Infrastructure Applications

Optimizing Data Centers for Big Infrastructure Applications white paper Optimizing Data Centers for Big Infrastructure Applications Contents Whether you need to analyze large data sets or deploy a cloud, building big infrastructure is a big job. This paper discusses

More information

Why Oracle Database Runs Best on Oracle Servers and Storage. Optimize the Performance of the World s #1 Enterprise Database.

Why Oracle Database Runs Best on Oracle Servers and Storage. Optimize the Performance of the World s #1 Enterprise Database. Why Oracle Database Runs Best on Oracle Servers and Storage Optimize the Performance of the World s #1 Enterprise Database. 2 Contents 4 Engineered to Work Together 6 Oracle Optimized Solutions 10 Lower

More information

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the

More information

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved. Collaborative Big Data Analytics 1 Big Data Is Less About Size, And More About Freedom TechCrunch!!!!!!!!! Total data: bigger than big data 451 Group Findings: Big Data Is More Extreme Than Volume Gartner!!!!!!!!!!!!!!!

More information

Hadoop for Enterprises:

Hadoop for Enterprises: Hadoop for Enterprises: Overcoming the Major Challenges Introduction to Big Data Big Data are information assets that are high volume, velocity, and variety. Big Data demands cost-effective, innovative

More information

Cray: Enabling Real-Time Discovery in Big Data

Cray: Enabling Real-Time Discovery in Big Data Cray: Enabling Real-Time Discovery in Big Data Discovery is the process of gaining valuable insights into the world around us by recognizing previously unknown relationships between occurrences, objects

More information

Getting the most out of big data

Getting the most out of big data IBM Software White Paper Financial Services Getting the most out of big data How banks can gain fresh customer insight with new big data capabilities 2 Getting the most out of big data Banks thrive on

More information

How To Build A Cloud Computer

How To Build A Cloud Computer Introducing the Singlechip Cloud Computer Exploring the Future of Many-core Processors White Paper Intel Labs Jim Held Intel Fellow, Intel Labs Director, Tera-scale Computing Research Sean Koehl Technology

More information

IBM Cognos 10: Enhancing query processing performance for IBM Netezza appliances

IBM Cognos 10: Enhancing query processing performance for IBM Netezza appliances IBM Software Business Analytics Cognos Business Intelligence IBM Cognos 10: Enhancing query processing performance for IBM Netezza appliances 2 IBM Cognos 10: Enhancing query processing performance for

More information

High-Performance Business Analytics: SAS and IBM Netezza Data Warehouse Appliances

High-Performance Business Analytics: SAS and IBM Netezza Data Warehouse Appliances High-Performance Business Analytics: SAS and IBM Netezza Data Warehouse Appliances Highlights IBM Netezza and SAS together provide appliances and analytic software solutions that help organizations improve

More information

Redefining Infrastructure Management for Today s Application Economy

Redefining Infrastructure Management for Today s Application Economy WHITE PAPER APRIL 2015 Redefining Infrastructure Management for Today s Application Economy Boost Operational Agility by Gaining a Holistic View of the Data Center, Cloud, Systems, Networks and Capacity

More information

Intel Cloud Builder Guide to Cloud Design and Deployment on Intel Xeon Processor-based Platforms

Intel Cloud Builder Guide to Cloud Design and Deployment on Intel Xeon Processor-based Platforms Intel Cloud Builder Guide to Cloud Design and Deployment on Intel Xeon Processor-based Platforms Enomaly Elastic Computing Platform, * Service Provider Edition Executive Summary Intel Cloud Builder Guide

More information

Pentaho High-Performance Big Data Reference Configurations using Cisco Unified Computing System

Pentaho High-Performance Big Data Reference Configurations using Cisco Unified Computing System Pentaho High-Performance Big Data Reference Configurations using Cisco Unified Computing System By Jake Cornelius Senior Vice President of Products Pentaho June 1, 2012 Pentaho Delivers High-Performance

More information

Advanced Big Data Analytics with R and Hadoop

Advanced Big Data Analytics with R and Hadoop REVOLUTION ANALYTICS WHITE PAPER Advanced Big Data Analytics with R and Hadoop 'Big Data' Analytics as a Competitive Advantage Big Analytics delivers competitive advantage in two ways compared to the traditional

More information

Cloud based Holdfast Electronic Sports Game Platform

Cloud based Holdfast Electronic Sports Game Platform Case Study Cloud based Holdfast Electronic Sports Game Platform Intel and Holdfast work together to upgrade Holdfast Electronic Sports Game Platform with cloud technology Background Shanghai Holdfast Online

More information

RAPID EMBEDDED LINUX* DEVELOPMENT

RAPID EMBEDDED LINUX* DEVELOPMENT Open Source on Intel case study Digital signage solutions from QNAP Systems Inc. use embedded Linux* to support usage models for advertising, marketing, and other types of public multimedia displays in

More information

How To Get A Client Side Virtualization Solution For Your Financial Services Business

How To Get A Client Side Virtualization Solution For Your Financial Services Business SOLUTION BRIEF Financial Services Industry 2nd Generation Intel Core i5 vpro and Core i7 vpro Processors Benefits of Client-Side Virtualization A Flexible, New Solution for Improving Manageability, Security,

More information

TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP

TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP Pythian White Paper TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP ABSTRACT As companies increasingly rely on big data to steer decisions, they also find themselves looking for ways to simplify

More information

Oracle Big Data Building A Big Data Management System

Oracle Big Data Building A Big Data Management System Oracle Big Building A Big Management System Copyright 2015, Oracle and/or its affiliates. All rights reserved. Effi Psychogiou ECEMEA Big Product Director May, 2015 Safe Harbor Statement The following

More information

Accelerating and Simplifying Apache

Accelerating and Simplifying Apache Accelerating and Simplifying Apache Hadoop with Panasas ActiveStor White paper NOvember 2012 1.888.PANASAS www.panasas.com Executive Overview The technology requirements for big data vary significantly

More information

Proactive Performance Management for Enterprise Databases

Proactive Performance Management for Enterprise Databases Proactive Performance Management for Enterprise Databases Abstract DBAs today need to do more than react to performance issues; they must be proactive in their database management activities. Proactive

More information

Unlocking the Intelligence in. Big Data. Ron Kasabian General Manager Big Data Solutions Intel Corporation

Unlocking the Intelligence in. Big Data. Ron Kasabian General Manager Big Data Solutions Intel Corporation Unlocking the Intelligence in Big Data Ron Kasabian General Manager Big Data Solutions Intel Corporation Volume & Type of Data What s Driving Big Data? 10X Data growth by 2016 90% unstructured 1 Lower

More information

Leading Virtualization 2.0

Leading Virtualization 2.0 Leading Virtualization 2.0 How Intel is driving virtualization beyond consolidation into a solution for maximizing business agility within the enterprise White Paper Intel Virtualization Technology (Intel

More information

Big Data Buzzwords From A to Z. By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012

Big Data Buzzwords From A to Z. By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012 Big Data Buzzwords From A to Z By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012 Big Data Buzzwords Big data is one of the, well, biggest trends in IT today, and it has spawned a whole new generation

More information

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics ESSENTIALS EMC ISILON Use the industry's first and only scale-out NAS solution with native Hadoop

More information

Quick Reference Selling Guide for Intel Lustre Solutions Overview

Quick Reference Selling Guide for Intel Lustre Solutions Overview Overview The 30 Second Pitch Intel Solutions for Lustre* solutions Deliver sustained storage performance needed that accelerate breakthrough innovations and deliver smarter, data-driven decisions for enterprise

More information

BIG DATA-AS-A-SERVICE

BIG DATA-AS-A-SERVICE White Paper BIG DATA-AS-A-SERVICE What Big Data is about What service providers can do with Big Data What EMC can do to help EMC Solutions Group Abstract This white paper looks at what service providers

More information

Minimize cost and risk for data warehousing

Minimize cost and risk for data warehousing SYSTEM X SERVERS SOLUTION BRIEF Minimize cost and risk for data warehousing Microsoft Data Warehouse Fast Track for SQL Server 2014 on System x3850 X6 (55TB) Highlights Improve time to value for your data

More information

BIG Data. An Introductory Overview. IT & Business Management Solutions

BIG Data. An Introductory Overview. IT & Business Management Solutions BIG Data An Introductory Overview IT & Business Management Solutions What is Big Data? Having been a dominating industry buzzword for the past few years, there is no contesting that Big Data is attracting

More information

Different NFV/SDN Solutions for Telecoms and Enterprise Cloud

Different NFV/SDN Solutions for Telecoms and Enterprise Cloud Solution Brief Artesyn Embedded Technologies* Telecom Solutions Intel Xeon Processors Different NFV/SDN Solutions for Telecoms and Enterprise Cloud Networking solutions from Artesyn Embedded Technologies*

More information

Big Data Are You Ready? Thomas Kyte http://asktom.oracle.com

Big Data Are You Ready? Thomas Kyte http://asktom.oracle.com Big Data Are You Ready? Thomas Kyte http://asktom.oracle.com The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated

More information

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM Sneha D.Borkar 1, Prof.Chaitali S.Surtakar 2 Student of B.E., Information Technology, J.D.I.E.T, sborkar95@gmail.com Assistant Professor, Information

More information

Intel Cloud Builders Guide to Cloud Design and Deployment on Intel Platforms

Intel Cloud Builders Guide to Cloud Design and Deployment on Intel Platforms Intel Cloud Builders Guide Intel Xeon Processor-based Servers RES Virtual Desktop Extender Intel Cloud Builders Guide to Cloud Design and Deployment on Intel Platforms Client Aware Cloud with RES Virtual

More information

Big Data. Value, use cases and architectures. Petar Torre Lead Architect Service Provider Group. Dubrovnik, Croatia, South East Europe 20-22 May, 2013

Big Data. Value, use cases and architectures. Petar Torre Lead Architect Service Provider Group. Dubrovnik, Croatia, South East Europe 20-22 May, 2013 Dubrovnik, Croatia, South East Europe 20-22 May, 2013 Big Data Value, use cases and architectures Petar Torre Lead Architect Service Provider Group 2011 2013 Cisco and/or its affiliates. All rights reserved.

More information

Five Technology Trends for Improved Business Intelligence Performance

Five Technology Trends for Improved Business Intelligence Performance TechTarget Enterprise Applications Media E-Book Five Technology Trends for Improved Business Intelligence Performance The demand for business intelligence data only continues to increase, putting BI vendors

More information

See the Big Picture. Make Better Decisions. The Armanta Technology Advantage. Technology Whitepaper

See the Big Picture. Make Better Decisions. The Armanta Technology Advantage. Technology Whitepaper See the Big Picture. Make Better Decisions. The Armanta Technology Advantage Technology Whitepaper The Armanta Technology Advantage Executive Overview Enterprises have accumulated vast volumes of structured

More information

Deploying an Operational Data Store Designed for Big Data

Deploying an Operational Data Store Designed for Big Data Deploying an Operational Data Store Designed for Big Data A fast, secure, and scalable data staging environment with no data volume or variety constraints Sponsored by: Version: 102 Table of Contents Introduction

More information

Sybase IQ Supercharges Predictive Analytics

Sybase IQ Supercharges Predictive Analytics SOLUTIONS BROCHURE Sybase IQ Supercharges Predictive Analytics Deliver smarter predictions with Sybase IQ for SAP BusinessObjects users Optional Photos Here (fill space) www.sybase.com SOLUTION FEATURES

More information

How does Big Data disrupt the technology ecosystem of the public cloud?

How does Big Data disrupt the technology ecosystem of the public cloud? How does Big Data disrupt the technology ecosystem of the public cloud? Copyright 2012 IDC. Reproduction is forbidden unless authorized. All rights reserved. Agenda Market trends 2020 Vision Introduce

More information

Introduction. Various user groups requiring Hadoop, each with its own diverse needs, include:

Introduction. Various user groups requiring Hadoop, each with its own diverse needs, include: Introduction BIG DATA is a term that s been buzzing around a lot lately, and its use is a trend that s been increasing at a steady pace over the past few years. It s quite likely you ve also encountered

More information

Get More Scalability and Flexibility for Big Data

Get More Scalability and Flexibility for Big Data Solution Overview LexisNexis High-Performance Computing Cluster Systems Platform Get More Scalability and Flexibility for What You Will Learn Modern enterprises are challenged with the need to store and

More information