RED HAT AND HORTONWORKS: OPEN MODERN DATA ARCHITECTURE FOR THE ENTERPRISE

Similar documents
OPEN MODERN DATA ARCHITECTURE FOR FINANCIAL SERVICES RISK MANAGEMENT

ON-PREMISE OR IN THE CLOUD, A SINGLE JAVA EE APPLICATION PLATFORM

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

TRANSFORMING I.T. WITH AN OPEN HYBRID CLOUD

Create a single 360 view of data Red Hat JBoss Data Virtualization consolidates master and transactional data

FINANCIAL SERVICES: FRAUD MANAGEMENT A solution showcase

CHOOSING THE RIGHT STORAGE PLATFORM FOR SPLUNK ENTERPRISE

cloud functionality: advantages and Disadvantages

TRANSFORM BIG DATA INTO ACTIONABLE INFORMATION

JBOSS ENTERPRISE SOA PLATFORM AND JBOSS ENTERPRISE DATA SERVICES PLATFORM VALUE PROPOSITION AND DIFFERENTIATION

RED HAT OPENSTACK PLATFORM A COST-EFFECTIVE PRIVATE CLOUD FOR YOUR BUSINESS

RED HAT CLOUD SUITE FOR APPLICATIONS

IS AN OPEN SOURCE BUSINESS PROCESS MANAGEMENT SOLUTION RIGHT FOR YOU?

Independent process platform

Build A private PaaS.

HOW RED HAT BRINGS OPENSTACK INTO THE ENTERPRISE by Bryan Che and Gordon Haff

JBoss Enterprise MIDDLEWARE

BRIDGING THE GAP BETWEEN BUSINESS AND IT

SUBSCRIPTION GUIDE FOR RED HAT JBOSS MIDDLEWARE

Introducing Red Hat s JBoss Portfolio

Zend and IBM: Bringing the power of PHP applications to the enterprise

Red Hat Cloud, HP Edition:

RED HAT JBOSS FUSE. An open source enterprise service bus

Hadoop in the Hybrid Cloud

Open Source Business Rules Management System Enables Active Decisions

RED HAT CLOUDFORMS ENTERPRISE- GRADE MANAGEMENT FOR AMAZON WEB SERVICES

OPENSTACK IN THE ENTERPRISE Best practices for deploying enterprise-grade OpenStack implementations

Analance Data Integration Technical Whitepaper

CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data

The Road To enterprise paas

RED HAT ISV PROGRAM GUIDE NORTH AMERICA

WORLDWIDE ISV PROGRAM GUIDE - RED HAT CONNECT FOR BUSINESS PARTNERS

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

Apache Hadoop: The Big Data Refinery

Red Hat CloudForms: Open Clouds Under

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

Organizations that are standardizing today are enjoying lower management costs, better uptime. INTRODUCTION

The IBM Cognos Platform

A Comprehensive Solution for API Management

PLATFORM-AS-A-SERVICE, DEVOPS, AND APPLICATION INTEGRATION. An introduction to delivering applications faster

Microsoft Big Data. Solution Brief

A Modern Data Architecture with Apache Hadoop

34% DOING MORE WITH LESS How Red Hat Enterprise Linux shrinks total cost of ownership (TCO) compared to Windows. I n a study measuring

Data Integration Checklist

MONITORING RED HAT GLUSTER SERVER DEPLOYMENTS With the Nagios IT infrastructure monitoring tool

Increase Agility and Reduce Costs with a Logical Data Warehouse. February 2014

Introducing the Reimagined Power BI Platform. Jen Underwood, Microsoft

RED HAT ONLINE PARTNER ENABLEMENT NETWORK (OPEN)

Analance Data Integration Technical Whitepaper

Databricks. A Primer

Harnessing big data with Hortonworks Data Platform and Red Hat JBoss Data Virtualization

Foundations for your. portable cloud

Open source business rules management system

HDP Hadoop From concept to deployment.

HDP Enabling the Modern Data Architecture

Red Hat Enterprise Linux is open, scalable, and flexible

JBoss Enterprise MIDDLEWARE

Top five lessons learned from enterprise hybrid cloud projects

Bringing Open Choice & Application

ON THE ROAD TO OPEN HYBRID CLOUD BRYAN CHE GENERAL MANAGER, CLOUD BU, RED HAT

JBoss enterprise soa platform

FUJITSU Software Interstage Business Operations Platform: A Foundation for Smart Process Applications

Databricks. A Primer

Enterprise Data Integration

openshift enterprise whitepaper Gordon Haff

Business Intelligence and Service Oriented Architectures. An Oracle White Paper May 2007

An Oracle White Paper June Integration Technologies for Primavera Solutions

Red Hat JBoss Overview Intelligent Integrated Enterprise!!!! Blaine Mincey Sr. Middleware Solutions Architect

Maximize strategic flexibility by building an open hybrid cloud Gordon Haff

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

UNLEASHING THE VALUE OF THE TERADATA UNIFIED DATA ARCHITECTURE WITH ALTERYX

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

What is it? What does it do? Benefits

GigaSpaces Real-Time Analytics for Big Data

JOURNAL OF OBJECT TECHNOLOGY

A TECHNICAL WHITE PAPER ATTUNITY VISIBILITY

RED HAT JBOSS A-MQ COMPARED WITH IBM WEBSPHERE MQ 7.5

Apache Hadoop Patterns of Use

Investor Presentation. Second Quarter 2015

Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities

How To Build An Open Cloud

RED HAT JBOSS FUSE. A lightweight, flexible integration platform

Transcription:

WHITEPAPER RED HAT AND HORTONWORKS: OPEN MODERN DATA ARCHITECTURE FOR THE ENTERPRISE A Hortonworks and Red Hat whitepaper INTRODUCTION WHAT IS HADOOP? Apache Hadoop is an opensource technology born out of the experience of web scale consumer companies such as Yahoo, Facebook and others, who were among the first to confront the need to store and process massive quantities of digital data. Big data represents an increasingly important opportunity for enterprise business leaders. According to a 2013 study by consulting firm Bain & Company 1, companies that have excelled in developing advanced analytical capabilities are twice as likely to be a top-quartile financial performer within their industries, three times more likely to execute decisions as intended, and five times more likely to make decisions faster. With these kinds of results at stake, it s no surprise that big data deployments have become a top priority for enterprise IT leaders. When it comes to deploying big data capabilities, the starting place for many enterprises is Apache Hadoop. A popular initial application is the construction of a data lake an enterprise-wide repository for the volumes of unstructured and semi-structured data that today s businesses are amassing. With a data lake as the centerpiece of a modern data architecture, enterprises are able to take advantage of the efficiency, opportunity, and insight that comes from managing and analyzing their data with a scalable and flexible data platform. But doing so is not without challenges. Specifically, IT leaders are seeking solutions for the following: How to integrate Hadoop into existing information infrastructure How to build robust, data-centric applications with Hadoop How to operationalize Hadoop and deliver high-quality service as the enterprise increases its dependence on the technology To help meet the demands of enterprise big data initiatives, Red Hat has partnered with Hortonworks, an industry leader whose Hortonworks Data Platform (HDP) provides a tested, trusted, and open foundation for the modern data architecture. Together, the companies provide a comprehensive solution that addresses the challenges of building big data applications, integrating information across the enterprise, and operating and supporting an enterprise-grade Hadoop environment. facebook.com/redhatinc @redhatnews linkedin.com/company/red-hat 1 http://www.bain.com/publications/articles/the-value-of-big-data.aspx

PROFILE: FINANCIAL SERVICES A large financial institution took separate data warehouses from multiple departments and combined them into a geographical repository in Hadoop for analysis. THE HORTONWORKS AND RED HAT BIG DATA SOLUTION Red Hat and Hortonworks are building on a shared, open source approach that addresses the big data requirements of key enterprise stakeholders. By tightly integrating HDP with open hybrid cloud technologies including Red Hat Storage, Red Hat JBoss Middleware, Red Hat Enterprise Linux, and Red Hat Enterprise Linux OpenStack Platform with OpenJDK support Hortonworks and Red Hat are delivering solutions that enable the next generation of big data applications. This allows the bank to construct new and more accurate scores of the risk in its customer portfolios. The more accurate scores allowed the bank to manage its exposure and offer customers better products and advice. Analyst Statistical analysis BI / reporting, ad hoc analysis Interactive web and mobile application Enterprise applications DEVELOPMENT AND INTEGRATION SERVICE App Developer HORTONWORKS DATA PLATFORM HADOOP SERVICES Data architect INFRASTRUCTURE SERVICES Operator JB0074 INTEGRATING HADOOP DATA WITH ENTERPRISE INFORMATION The recent emergence and explosion of new types of data has put tremendous pressure on traditional enterprise data repositories. One technology that has emerged as a way to realize the value in big data is Apache Hadoop, with momentum described as unstoppable by Forrester Research in the Forrester WaveTM: Big Data Hadoop Solutions, Q1 2014. Once Hadoop is in place, the challenge for IT departments becomes how to put all that data to good use. To get the most value, unstructured data collected in Hadoop must be integrated with information spread across many applications and systems in the enterprise. 2

PROFILE: FOOD SERVICE For food retailers, the fresh food category is important for customer satisfaction. Providing sufficient stock while avoiding food waste makes for happy customers and keeps the retailer profitable. Data virtualization is a straightforward approach to integrating enterprise information. It enables business leaders, data scientists, innovators, and operational decision makers to easily obtain the information they need to be effective, without worrying about the details of accessing, storing, converting, integrating, copying, moving, securing, and distributing data. To support big data, an organization s integration technology must work with a range of underlying data sources, including conventional databases, flat files, Hadoop, in-memory computing, and other legacy and emerging technologies. It must tap into streams of data, and be able to harvest data from transactional systems, business applications, and enterprise data warehouses. One retailer uses Hadoop to create a fully automated, data-driven replenishment process based on a variety of internal and external data sources combined with advanced predictive analytics. Business intelligence tools and analytical applications Mobile and enterprise applications DATA CONSUMERS Enterprise service bus (ESB), extract transform load (ETL) Service-oriented architecture (SOA) applications and portals RED HAT JBOSS DATA VIRTUALIZATION CONSUME COMPOSE CONNECT Provision data via any interface JDBC, ODBC, REST, OData, SOAP etc. Unified, reusable, virtual data layer Access data from any source Design tools Dashboard Optimization Caching Security Metadata NoSQL Hadoop Databases and data warehouse Mainframe Enterprise applications Excel, CSV or XML files SaaS and cloud applications DATA SOURCES JB0041-2 3

PROFILE: GOVERNMENT A federal law enforcement agency uses Hadoop to process massive amounts of data from numerous sources to monitor criminal behavior and communications and expose non-obvious relationships between suspects. This allows the agency to proactively engage and disrupt planned criminal activities, while executing on its mission in a costeffective and efficient manner. Red Hat JBoss Data Virtualization meets all the requirements of big data integration. It sits in front of multiple data sources and allows them to be treated like a single source, delivering the desired data in the required form and at the right time to any application or user. Through the Red Hat and Hortonworks alliance, Red Hat JBoss Data Virtualization has been integrated with Hortonworks Data Platform. Red Hat JBoss Data Virtualization facilitates and improves the use of HDP in the enterprise by: Abstracting Hadoop and other data sources into relational-like views. Integrating HDP with existing enterprise data sources. Adding the ability to query HDP data in real-time. Providing access to HDP data via web services, JDBC and ODBC, and other standard interfaces. Adding security and governance to the big data infrastructure. Flattening data silos through a unified data layer. These capabilities allow companies deploying Red Hat JBoss Data Virtualization in conjunction with HDP to experience a significant reduction in the time and effort required to turn raw data into actionable insights. BUILDING BIG DATA APPLICATIONS WITH HADOOP With a strategy in place for integrating enterprise information, an organization s architects and developers must be able easily incorporate the data and capabilities of Hadoop and other systems into the applications and systems that they create. Robust enterprise applications typically depend on the careful orchestration of a variety of middleware services. Red Hat and Hortonworks are working together to ensure interoperability between HDP and the Red Hat JBoss Middleware portfolio, including: PROFILE: HEALTHCARE A healthcare services provider uses Hadoop to aggregate and analyze patient Electronic Health Records (EHR) from hospitals and other healthcare providers and make them available online to doctors. This allows them to increase the quality of care and lower the cost of providing service, while eliminating duplicative tests and procedures. Red Hat JBoss BRMS Red Hat JBoss BRMS is a comprehensive business decision automation platform that combines business rules management and complex event processing (CEP) into a single open source solution. Today s enterprise operates in an environment of ever-increasing business, market, and regulatory complexity. Traditionally, the rules required to navigate this complexity are implicitly encoded in and scattered across the enterprise application portfolio. Red Hat JBoss BRMS helps the enterprise: Increase agility by externalizing business rules so that they may be more easily managed and updated. Reduce time-to-market by incorporating existing rules into new applications. Enhance responsiveness by applying rules to operational decisions in both batch and real-time. Drive cohesion by allowing business, IT, and analytics professionals to work together to develop and apply analytics. 4

Using Red Hat JBoss BRMS in conjunction with Red Hat JBoss Data Virtualization allows IT organizations to connect enterprise decision services to Hadoop and other enterprise data sources. Seamless access to all organizational data sources without time-consuming and costly integration and custom coding means organizations can improve data-driven decision-making and optimize business outcomes. Red Hat JBoss Data Grid Red Hat JBoss Data Grid is a distributed NoSQL in-memory data grid that provides high-performance, low-latency access to big data. As the amount of data enterprise applications must deal with grows, reads and writes to traditional back-end data stores become a major performance bottleneck. Because the cost of memory has continued to decrease, storing data in-memory is becoming mainstream. Red Hat JBoss Data Grid provides: Improved application performance by allowing data to be quickly accessed from memory. Greater flexibility by storing different types of objects without a fixed data model, and allowing data to be access via a variety of APIs. Elastic scaling allowing grid nodes to be added simply and non-disruptively. High availability, by replicating data across multiple distributed nodes. Using Red Hat JBoss Data Grid with Red Hat JBoss Data Virtualization, an organization can easily combine historical information from Hadoop and other sources with real-time data stored in the data grid. This data mashup gives enterprises greater visibility into operational conditions and allows them to more rapidly react to changing market conditions. As a result of the partnership between Red Hat and Hortonworks, data from HDP is easily integrated into traditional enterprise-grade applications built around Red Hat JBoss middleware. OPERATIONALIZING ENTERPRISE HADOOP As enterprise big data programs mature and Hadoop becomes more deeply embedded in critical operations, the ability to support and operate it efficiently and reliably becomes increasingly important. To aid in operating a modern data architecture at scale, Red Hat and Hortonworks have collaborated to integrate HDP with Red Hat s platform technologies. As a result of the companies efforts: Red Hat Enterprise Linux with OpenJDK now provides an open and flexible operating and development environment for Hadoop and enterprise analytical applications. Deploying HDP on trusted and familiar Red Hat Enterprise Linux assures long-term supportability, speedy issue remediation, and reduced operating costs. Red Hat Enterprise Linux OpenStack Platform now delivers elastic Hadoop services on a secure, private cloud infrastructure. Deploying HDP on a private cloud helps enterprises reduce costs while providing the ability to quickly scale the Hadoop environment to meet business demands, thus improving agility. 5

OpenShift Enterprise by Red Hat, the company s on-premise Platform-as-a-Service (PaaS) offering, gives developers the ability to rapidly develop and deploy entire application stacks on Hortonworks Data Platform. This allows developers to standardize their workflow and create repeatable processes for application delivery, streamlining application development while tapping into the insight and value from data stored in Hadoop. Red Hat Storage Server, an open, software-defined storage platform built on the robust GlusterFS distributed file system, can now be used with HDP to provide a secure and resilient general-purpose storage system with multiple APIs, making all data simultaneously available via Hadoop, POSIX, and web services interfaces. Through a joint engineering effort, Red Hat and Hortonworks have created a tightly integrated, easily operable, and highly supportable platform for next-generation data applications and services. CONCLUSION The strategic alliance between Hortonworks and Red Hat is making it easier for enterprises to harness the power of big data. Working together, the companies are building on a shared, community-oriented, open source approach that addresses the growing big data requirements of key enterprise stakeholders: Integrating Hadoop into existing information infrastructure. Building enterprise-grade, data-centric applications with Hadoop. Operationalizing Hadoop and delivering high-quality services around it. With an enterprise Hadoop platform tightly integrated with open hybrid cloud technologies, Hortonworks and Red Hat are delivering open, secure, cloud-ready, and cost-effective solutions that help customers more easily harness the power of Hadoop. For more information on the Red Hat and Hortonworks partnership, visit http://hortonworks.com/ partner/redhat/. For details on the open engineering collaboration, visit the Red Hat labs page. 6

WHY HORTONWORKS FOR HADOOP? Founded in 2011 by 24 engineers from the original Yahoo! Hadoop development and operations team, Hortonworks has amassed more Hadoop experience under one roof than any other organization. Hortonworks team members are active participants and leaders in Hadoop development; designing, building, and testing the core of the Hadoop platform. The company has years of experience in Hadoop operations and is best suited to support your mission-critical Hadoop project. Read more at http://hortonworks.com/why. OPEN LEADERSHIP ECOSYSTEM ENDORSEMENT ENTERPRISE RIGOR Hortonworks has a singular focus and commitment to drive innovation in the open exclusively via the Apache Software Foundation process. Hortonworks is responsible for the majority of core code-base advances to deliver Apache Hadoop as an enterprise data platform. Hortonworks is focused on the deep integration of Hadoop with existing data center technologies and team capabilities. Hortonworks has secured strategic relationships with trusted data center partners including Microsoft, SAP, Teradata, Rackspace, and many more. Hortonworks has a world-class enterprise support and services organization with vast experience in the largest Hadoop deployments. Hortonworks engineers and certifies Apache Hadoop with the enterprise in mind, all tested with realworld rigor in the world s largest Hadoop clusters. For an independent analysis of Hortonworks Data Platform, download the Forrester Wave : Big Data Hadoop Solutions, Q1 2014 from Forrester Research. ABOUT HORTONWORKS Hortonworks develops, distributes, and supports the only 100% open source Apache Hadoop data platform. Our team comprises the largest contingent of builders and architects within the Hadoop ecosystem who represent and lead the broader enterprise requirements within these communities. The Hortonworks Data Platform provides an open platform that deeply integrates with existing IT investments and upon which enterprises can build and deploy Hadoop-based applications. Hortonworks has deep relationships with the key strategic data center partners that enable our customers to unlock the broadest opportunities from Hadoop. 7

WHITEPAPER Red Hat and Hortonworks: Open modern data architecture for the enterprise DISCLAIMER: The OpenStack Word Mark and OpenStack Logo are either registered trademarks / service marks or trademarks / service marks of the OpenStack Foundation, in the United States and other countries, and are used with the OpenStack Foundation s permission. Neither Hortonworks nor Red Hat are affiliated with, endorsed or sponsored by the OpenStack Foundation or the OpenStack community. ABOUT RED HAT Red Hat is the world s leading provider of open source solutions, using a community-powered approach to provide reliable and high-performing cloud, virtualization, storage, Linux, and middleware technologies. Red Hat also offers award-winning support, training, and consulting services. Red Hat is an S&P company with more than 80 offices spanning the globe, empowering its customers businesses. facebook.com/redhatinc @redhatnews linkedin.com/company/red-hat NORTH AMERICA 1 888 REDHAT1 EUROPE, MIDDLE EAST, AND AFRICA 00800 7334 2835 europe@ ASIA PACIFIC +65 6490 4200 apac@ LATIN AMERICA +54 11 4329 7300 info-latam@ #12525077_v1_0914 Copyright 2014 Red Hat, Inc. Red Hat, Red Hat Enterprise Linux, the Shadowman logo, and JBoss are trademarks of Red Hat, Inc., registered in the U.S. and other countries. Linux is the registered trademark of Linus Torvalds in the U.S. and other countries.