CA Big Data Management: It s here, but what can it do for your business?



Similar documents
CA Chorus for Security and Compliance Management Deep Dive

Web Admin Console - Release Management. Steve Parker Richard Lechner

agility made possible

Change for the Better: Improved Productivity via CA Service Desk Manager

Dynamic Data Center Update:

Hands-on Lab: CA ehealth PM Integration with Cisco Unified Communications Manager. Eve Curcio

are you helping your customers achieve their expectations for IT based service quality and availability?

can you effectively plan for the migration and management of systems and applications on Vblock Platforms?

CA Workload Automation Strategy and Roadmap. Bill Sherwin Principal Consultant EMEA Workload Automation Owner

CA Workload Automation Restart Option for z/os Schedulers: NJE Restarts. Jared Moran

journey to a hybrid cloud

CA Workload Automation CA 7 Edition r11.3

CA Virtual Assurance for Infrastructure Managers

Virtualizing Apache Hadoop. June, 2012

LAB: Assembling a Business Service Insight (BSI) Dashboard

CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data

Deploying Hadoop with Manager

CA NSM System Monitoring Option for OpenVMS r3.2

can you improve service quality and availability while optimizing operations on VCE Vblock Systems?

RECOVERY OF CA ARCSERVE DATABASE IN A CLUSTER ENVIRONMENT AFTER DISASTER RECOVERY

CA Technologies optimizes business systems worldwide with enterprise data model

how can I improve performance of my customer service level agreements while reducing cost?

CA Process Automation for System z 3.1

PEPPERDATA OVERVIEW AND DIFFERENTIATORS

CA ERwin Data Modeling's Role in the Application Development Lifecycle

IBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems

Testing 3Vs (Volume, Variety and Velocity) of Big Data

CA Scheduler Job Management r11

CA SYSVIEW Performance Management r13.0

CA s Cloud Storage for System z

CA Capacity Manager. Product overview. agility made possible

The Role of Service Catalog in IT Asset Management. Faisal Faquih Khalid

The Future of Workload Automation in the Application Economy

Measuring end-to-end application performance in an on-demand world. Shajeer Mohammed Enterprise Architect

IBM Tivoli Netcool network management solutions for enterprise

solution brief September 2011 Can You Effectively Plan For The Migration And Management of Systems And Applications on Vblock Platforms?

Deployment Options for Microsoft Hyper-V Server

Session 0202: Big Data in action with SAP HANA and Hadoop Platforms Prasad Illapani Product Management & Strategy (SAP HANA & Big Data) SAP Labs LLC,

Connecting the dots from automated software discovery to asset management

Bringing Big Data to People

CA Virtual Assurance for Infrastructure Managers

Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014

CA Workload Automation Agents for Mainframe-Hosted Implementations

CA Workload Automation for SAP Software

Testing Big data is one of the biggest

CA NetSpy Network Performance r12

CA Encryption Key Manager r14.5

Apache Hadoop: The Big Data Refinery

CA Workload Automation Agents Operating System, ERP, Database, Application Services and Web Services

CA SOLVE:Central Service Desk for z/os

Data Modeling for Big Data

can you simplify your infrastructure?

CA Telon Application Generator r5.1

CA Compliance Manager for z/os

Big Data Management and Security

#TalendSandbox for Big Data

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE

CA SiteMinder SSO Agents for ERP Systems

CA CMDB Connector for z/os version 2.0

CA Service Desk Manager Change Management. Ken Laufmann Raymond Cadden

Service Virtualization CA LISA introduction. Jim Dugger CA LISA Product Marketing Manager Steve Mazzuca CA LISA Public Sector Alliances Director

Transforming IT Processes and Culture to Assure Service Quality and Improve IT Operational Efficiency

Tips & Tricks: CA CMDB Data Mining Techniques. John Sorensen & Neil Mitchell

Intel HPC Distribution for Apache Hadoop* Software including Intel Enterprise Edition for Lustre* Software. SC13, November, 2013

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

CA NetMaster Network Management for TCP/IP r12.0

Workshop on Hadoop with Big Data

CA Cloud Storage for System z

Application Virtualisation Management. Steve Parker

Enterprise Report Management CA View, CA Deliver, CA Dispatch, CA Bundl, CA Spool, CA Output Management Web Viewer

CA Spectrum r Overview. agility made possible

Luncheon Webinar Series May 13, 2013

CA High Performance Recovery for IMS for z/os

Data Deduplication: An Essential Component of your Data Protection Strategy

CA Cloud Service Delivery Platform

Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP

Dell Reference Configuration for Hortonworks Data Platform

CA Service Desk Manager

An Oracle White Paper June High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database

Integrating CA Software Change Management with CA Service Desk Manager for Enterprise Change Control

10A CA Plex in the Cloud. Rob Layzell CA Technologies

Driving workload automation across the enterprise

Modernizing Your Data Warehouse for Hadoop

CA Aion Business Rules Expert 11.0

CA Endevor Software Change Manager Release 15.1

Transcription:

CA Big Data Management: It s here, but what can it do for your business? Mike Harer CA Technologies August 7, 2014 Session Number: 16256 Insert Custom Session QR if Desired. Test link: www.share.org

Big Data Management Topics 1 2 3 4 5 6 Big Data Background Big Data Management Challenges Our Vision Big Data Management Solution Overview Target Personas Solving the Important Management Pains 2

Big Data Means Different Things To Different People Customers tend to define Big Data in a broad sense Any type of net new analytical processing that is different from the traditional data warehouse applications in place today This is differentiated by (near) real time analytical capabilities Defined by the types and speed of data being analyzed 3

Big Data Definition Large Volumes of a wide Variety of data collected from various sources across the enterprise High-Velocity capture, discovery and/or analysis Veracity keeping the right, trusted data 4

Big Data Growing Fast CAPTURING AND MANAGING LOTS OF INFORMATION Data volumes are doubling every year Organizations are also storing three or more years of data WORKING WITH MANY NEW TYPES OF DATA 80 percent of data is unstructured (such as images, audio, tweets, etc.) EXPLOITING THESE MASSES OF INFORMATION AND NEW DATA TYPES WITH NEW STYLES OF APPLICATIONS New classes of analytic applications are reaching the market, all based on a next generation big data platform COMMODITIZED HARDWARE AND SOFTWARE The final piece of the big data puzzle is the low-cost hardware and software environments Capturing and exploiting big data would be much more difficult and costly without the contributions of these cost-effective advances 5

Enterprises Ahead in Big Data Initiatives 70% OF ENTERPRISE ORGANIZATIONS already deployed/plan to deploy big data projects vs. 56% OF SMBs 44% 30% 20% 9% 16% 12% 10% 10% 9% 12% 15% 12% We have already deployed/implemented big data projects <1,000 1,000+ We are in the process of implementing or pilot testing big data projects We have plans to deploy projects over the next 12 months We are considering deploying or implementing big data projects within the next 13-24 months We are likely to implement big data projects in the future but have no specific timeframe in place We have no plans to deploy or implement big data projects Q. Is your company currently implementing, planning or considering projects (i.e. devising strategies and projects to generate more value from existing data)? Source: IDG Enterprise Big Data Study, 2014 6

Going from the Science Project to Production The organization realizes that the analytics and insights coming out of a Big Data project are essential To keep costs down, you start with the basic Hadoop distribution from Apache Maybe a free tool or two and off you go Gain traction, under a huge amount of pressure to deliver or the business gets farther behind More tools and software and data sources are added Hadoop Common Common utilities HDFS Hadoop distributed file system MapReduce Parallel processing Pig High-level language for MapReduce Streaming Jobs based on any exe Hive Data warehouse HBase Non-relational dbase on HDFS Nagios Open Source Monitoring Tool Ganglia Open Source Monitoring Tool Oozie Workflow Engine / Scheduler Cloudera Manager Hadoop Management Ambari Hadoop Management Console HCatalog, ZooKeeper, Sqoop By the time you are ready to put this to productive use, you realize you have a huge number of moving parts, tools from many vendors and a ton of complexity has been created. 7

The Big Big Data Management Pains The Need to Overcome Many Challenges Managing complex multi-vendor big data environments Finding Hadoop/Big Data experts Understanding capacity requirements for rapidly changing business needs Complexity increases, manual processes often required System problems hard to isolate, downtime increases Unique tools and shortcomings Multiple licensing agreements to manage Driving forces acquisitions, department consolidations, driving need for greater operational efficiency MULTI-VENDOR HADOOP MANAGEMENT DOMAINS Cloudera Manager Apache Ambari Apache Ambari Apache Ambari Apache Ambari Cloudera Manager AMZ EMR Console Mainframe 8

Ask Yourself? 1 2 3 4 5 6 How many people do you have to manage your Big Data infrastructure? Do your Big Data administrators always know the health of the systems? Can you detect most problems before significant system outages occur? How many different monitoring tools do you have in place now? How do you know if your capacity is optimized for cost and performance? What was the financial impact of downtime over the past year? 9

CA Big Data Management Our Vision PROVIDE CUSTOMERS AN INNOVATIVE APPROACH TO MORE EFFICIENTLY MANAGE HETEROGENEOUS BIG DATA ENVIRONMENTS TO MAXIMIZE ENTERPRISE OPERATIONAL EFFICIENCY. Traditional data management is changing rapidly. Evolving big data technologies offer a range of new business advantages when managing disparate environments that can be a mix of traditional, mainframe and distributed big data environments. CA Technologies is the market leader in delivering world-class IT Management solutions. Delivering a world-class Big Data Management solution is a logical extension to our IT management portfolio and will continue to demonstrate our leadership and innovation in the Big Data market. 10

CA Big Data Management Solution Overview (Release 1.0 -- CA World Launch Nov/2014) MANAGE ALL BIG DATA ENVIRONMENTS FROM ONE PLACE SINGLE, CONSISTENT MANAGEMENT UI EXPERIENCE Example UI mock-up SECURE ACCESS TO BIG DATA (HADOOP) ENVIRONMENTS SIMPLIFIED HETEROGENEO US ENVIRONMENT MANAGEMENT Big Data Management Server OPERATIONALIZE, MANAGE & SECURE HADOOP ENVIRONMENTS MULTI-VENDOR HADOOP MANAGEMENT DOMAINS Linux / x86 11

A New Role in the Organization is Born Targeted User Personas PERSONAS: Big Data / Hadoop Administrator (Rel 1.0) Very different than the traditional DBA Big Data environment is much more complex than traditional database application even compared to large ERP systems Many more moving parts Volume, Velocity, Variety Big Data / Hadoop Developer (Rel 1.x -2.0) Role / Responsibilities: HADOOP ADMINISTRATOR Day-to-day operations & support of Hadoop infrastructure platforms Monitor/maintain existing clusters and provision new ones Integrate enterprise monitoring tools like Ganglia and OpenTSDB Analyze current workloads & enable subsequent capacity planning. Publish various production metrics to system owners & mgmt. Utilize enterprise automation tools as well as configuration management tools e.g. Puppet, Chef and/or Cloudera Mgr Perform regular back ups for replication as well as disaster recovery Educate existing SysOps team members on the Hadoop ecosystem 12

Solving the Important Management Pains CA Big Data Management 13 Applying best practices to... Quickly and easily deploy new clusters Receive alerts before system degradation causes significant cluster/job failures (w/ all details) Monitor system capacity metrics and suggest corrective action(s) Optimize job performance based on watching heap size and autoadjusting thru automation Detect misconfigured settings and send alert with suggested actions to remedy Schedule/monitor jobs thru a single, consistent UI experience Hadoop Administrator Challenges We get alerts from zabbix saying workflow is slow or job failed. Some workflows automatically try and restart after the failure but if it doesn't start, we manually start it. We use one Cloudera Manager instance for every cluster so that we do not have a single point of failure. Only admins can login to CM, dev team use native Apache Hadoop UI

Solving the Important Management Pains Example Use Case Heterogeneous System Management Ability to control system operations across all clusters in a heterogeneous big data environment from a single, consistent UI experience Quick access to system management operations Based on Best Practices, eliminates manual processes and decreases system errors/downtime screen mock-up 14

Solving the Important Management Pains Example Use Case Resource Reporting Aggregated data enables Hadoop administrators to more easily identify the cause of capacity, performance and system disruptions. Underperforming clusters Job execution slowed or incomplete screen mock-up 15 Visual graphing of time-series data coming from system metrics (CPU, Network IO, disk space, Map/Reduce slots, memory, swap space) Generate Historical Job Reports (over hours, days, weeks, months)

Solving the Important Management Pains Example Use Case Activity (Job) Monitoring Assess integrity of all Hadoop jobs (running, killed, suspended,...) Visually identify performance issues Unified view across all clusters, nodes within a mixed multivendor big data environment (e.g. Cloudera, Hortonworks, Apache) screen mock-up 16

Solving the Important Management Pains Example Use Case Alert Management Tightly integrated with System Monitoring to send alerts when configured conditions are detected. Integrated with an Automation Framework Can be configured to trigger alerts based on absolute values and percentages. For example, generate alert: based on metrics threshold violations based on complex rules, like Event Frequency or patterns of Events based on a Severity screen mock-up 17

Q & A Mike Harer Sr. Principal Product Manager 100 Staples Drive Framingham, MA 01702 Office: +1-508-598-6547 Mobile: +1-978-424-5677 Michael.harer@ca.com www.ca.com/bigdata 18

Disclaimer Copyright 2014 CA. All rights reserved. IBM, System z, zenterprise and z/os are trademarks of International Business Machines Corporation in the United States, other countries, or both. All trademarks, trade names, service marks and logos referenced herein belong to their respective companies. This presentation was based on current information and resource allocations as of July 2014 and is subject to change or withdrawal by CA at any time without notice. Notwithstanding anything in this presentation to the contrary, this presentation shall not serve to (i) affect the rights and/or obligations of CA or its licensees under any existing or future written license agreement or services agreement relating to any CA software product; or (ii) amend any product documentation or specifications for any CA software product. The development, release and timing of any features or functionality described in this presentation remain at CA s sole discretion. Notwithstanding anything in this presentation to the contrary, upon the general availability of any future CA product release referenced in this presentation, CA will make such release available (i) for sale to new licensees of such product; and (ii) to existing licensees of such product on a when and if-available basis as part of CA maintenance and support, and in the form of a regularly scheduled major product release. Such releases may be made available to current licensees of such product who are current subscribers to CA maintenance and support on a when and ifavailable basis. In the event of a conflict between the terms of this paragraph and any other information contained in this presentation, the terms of this paragraph shall govern. Certain information in this presentation may outline CA s general product direction. All information in this presentation is for your informational purposes only and may not be incorporated into any contract. CA assumes no responsibility for the accuracy or completeness of the information. To the extent permitted by applicable law, CA provides this presentation as is without warranty of any kind, including without limitation, any implied warranties or merchantability, fitness for a particular purpose, or non-infringement. In no event will CA be liable for any loss or damage, direct or indirect, from the use of this document, including, without limitation, lost profits, lost investment, business interruption, goodwill, or lost data, even if CA is expressly advised in advance of the possibility of such damages. CA confidential and proprietary. No unauthorized copying or distribution permitted. 19