Cisco Solutions for Big Data and Analytics Tarek Elsherif, Solutions Executive November, 2015
Agenda Major Drivers & Challengs Data Virtualization & Analytics Platform Considerations for Big Data & Analytics Cisco UCS Integrated Infrastructure for Big Data Q & A
Big Data & Analytics Major Drivers and Market
Number of Devices Growing Exponentially 50 Billion 25 Billion 12.5 Billion 2010 2015 2020
Accelerating Transitions Increasing Pressure TECHNOLOGY TRANSITIONS Cloud Mobility/ Video New Breed of Apps Internet of Things Big Data & Analytics Growth & Productivity New Business Models User Experience & Expectations Globalization Security & Regulatory Compliance INCREASING BUSINESS DEMANDS
Business Outcomes Business Opportunity As Data Grows, Leading Businesses Use It To Drive Better Outcomes And Beat Their Competition Business Leaders Business Outcomes Other Businesses Customer Profitability Faster Time to Market Cost Reduction Risk Management Compliance Overall Agility Data
Business Pain Data Silos Proliferating, Data is Now Distributed Everywhere How Does the Business Leverage All the Data? Traditional Data Sources Big Data/IOE Sources Cloud Data Sources
GARTNER: Organizations that modernize their information management capability will display 20% higher financial performance (than themselves previously) HARVARD BUSINESS REVIEW OCT 2012: Datadriven Companies are 5% more productive and 6% more profitable than competitors GARTNER: Through 2017, 90% of the information assets from big data analytic efforts will be siloed and un-leveragable across multiple business processes.
Data Virtualization & Analytics
What is Data Analytics? Data analytics (DA) is the science of examining raw data with the purpose of drawing conclusions about that information..
Analytics Classification Future Looking - Data Mining - StatisticalAnalysis - Predicting Future Descriptive After the fact - Dashboards - Reports Analytics How to - Define Symptoms - Prescribe Instructions Predictive Prescriptive Cisco and/or its affiliates. All rights reserved. CiscoPublic 11
What is Data Virtualization? Data virtualization (DV) is any approach to data management that allows an application to retrieve and manipulate data without requiring technical details about the data, such as how it is formatted or where it is physically located
Cisco Data Virtualization Suite Business Intelligence Customer Experience Management Governance, Risk & Compliance Human Capital Management Mergers & Acquisitions Single View of Enterprise Data Supply Chain Management Analytics Development Environment Business Directory Cisco Data Virtualization Suite Runtime Server Environment Management Environment Manager Discovery Studio Cisco Information Server Deployment Manager Monitor Adapters Active Cluster Packaged Apps RDBMS Excel Files Data Warehouse OLAP Cubes Hadoop/Big Data XML Docs Flat Files Web Services XML
Cisco Data Virtualization Solution Overview Cisco Information Server (CIS) is the main software product with options that include: Business Directory Active Cluster Adapters Deployment Manager Licenses include: Production Development Staging Failover/backup Data Virtualization Plan and Build Services Cisco Plan and Build Services for Data Virtualization Training offers are optional components of Plan and Build Services that include: Basic Training for Data Virtualization Admin Training for Data Virtualization Advanced Training for Data Virtualization Cisco Health Check Services for Data Virtualization Cisco Migration Services for Data Virtualization Cisco Data Virtualization Study Data Virtualization Manage Services Cisco Software Application Support with Upgrades (SASU) Cisco Data Virtualization Optimization Service Cisco Mission Critical Support Services Services from Cisco and Cisco Data Virtualization ATP partners help customers accelerate the time to value of Cisco Data Virtualization with quick deployment.
Cisco Data Virtualization Better Business Outcomes, Faster, for Less Business Intelligence/Analytics Square Header Immediate Access Cisco Data Virtualization Rounded Header 5-10x Faster Up to 75% Cost Savings Higher Impact More Agile Less Expensive
Platform Considerations for Big Data & Analytics
Platform for Big Data & Analytics: What is important? Performance Management Deployment Speed Scalability TCO As big data solutions become more critical to dayto-day decision making, high performance and availability will become table stakes Hundreds/thousands of servers, switches will require large numbers of management tasks As big data grows, IT will need to quickly, cost-effectively scale resources Are we able to scale as fast as data? Solution price/performance, operations efficiencies, power consumption, and facilities footprint will become more important PSODCT-2020 2015 Cisco and/or its affiliates. All rights reserved.
Platform Considerations for Big Data & Analytics Analytics Data Movers Visualization Virtualization Data Management (Hadoop, MPP DBs, NoSQL) Infrastructure Compute (2 socket Server) Storage (Internal DAS) Hadoop: RAID 0 + 3 Way replication MPP DB: RADI 5 + 2 Way replication Network (dual 10Gbit) Operating System Infrastructure Management: Provision Manage Monitor Complementary to data tier management Application Management Cisco UCS Integrated Infrastructur e for Big Data PSODCT-2020 2015 Cisco and/or its affiliates. All rights reserved.
Cisco UCS Integrated Infrastructure for Big Data
PSODCT-2020 2015 Cisco and/or its affiliates. All rights reserved.
Integrated Infrastructure Solutions Integrated infrastructure is now an industry standard term. Expected to make up nearly 14% of all IT infrastructure by 2016. (IDC) Cisco is the Leader in Integrated Infrastructure 3 rd generation of Big Data solution is now a Cisco Integrated Infrastructure Initiative By year-end 2015, 35% of total server shipped valuewill be as integratedsystems. By 2015, converged infrastructure will represent 9.5% of the $64Bservices, software, and hardware markets. 1. IDG research PSODCT-2020 2015 Cisco and/or its affiliates. All rights reserved.
Cisco UCS Integrated Infrastructure for Big Data 3 rd generation of Cisco UCS Common PlatformArchitecture Industry leading solution deployed across major industryverticals Areas of focus: Data Management Platforms (Hadoop, NoSQL, MPP Databases) andanalytics Broad ecosystem partnerships with leading ISVs Major Hadoop distributions in GPL UCS Director Express Pre-tested, pre-validated and documented best practice designs optimized for performance and capacity lowering risk and TCO Designed to scale from small to very large as business demands Unified and centralized management with seamless Integration with enterprise applications Easy to {order, deploy,service} BigData Starter High Performance Performance Optimized Capacity Optimized Extreme Capacity PSODCT-2020 2015 Cisco and/or its affiliates. All rights reserved.
Complete Infrastructure for End to End Analytics IOT Tier Mobile App Private Cloud Big Data Cold Data Parking Spot Availability CCTV Edge Processing Application Front End Real-Time Price Optimization Trend Analysis Cassandra Apps Tier Real-Time Analytics EDW UCS Mini on demand provisioning Oracle, SAP, MS UCS B200 M4 Hadoop EMC,NTAP SAP HANA Greenplum UCS C240M4 Recommendation Engine Big Data Compliance Cold Data Product Information (Cached) WebServer Apache, ISS M-series Transaction Processing DB Tier Oracle UCS B460 EMC, NTAP, Invita Fraud Detection Analytics SAS UCS B460 Hadoop UCS C240M4 20-30% better in all aspects but same $/performance as previous generation Hadoop UCS C3160 4 x Compute density, on demand provisioning 1.5X transaction processing power 6x speed up in fraud detection by utilizing large memory Network Plane, Control Plane ACI, End-to-End Isolation Management: UCS Central, Director, Manager, Express PSODCT-2020 2015 Cisco and/or its affiliates. All rights reserved.
Unified Management Programmability, Scalability and Automation End-to-end management software offers speed and enterprise-grade reliability, while simplifying deployment and operations Provisioning Growth Monitoring Maintenance Inventory & Asset Mgmt Fault Detection & SW Updates QoS Policies & Power Capping UCS Manager Provides unified, embedded management of all software and hardware components Policy and model-based management, with service profiles, that improves agility and reduces risk Auto-discovery to detect, inventory, manage, and provision system components A comprehensive open XML API, which facilitates integration with third-party management tools UCS Central Manages multiple, globally distributed Cisco UCS domains with thousands of servers from a single pane Provides global configuration capabilities for pools, policies, and firmware UCS Director Express for Big Data End to end deployment tool for Hadoop UCS Director Unified converged infrastructure management solution Provides programmable application containers across computing, networking, and storage resources and extend automation benefits to the entire infrastructure stack. PSODCT-2020 2015 Cisco and/or its affiliates. All rights reserved.
UCS-D Express for Big Data End to end solution for Hadoop End to end provisioning, installation, and monitoring tool for Hadoop Clusters Better business outcomes with faster time to value from Big Data Provides appliance like experience with out inflexibilities Centralized visibility across Hadoop and physicalinfrastructure Powerful interface for further integration into third party tools and services PSODCT-2020 2015 Cisco and/or its affiliates. All rights reserved.
Cisco Big Data and Analytics Partner Ecosystem Hadoop Data Management Data Integration Analytics / Business Intelligence Massive Parallel Processing NoSQL NoSQL DB PSODCT-2020 2015 Cisco and/or its affiliates. All rights reserved.
Data Analytics Journey Lifecycle Getting started Identify Use Cases Train Your People Setup the Infrastructure Bring Data Together Descriptive Analytics Predictive Analytics Get Help 2012 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 27
Thank you