Analytics and the Context Multiplier
- Byron Evans
- 8 years ago
Transcription
1 The Deal About
2 Analytics and the Context Multiplier Actuarial data Epidemic data Government statistics Patient records Weather history... Location risk Occupational risk Raw Data Feature extraction metadata Travel history Dietary risk Family history... Personal financial situation Domain linkages Chemical exposure Full contextual analytics Social relationships
3 New Era of Cognitive Computing Tabulating Systems Programmable Systems Cognitive Systems
4 IBM Watson IBM Watson is a breakthrough in analytic innovation, but it is only successful because of the quality of the information from which it is working.
5 Watson Engagement Advisor: Customer engagement through cognitive computing. What it does: Automates customer interaction to increase customer engagement in sales and service. Transforms customer engagement by knowing, engaging and empowering clients. Develops customer relationships through a transformative user experience. How it does it: Provides answers, not links and webpages. Answers with evidence, not guesses. Not restricted to a predefined question-answer set. Learns from every interaction.
6 Watson Discovery Advisor: Answer previously unanswerable research problems. Gain Awareness: Harness all available scientific knowledge in the hunt for a breakthrough and identify better leads for any researcher to pursue. Understand Relationships: Enable every scientist to identify new relationships and explore never-before-considered options that lead to real differentiating scientific innovations. Clarify Ideas: Drive insights from scientists who've made recent advances to peers, who can accelerate findings and raise productivity of the entire R&D group. Watson can read these medical records in six seconds!
7 Big Data Definition. Volume: Data at Scale. Variety: Data in Many Forms. Velocity: Data in Motion. Veracity: Data Uncertainty. Big Data is data that can't be stored or analyzed using traditional tools.
8 Without analytics, Big Data is just a bunch of data. MYTH: Big Data is only about large datasets; we should just say larger than what you have. MYTH: Big Data means Hadoop... that's it. MYTH: Big Data means rip-and-replace, death to the RDBMS and no governance. MYTH: NoSQL means no SQL, never, utter hatred for SQL. MYTH: Big Data means unstructured data and only for sentiment.
9 In 2005 there were 1.3 billion RFID tags in circulation around the world; by the end of 2011, this was about 30 billion and growing even faster.
10 An increasingly sensor-enabled and instrumented business environment generates HUGE volumes of data with MACHINE SPEED characteristics 1 BILLION lines of code EACH engine generating 10 TB every 30 minutes!
11 Applications for Big Data Analytics Smarter Healthcare Multi-channel sales Finance Log Analysis Homeland Security Traffic Control Telecom Search Quality Manufacturing Trading Analytics Fraud and Risk Retail: Churn, NBO
12 Automatic Temporal and Spatially Enriched Data
13 Use Cases: Law Enforcement and Security. Video surveillance, wire taps, communications, call records, etc. Millions of messages per second with low density of critical data. Identify patterns and relationships among vast information sources. The US Government has been working with IBM Research since 2003 on a radical new approach to data analysis that enables high-speed, scalable and complex analytics of heterogeneous data streams in motion. The project has been so successful that the US Government will deploy additional installations to enable other agencies to achieve greater success in various future projects.
14 Velocity Creating Actionable Intelligence in Real Time Example Analytics:
15 Volume - The Government Industry Data Challenge IBM Multimedia Analysis & Retrieval Automatic Semantic Classification of Images and Video Content based feature extraction & Search Gigapixel Panorama Photography Think Big!
16 Predictive Analytics in a Neonatal ICU. Real-time analytics and correlations on physiological data streams: blood pressure, temperature, EKG, blood oxygen saturation, etc. Early detection of the onset of potentially life-threatening conditions, up to 24 hours earlier than current medical practices. Early intervention leads to lower patient morbidity and better long-term outcomes. Technology also enables physicians to verify new clinical hypotheses.
17 Why Didn't We Use All of the Big Data Before?
18 Warehouse Modernization Has Two Themes. Traditional Analytics: Structured & Repeatable. Structure built to store data. Start with hypothesis, test against selected data (Hypothesis, Question, Data, Answer). Analyze after landing. Big Data Analytics: Iterative & Exploratory. Data is the structure. Data leads the way: explore all data, identify correlations (Data, Exploration, Correlation, Actionable Insight). Analyze in motion. All Information, Analyzed Information.
19 Analyze all. Traditional approach: analyze small subsets of information (all available information versus analyzed information). Big data approach: analyze all information (all available information analyzed).
20 Analyze as is. Traditional approach: small amount of carefully organized information; carefully cleanse information before any analysis. Big data approach: large amount of messy information; analyze information as is, cleanse as needed.
21 Find correlation. Traditional approach: start with hypothesis and test against selected data (Hypothesis, Question, Data, Answer). Big data approach: explore all data and identify correlations (Data, Exploration, Correlation, Insight).
22 Analyze in motion. Traditional approach: analyze data after it's been processed and landed in a warehouse or mart (Data, Repository, Analysis, Insight). Big data approach: analyze data in motion as it's generated, in real time (Data, Analysis, Insight).
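The "analyze in motion" idea on slide 22 can be illustrated with a small plain-Java sketch. It is not IBM Streams code: the class name `InMotionSketch`, the window size and the sample readings are assumptions made up for this example. Each reading updates a sliding-window mean the moment it arrives, instead of being stored first and analyzed later.

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Hypothetical illustration: a sliding-window mean that is updated per event.
class InMotionSketch {
    private final Deque<Double> window = new ArrayDeque<>();
    private final int size;      // number of most recent readings to keep
    private double sum = 0.0;    // running sum of the readings in the window

    InMotionSketch(int size) {
        this.size = size;
    }

    // Accept one new reading and return the mean of the current window.
    double accept(double reading) {
        window.addLast(reading);
        sum += reading;
        if (window.size() > size) {
            sum -= window.removeFirst();  // evict the oldest reading
        }
        return sum / window.size();
    }

    public static void main(String[] args) {
        InMotionSketch sketch = new InMotionSketch(3);
        double[] readings = {10, 20, 30, 40};  // made-up sample data
        for (double r : readings) {
            // The insight is available immediately, before anything is "landed".
            System.out.println("reading=" + r + " windowMean=" + sketch.accept(r));
        }
    }
}
```

Only the current window is held in memory, so the per-event cost stays constant regardless of how much data has already flowed past.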
23 Complementary Analytics Traditional Approach Structured, analytical, logical New Approach Creative, holistic thought, intuition Internal App Data Mainframe Data Transaction Data OLTP System Data Data Warehouse Structured Repeatable Linear Hadoop and Streams Unstructured Exploratory Dynamic Multimedia Web Logs Social Data Text Data: s Sensor data: images ERP Data RFID Traditional Sources New Sources
24 The NoSQL Revolution. Different requirements require different tools: document stores, key/value stores, BigTable implementations (columnar), graph databases. Values (there are exceptions): huge data volumes, easy scale-out; developers code integrity if it's needed; relaxed (eventual) consistency; semi-structured data; schema on read.
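"Schema on read" from slide 24 can be sketched in plain Java. Everything here is hypothetical and chosen only for illustration: the class `SchemaOnReadSketch`, the `key=value;key=value` record format and the sample records do not come from any particular NoSQL product. Records are stored exactly as written, and structure is applied only when a reader asks for a field.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration of schema on read with a made-up record format.
class SchemaOnReadSketch {
    // Records are kept exactly as they arrive; nothing is validated on write.
    private final List<String> rawStore = new ArrayList<>();

    void write(String rawRecord) {
        rawStore.add(rawRecord);
    }

    // Interpret one raw record as key=value pairs separated by ';'.
    static Map<String, String> parse(String rawRecord) {
        Map<String, String> fields = new LinkedHashMap<>();
        for (String pair : rawRecord.split(";")) {
            int eq = pair.indexOf('=');
            if (eq > 0) {
                fields.put(pair.substring(0, eq).trim(), pair.substring(eq + 1).trim());
            }
        }
        return fields;
    }

    // The "schema" is simply the field the reader asks for;
    // records that lack the field are skipped rather than rejected.
    List<String> read(String field) {
        List<String> values = new ArrayList<>();
        for (String raw : rawStore) {
            String value = parse(raw).get(field);
            if (value != null) {
                values.add(value);
            }
        }
        return values;
    }

    public static void main(String[] args) {
        SchemaOnReadSketch store = new SchemaOnReadSketch();
        store.write("name=Widget;price=9.99");
        store.write("name=Gadget;colour=red");    // different shape, still accepted
        System.out.println(store.read("name"));   // [Widget, Gadget]
        System.out.println(store.read("price"));  // [9.99]
    }
}
```

The trade-off is the one the slide hints at: writes are cheap and flexible, while integrity checks move into the code that reads the data.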
25 Why NoSQL? Pressures on Traditional Relational Stores Budgetary constraints Technical change/ Different forms of data Regulatory pressures (SLAs, Archive, Governance)
26 Database Landscape Overview
Relational SQL (RDBMS). Description: Operational and analytic. E.g. DB2, Oracle, Microsoft, Teradata, etc. Typical infrastructure: Proprietary database storage; Unix, Linux, Windows; SMP, MPP, SAN, integrated systems, appliances. Primary driver: Traditional I/T, ACID.
NoSQL database. Description: Mainly operational. E.g. Cloudant, MongoDB, Redis, Riak, Aerospike, Amazon DynamoDB, etc. Typical infrastructure: Proprietary database storage; Linux; commodity clusters; local attach disks, NAS; cloud; mobile. Primary driver: Developer agility, scalability, workload, cost.
Hadoop. Description: SQL on Hadoop (mainly analytic); HBase (evolving OLTP, ACID). E.g. BigInsights, Cloudera, Hortonworks, MapR, Pivotal; HP Labs Trafodion. Typical infrastructure: HDFS files; Linux; commodity clusters; local attach disks. Primary driver: Lower cost; all types of data.
27 Different Categories of NoSQL Databases
Document (63% revenue share*). Use this when: schema is not well defined; schema is very likely to change, need to maintain flexibility; commonly described with JSON. Application examples: frequently changing product catalogs. Vendors: Cloudant**, MongoDB, Couchbase, MarkLogic.
Key-Value (24% revenue share*). Use this when: your data is not highly related; all you need is basic Create, Read, Update, Delete (CRUD); rapid scaling for simple data collections. Application examples: user sessions, shopping cart. Vendors: Redis, Aerospike, AWS (DynamoDB), Basho Technologies (Riak).
BigTable/Columnar (9% revenue share*). Use this when: high volume, low latency write; big data, sparse data; need compression or versioning. Application examples: telco, heavy ingest, petabyte scale; user activity logs; sensor data. Vendors: HBase (Hadoop)**, BigTable, Cassandra.
Graph DB (4% revenue share*). Use this when: your data looks like a graph; have highly interconnected data, need to trace relationships. Application examples: website purchase recommendations, social network processing. Vendors: Titan**, Neo Technology (Neo4J).
* Source: IBM study 2013, estimated by splitting total NoSQL revenue ($288m) by ratio of top 10 vendors' reported 2013 revenue. Total 2013 NoSQL database revenue estimated $343m.
** IBM Solutions of Choice.
28 Hadoop Open-source software framework from Apache Inspired by Google MapReduce GFS (Google File System) HDFS Map/Reduce
29 Hadoop Explained. Hadoop computation model: Data stored in a distributed file system spanning many inexpensive computers. Bring function to the data: distribute application to the compute resources where the data is stored. Scalable to thousands of nodes and petabytes of data. MapReduce Application: distribute map tasks to cluster (Hadoop Data Nodes). 1. Map Phase (break job into small parts) 2. Shuffle (transfer interim output for final processing) 3. Reduce Phase (boil all output down to a single result set). Return a single result set.
    public static class TokenizerMapper extends Mapper<Object,Text,Text,IntWritable> {
        private final static IntWritable one = new IntWritable(1);
        private Text word = new Text();
        // Emit (word, 1) for every token in the input line.
        public void map(Object key, Text val, Context context) throws IOException, InterruptedException {
            StringTokenizer itr = new StringTokenizer(val.toString());
            while (itr.hasMoreTokens()) {
                word.set(itr.nextToken());
                context.write(word, one);
            }
        }
    }
    public static class IntSumReducer extends Reducer<Text,IntWritable,Text,IntWritable> {
        private IntWritable result = new IntWritable();
        // Sum the counts emitted for one word.
        public void reduce(Text key, Iterable<IntWritable> val, Context context) {
            int sum = 0;
            for (IntWritable v : val) {
                sum += v.get();
            ...
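The three phases named on slide 29 (map, shuffle, reduce) can be walked through without a cluster. The sketch below is a single-process imitation using only the Java standard library; it does not use the Hadoop API, and the class name `MapReduceSketch`, its method names and the sample input are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.StringTokenizer;
import java.util.TreeMap;

// Hypothetical single-process imitation of the map, shuffle and reduce phases.
class MapReduceSketch {
    // Map phase: emit a (word, 1) pair for every token in one input line.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        StringTokenizer itr = new StringTokenizer(line);
        while (itr.hasMoreTokens()) {
            pairs.add(Map.entry(itr.nextToken(), 1));
        }
        return pairs;
    }

    // Shuffle: group the interim values by key (sorted by key via TreeMap).
    static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> p : pairs) {
            grouped.computeIfAbsent(p.getKey(), k -> new ArrayList<>()).add(p.getValue());
        }
        return grouped;
    }

    // Reduce phase: boil the values for one key down to a single sum.
    static int reduce(List<Integer> values) {
        int sum = 0;
        for (int v : values) {
            sum += v;
        }
        return sum;
    }

    // Run all three phases in sequence and return a single result set.
    static Map<String, Integer> wordCount(List<String> lines) {
        List<Map.Entry<String, Integer>> interim = new ArrayList<>();
        for (String line : lines) {
            interim.addAll(map(line));  // on a cluster, each line could be mapped on a different node
        }
        Map<String, Integer> result = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> e : shuffle(interim).entrySet()) {
            result.put(e.getKey(), reduce(e.getValue()));
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> lines = List.of("big data is data", "data in motion");
        System.out.println(wordCount(lines));  // {big=1, data=3, in=1, is=1, motion=1}
    }
}
```

Because `map` looks at one line at a time and `reduce` at one key at a time, both steps could in principle run in parallel on separate machines, which is the property the slide's "bring function to the data" point relies on.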
30 Big Data Enterprise platform Visualization & Discovery Applications & Development Administration Integration Big SQL JDBC BigSheets Dashboard & Visualization Apps Workflow Text Analytics Pig & Jaql MapReduce Hive Admin Console Monitoring Netezza DB2 Advanced Analytic Engines Adaptive Algorithms Text Processing Engine & Extractor Library) R Streams DataStage Workload Optimization Integrated Installer ZooKeeper Enhanced Security Oozie Splittable Text Compression Jaql Adaptive MapReduce Flexible Scheduler High Availability Guardium Platform Computing Lucene Pig H Catalog Index Cognos Runtime MapReduce Management Security Flume Data Store HBase Hive Audit & History Sqoop File System HDFS GPFS Lineage Open Source IBM
31 Future: The SQL interface.... Rich SQL query capabilities SQL '92 and 2011 features Correlated subqueries Windowed aggregates SQL access to all data stored in InfoSphere BigInsights Robust JDBC/ODBC support Take advantage of key features of each data source Application SQL Language JDBC / ODBC Driver JDBC / ODBC Server SQL interface Engine Leverage MapReduce parallelism OR achieving low-latency Data Sources HiveTables HBase tables CSV Files InfoSphere BigInsights
32 Spreadsheet-style Analysis Web-based analysis and visualization Spreadsheet-like interface Define and manage long running data collection jobs Analyze content of the text on the pages that have been retrieved
33 BigInsights: Text Analytics, Statistical Analysis (R module), Machine learning (SystemML), Ad-Hoc analysis (BigSheets), (Integration) DB2, Netezza, Streams. JAQL: IBM's programming language in the Hadoop world. Jaql is a complete solutions environment supporting all other BigInsights components. Integration point for various analytics: text analytics, statistical analysis, machine learning, ad-hoc analysis. Integration point for various data sources: local and distributed file systems, NoSQL databases, content repositories, relational sources (warehouses, operational databases). Jaql I/O, Jaql Core Operators, Jaql Modules; DFS, NoSQL, RDBMS, File System.
34 Data In Motion and At Rest: Complementary. At Rest: Warehouse/Hadoop, scalable processing of huge data stores. In Motion: Streams, scalable low-latency processing of stream data. [Chart: unit of analysis (B, KB, MB, GB, 10GB, 100GB, 1TB, 10TB, 100TB, 1PB) against latency (ms, sec, min, hr, day, wk, mo, yr), with Low/Med/High capability on each axis and sweet spots marked for Streams and for Warehouse/Hadoop.]
35 Streams Analyzes All Kinds of Data. Mining in Microseconds (included with Streams). ***New*** Simple & Advanced Text (included with Streams), e.g. (listen, verb), (radio, noun). Predictive (IBM Research). Geospatial (IBM Research). Acoustic (IBM Research) (Open Source). Image & Video (Open Source). Advanced Mathematical Models (IBM Research). Statistics (included with Streams).
36 How Streams Works: Continuous ingestion, Continuous analysis. IBM Corporation
37 How Streams Works. Continuous ingestion, continuous analysis: Filter / Sample, Transform, Annotate, Correlate, Classify. Infrastructure provides services for scheduling analytics across hardware hosts and establishing streaming connectivity. Achieve scale by partitioning applications into software components and by distributing across stream-connected hardware hosts. Where appropriate, elements can be fused together for lower communication latency.
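The operator chain on slide 37 (filter, transform, classify, with optional fusing) can be approximated in plain Java. This is only an analogy and not the IBM Streams programming model: the class `OperatorPipelineSketch`, the `Reading` tuple, the watt-hour conversion and the 5.0 kWh threshold are all invented for the example. Fusing is imitated by composing two functions so that no hand-off happens between them.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;
import java.util.function.Predicate;

// Hypothetical illustration of small operators chained into a pipeline.
class OperatorPipelineSketch {
    // One tuple flowing through the pipeline (a made-up meter reading).
    static final class Reading {
        final String meter;
        final double value;
        final String label;

        Reading(String meter, double value, String label) {
            this.meter = meter;
            this.value = value;
            this.label = label;
        }

        @Override
        public String toString() {
            return meter + ":" + value + ":" + label;
        }
    }

    // Filter: drop obviously invalid (negative) readings.
    static final Predicate<Reading> FILTER = r -> r.value >= 0;

    // Transform: convert watt-hours to kilowatt-hours.
    static final Function<Reading, Reading> TRANSFORM =
            r -> new Reading(r.meter, r.value / 1000.0, r.label);

    // Classify: attach a label based on an assumed threshold of 5.0 kWh.
    static final Function<Reading, Reading> CLASSIFY =
            r -> new Reading(r.meter, r.value, r.value > 5.0 ? "high" : "normal");

    // "Fused" operators: composed into one function, so there is no hand-off between them.
    static final Function<Reading, Reading> FUSED = TRANSFORM.andThen(CLASSIFY);

    // Process tuples one at a time, in arrival order.
    static List<Reading> run(List<Reading> input) {
        List<Reading> out = new ArrayList<>();
        for (Reading r : input) {
            if (FILTER.test(r)) {
                out.add(FUSED.apply(r));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<Reading> input = List.of(
                new Reading("m1", 7500, ""),
                new Reading("m2", -1, ""),     // removed by the filter
                new Reading("m3", 1200, ""));
        System.out.println(run(input));
    }
}
```

Keeping each operator as a separate small function is what makes it possible to place them on different hosts or, as here, compose them when the communication cost is not worth paying.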
38 Streams Runtime Supports Placement Criteria. Host pools can force operators to be on hosts with solidDB installed. Operator placement constraints allow for co-location, ex-location, and isolation of operators. solidDB could be wrapped as a custom operator for dynamic deployment and relocation. [Diagram labels: Meters Meters Company Filter Usage Model Temp Action Usage Contract Text Extract Season Adjust Daily Adjust Text Extract Degree History Compare History Store History, spread across five x86 hosts.]
39 Data Warehouse Augmentation: Value & Diagram Pre-Processing Hub Query-able Archive Exploratory Analysis Streams Real-time processing Data Explorer BigInsights Landing zone for all data BigInsights Information Integration Data Explorer Find and view the data Can combine with unstructured information BigInsights Streams Offload analytics for microsecond latency Data Warehouse Data Warehouse Data Warehouse
40 Individual Silos Can Answer Typical Questions, One-by-One. Who is this customer? What products can I upsell this customer? What products has she purchased? What impact will inventory have on her? What issues has this customer had in the past? What marketing materials should I send? What is her view of our company? What should I know before calling her for renewal? Where else has she worked? What's going on with this customer TODAY? What is available inventory? How can we increase engagement with her? How is her company using our products? How can we get more customers like her? Who is best able to help this customer? Silos: CRM, DBMS, Support Ticketing, Social Media, External Sources, Supply Chain, Content Mgt., Experts, Fulfillment, Wiki. BUT! An enhanced 360º view provides answers in one application. Fusion of data from multiple systems enables deeper insights, not just facts.
41 Customer search: Janet Robertson. Transaction history. Customer's Products. Salesforce Customer info. SAP Systems. Microsoft Dynamics. SharePoint. Unstructured internal information related to customer. Indexed 3rd-party information related to customer.
42 IBM Cloud Offering for Analysts: Watson Analytics Natural language dialogue Data access and refinement Integrated social business Intelligent automation Report and dashboard creation Visual storytelling Guided data discovery Unified analytics experience 100% cloud based Mobile ready
43 The IBM Big Data Platform Hadoop-based low latency analytics for variety and volume Hadoop Information Integration Stream Computing High volume data integration and transformation Low Latency Analytics for streaming data MPP Data Warehouse Large volume structured data analytics Queryable Archive Structured Data BI+Ad Hoc Analytics on Structured Data Operational Analytics on Structured Data Time-structured analytics
44 Data Refineries Some water can be consumed raw Water is treated at source Heated Ready for consumption Softened Charcoal Filter Pumped into Landing Zone Reverse Osmosis
45 Data Reservoir: Refinery Services Information Governance Catalog Metadata for Data Sets Stored in Reservoir Repositories Integration Load Trickle feed Operational Systems Transactional DB Document Storage Landed Raw Data Landing, Exploration, Archive Discovery Sandbox Transformation Staging Trusted Data, Warehousing Deep Analytics, Modeling Analytic Appliance Reporting, Interactive Analysis Security Masking Test data generation NoSQL Doc Store Hadoop Mixed Workload RDBMS Data Mart Data Quality Cleansing Standardization Matching Reference data generation Data Reservoir Repositories (Zones) IBM DataWorks Data Lifecycle Archiving
46 Information Management Zones Data Types Real-Time Analytical Processing Actionable Insight Machine and Sensor Data Operational Systems Landing, Exploration, Archive Trusted Data, Warehousing Deep Analytics, Modeling Decision Management Image and Video Enterprise Content Transaction and Application Data Transactional DB Document Storage Landed Raw Data Discovery Sandbox Transformation Staging Analytic Appliance Reporting, Interactive Analysis Predictive Analytics, Modeling Reporting, Analysis Social Data Third-Party Data NoSQL Doc Store Hadoop Mixed Workload RDBMS Data Mart Governance and Lifecycle Management Fabric Integration Matching Masking Lineage Security Privacy Glossary Discovery, Exploration Mainframe, Power8, Intel, Cloud (Managed/Hosted), Bluemix Services
47 Emerging Big Data Implementation Pattern Ingestion and Real-time Analytic Zone Ingest Filter, Transform Analytics and Reporting Zone Correlate, Classify Warehousing Zone Query Engines Cubes Data Sinks Connectors Extract, Annotate Landing and Analytics Sandbox Zone Enterprise Warehouse Descriptive, Predictive Models Analytics MapReduce Hive/HBase Col Stores Indexes, facets Data Marts Widgets Discovery, Visualizer Search Ingest Documents In Variety of Formats Models Metadata and Governance Zone Repository, Workbench
48 IBM InfoSphere BigInsights Enterprise Edition Visualization & Discovery Applications & Development Administration Integration Big SQL JDBC BigSheets Dashboard & Visualization Apps Workflow Text Analytics Pig & Jaql MapReduce Hive Admin Console Monitoring Netezza DB2 Advanced Analytic Engines Adaptive Algorithms Text Processing Engine & Extractor Library) R Streams DataStage Workload Optimization Integrated Installer ZooKeeper Enhanced Security Oozie Splittable Text Compression Jaql Adaptive MapReduce Flexible Scheduler High Availability Guardium Platform Computing Lucene Pig H Catalog Index Cognos Runtime MapReduce Management Security Flume Data Store HBase Hive Audit & History Sqoop File System HDFS GPFS Lineage Open Source IBM
49 Integration Integration Integration CaixaBank Big Data Reference Architecture CaixaBank Electronic Journal (structured) CaixaBank at rest / in motion (unstructured) Text Analytics Predictive Model Integration Streams (Data in Motion) Real Time Event Detection Big Data (Data At Rest) Offers Creation and Management System Marketing unstructured data Text Analytics Deep Analytics Pattern Detection Matching System External Social Media (unstructured) structured data Integration Multichannel Notification System CaixaBank operational system (structured) Datawarehouse Customers Profiles
50 Integration Integration Integration CaixaBank Big Data Reference Architecture CaixaBank Electronic Journal (structured) CaixaBank at rest / in motion (unstructured) Text Analytics Predictive Model Integration Streams (Data in Motion) Real Time Event Detection Big Data (Data At Rest) Offers Creation and Management System Marketing unstructured data Text Analytics Deep Analytics Pattern Detection Matching System External Social Media (unstructured) CaixaBank operational system (structured) structured data Datawarehouse Integration Deep Analytics (Research, Existing, Third-party) Behavior Analysis Data linkage Customers Profiles Location Based Analysis Multichannel Sentiment Analysis Notification System Concept Labeling & Classification Intent Analysis Influence Analysis Topic Detection
51 IBM Cloud Offering for Developers: Bluemix
52 Why are Developers Using Bluemix? To rapidly bring products and services to market at lower cost Go from zero to running code in a matter of minutes. To continuously deliver new functionality to their applications Automate the development and delivery of many applications. To extend existing investments in IT infrastructure Extend existing investments by connecting securely to on-premise infrastructure.
53 Cloudant: Database as a Service (Documents) Infrastructure Services Mobile Database as-a-service Systems of Engagement Social Internet of Things Embedded Systems Analytics Systems of Record SQLDB Relational DB
54 SQLDB: Database as a Service (Relational)
55 dashDB: Data Warehouse as a Service. Build More: Deploy in hours with rapid cloud provisioning. No infrastructure investment for cloud agility. Grow More: Load and Go with no tuning required. Columnar optimized for analytic workloads. Memory optimized takes analytics beyond in-memory. Know More: In-Database analytics built in. R Integration for predictive modeling. Partner Ecosystem for analytics. IBM Watson Analytics ready. [Diagram labels: Netezza Analytics, Cloud, dashDB, BLU Acceleration, 3rd Party DW.]
56 Enterprise Hadoop as a Service (EHaaS)
57 THINK
More informationAn Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise
An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise Solutions Group The following is intended to outline our
More informationForecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014
Forecast of Big Data Trends Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014 Big Data transforms Business 2 Data created every minute Source http://mashable.com/2012/06/22/data-created-every-minute/
More informationA holistic approach to Big Data
A holistic approach to Big Data Raul F. Chong Senior Big Data and Cloud Program Manager Big Data University Community Leader rfchong@ca.ibm.com 2013 BigDataUniversity.com Agenda The state of Big Data adoption
More informationBig Data and Data Science: Behind the Buzz Words
Big Data and Data Science: Behind the Buzz Words Peggy Brinkmann, FCAS, MAAA Actuary Milliman, Inc. April 1, 2014 Contents Big data: from hype to value Deconstructing data science Managing big data Analyzing
More informationApplications for Big Data Analytics
Smarter Healthcare Applications for Big Data Analytics Multi-channel sales Finance Log Analysis Homeland Security Traffic Control Telecom Search Quality Manufacturing Trading Analytics Fraud and Risk Retail:
More informationDeploying Big Data to the Cloud: Roadmap for Success
Deploying Big Data to the Cloud: Roadmap for Success James Kobielus Chair, CSCC Big Data in the Cloud Working Group IBM Big Data Evangelist. IBM Data Magazine, Editor-in- Chief. IBM Senior Program Director,
More informationNoSQL for SQL Professionals William McKnight
NoSQL for SQL Professionals William McKnight Session Code BD03 About your Speaker, William McKnight President, McKnight Consulting Group Frequent keynote speaker and trainer internationally Consulted to
More informationWHITE PAPER. Four Key Pillars To A Big Data Management Solution
WHITE PAPER Four Key Pillars To A Big Data Management Solution EXECUTIVE SUMMARY... 4 1. Big Data: a Big Term... 4 EVOLVING BIG DATA USE CASES... 7 Recommendation Engines... 7 Marketing Campaign Analysis...
More informationTap into Hadoop and Other No SQL Sources
Tap into Hadoop and Other No SQL Sources Presented by: Trishla Maru What is Big Data really? The Three Vs of Big Data According to Gartner Volume Volume Orders of magnitude bigger than conventional data
More informationThis Symposium brought to you by www.ttcus.com
This Symposium brought to you by www.ttcus.com Linkedin/Group: Technology Training Corporation @Techtrain Technology Training Corporation www.ttcus.com Big Data Analytics as a Service (BDAaaS) Big Data
More informationWelcome to The Future of Analytics In Action. 2015 IBM Corporation
Welcome to The Future of Analytics In Action Goals for Today Share the cloud-based data management and analytics technologies that are enabling rapid development of new mobile applications Discuss examples
More informationBIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata
BIG DATA: FROM HYPE TO REALITY Leandro Ruiz Presales Partner for C&LA Teradata Evolution in The Use of Information Action s ACTIVATING MAKE it happen! Insights OPERATIONALIZING WHAT IS happening now? PREDICTING
More informationGetting Started Practical Input For Your Roadmap
Getting Started Practical Input For Your Roadmap Mike Ferguson Managing Director, Intelligent Business Strategies BA4ALL Big Data & Analytics Insight Conference Stockholm, May 2015 About Mike Ferguson
More informationRaul F. Chong Senior program manager Big data, DB2, and Cloud IM Cloud Computing Center of Competence - IBM Toronto Lab, Canada
What is big data? Raul F. Chong Senior program manager Big data, DB2, and Cloud IM Cloud Computing Center of Competence - IBM Toronto Lab, Canada 1 2011 IBM Corporation Agenda The world is changing What
More informationTransforming Government with Big Data and Analytics
Transforming Government with Big Data and Analytics Deepak Mohapatra Sr. Consultant IBM Software Group dmohapatra@us.ibm.com April 29 th 2014 1 Big Data Creates A Challenge And an Opportunity Yet requires
More informationIndustry Impact of Big Data in the Cloud: An IBM Perspective
Industry Impact of Big Data in the Cloud: An IBM Perspective Inhi Cho Suh IBM Software Group, Information Management Vice President, Product Management and Strategy email: inhicho@us.ibm.com twitter: @inhicho
More informationBig Data and Advanced Analytics Applications and Capabilities Steven Hagan, Vice President, Server Technologies
Big Data and Advanced Analytics Applications and Capabilities Steven Hagan, Vice President, Server Technologies 1 Copyright 2011, Oracle and/or its affiliates. All rights Big Data, Advanced Analytics:
More informationGAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION
GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION Syed Rasheed Solution Manager Red Hat Corp. Kenny Peeples Technical Manager Red Hat Corp. Kimberly Palko Product Manager Red Hat Corp.
More informationHadoop and ecosystem * 本 文 中 的 言 论 仅 代 表 作 者 个 人 观 点 * 本 文 中 的 一 些 图 例 来 自 于 互 联 网. Information Management. Information Management IBM CDL Lab
IBM CDL Lab Hadoop and ecosystem * 本 文 中 的 言 论 仅 代 表 作 者 个 人 观 点 * 本 文 中 的 一 些 图 例 来 自 于 互 联 网 Information Management 2012 IBM Corporation Agenda Hadoop 技 术 Hadoop 概 述 Hadoop 1.x Hadoop 2.x Hadoop 生 态
More informationSmarter Analytics Leadership Summit Big Data. Real Solutions. Big Results.
Smarter Analytics Leadership Summit Big Data. Real Solutions. Big Results. 5 Game Changing Use Cases for Big Data Inhi Cho Suh Vice President Product Management & Strategy Information Management IBM Software
More information5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014
5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for
More informationHow to Leverage Big Data in the Cloud to Gain Competitive Advantage
How to Leverage Big Data in the Cloud to Gain Competitive Advantage James Kobielus, IBM Big Data Evangelist Editor-in-Chief, IBM Data Magazine Senior Program Director, Product Marketing, Big Data Analytics
More informationBIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES
BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES Relational vs. Non-Relational Architecture Relational Non-Relational Rational Predictable Traditional Agile Flexible Modern 2 Agenda Big Data
More informationData Refinery with Big Data Aspects
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 655-662 International Research Publications House http://www. irphouse.com /ijict.htm Data
More informationBig Data Management and Security
Big Data Management and Security Audit Concerns and Business Risks Tami Frankenfield Sr. Director, Analytics and Enterprise Data Mercury Insurance What is Big Data? Velocity + Volume + Variety = Value
More informationTE's Analytics on Hadoop and SAP HANA Using SAP Vora
TE's Analytics on Hadoop and SAP HANA Using SAP Vora Naveen Narra Senior Manager TE Connectivity Santha Kumar Rajendran Enterprise Data Architect TE Balaji Krishna - Director, SAP HANA Product Mgmt. -
More informationHadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard
Hadoop and Relational base The Best of Both Worlds for Analytics Greg Battas Hewlett Packard The Evolution of Analytics Mainframe EDW Proprietary MPP Unix SMP MPP Appliance Hadoop? Questions Is Hadoop
More informationBuilding Confidence in Big Data Innovations in Information Integration & Governance for Big Data
Building Confidence in Big Data Innovations in Information Integration & Governance for Big Data IBM Software Group Important Disclaimer THE INFORMATION CONTAINED IN THIS PRESENTATION IS PROVIDED FOR INFORMATIONAL
More informationA New Era Of Analytic
Penang egovernment Seminar 2014 A New Era Of Analytic Megat Anuar Idris Head, Project Delivery, Business Analytics & Big Data Agenda Overview of Big Data Case Studies on Big Data Big Data Technology Readiness
More informationBig Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.
Big Data Analytics 1 Priority Discussion Topics What are the most compelling business drivers behind big data analytics? Do you have or expect to have data scientists on your staff, and what will be their
More informationHadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012. Viswa Sharma Solutions Architect Tata Consultancy Services
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012 Viswa Sharma Solutions Architect Tata Consultancy Services 1 Agenda What is Hadoop Why Hadoop? The Net Generation is here Sizing the
More informationModernizing Your Data Warehouse for Hadoop
Modernizing Your Data Warehouse for Hadoop Big data. Small data. All data. Audie Wright, DW & Big Data Specialist Audie.Wright@Microsoft.com O 425-538-0044, C 303-324-2860 Unlock Insights on Any Data Taking
More informationThe Future of Business Analytics is Now! 2013 IBM Corporation
The Future of Business Analytics is Now! 1 The pressures on organizations are at a point where analytics has evolved from a business initiative to a BUSINESS IMPERATIVE More organization are using analytics
More informationIntroduction to Big Data! with Apache Spark" UC#BERKELEY#
Introduction to Big Data! with Apache Spark" UC#BERKELEY# So What is Data Science?" Doing Data Science" Data Preparation" Roles" This Lecture" What is Data Science?" Data Science aims to derive knowledge!
More informationBringing the Power of SAS to Hadoop. White Paper
White Paper Bringing the Power of SAS to Hadoop Combine SAS World-Class Analytic Strength with Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities Contents Introduction... 1 What
More information#mstrworld. Tapping into Hadoop and NoSQL Data Sources in MicroStrategy. Presented by: Trishla Maru. #mstrworld
Tapping into Hadoop and NoSQL Data Sources in MicroStrategy Presented by: Trishla Maru Agenda Big Data Overview All About Hadoop What is Hadoop? How does MicroStrategy connects to Hadoop? Customer Case
More informationHadoop Ecosystem B Y R A H I M A.
Hadoop Ecosystem B Y R A H I M A. History of Hadoop Hadoop was created by Doug Cutting, the creator of Apache Lucene, the widely used text search library. Hadoop has its origins in Apache Nutch, an open
More informationOracle Big Data SQL Technical Update
Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical
More informationUpcoming Announcements
Enterprise Hadoop Enterprise Hadoop Jeff Markham Technical Director, APAC jmarkham@hortonworks.com Page 1 Upcoming Announcements April 2 Hortonworks Platform 2.1 A continued focus on innovation within
More informationExecutive Summary... 2 Introduction... 3. Defining Big Data... 3. The Importance of Big Data... 4 Building a Big Data Platform...
Executive Summary... 2 Introduction... 3 Defining Big Data... 3 The Importance of Big Data... 4 Building a Big Data Platform... 5 Infrastructure Requirements... 5 Solution Spectrum... 6 Oracle s Big Data
More informationIntegrating Hadoop. Into Business Intelligence & Data Warehousing. Philip Russom TDWI Research Director for Data Management, April 9 2013
Integrating Hadoop Into Business Intelligence & Data Warehousing Philip Russom TDWI Research Director for Data Management, April 9 2013 TDWI would like to thank the following companies for sponsoring the
More informationBig Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum
Big Data Analytics with EMC Greenplum and Hadoop Big Data Analytics with EMC Greenplum and Hadoop Ofir Manor Pre Sales Technical Architect EMC Greenplum 1 Big Data and the Data Warehouse Potential All
More informationBIG DATA TECHNOLOGY. Hadoop Ecosystem
BIG DATA TECHNOLOGY Hadoop Ecosystem Agenda Background What is Big Data Solution Objective Introduction to Hadoop Hadoop Ecosystem Hybrid EDW Model Predictive Analysis using Hadoop Conclusion What is Big
More informationAre You Ready for Big Data?
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics February 11, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
More informationBig Data Strategies with IMS
Big Data Strategies with IMS #16103 Richard Tran IMS Development richtran@us.ibm.com Insert Custom Session QR if Desired. Agenda Big Data in an Information Driven economy Why start with System z IMS strategies
More informationHortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc. 2011 2014. All Rights Reserved
Hortonworks & SAS Analytics everywhere. Page 1 A change in focus. A shift in Advertising From mass branding A shift in Financial Services From Educated Investing A shift in Healthcare From mass treatment
More informationUsing Big Data for Smarter Decision Making. Colin White, BI Research July 2011 Sponsored by IBM
Using Big Data for Smarter Decision Making Colin White, BI Research July 2011 Sponsored by IBM USING BIG DATA FOR SMARTER DECISION MAKING To increase competitiveness, 83% of CIOs have visionary plans that
More informationHow To Use Big Data For Business
Big Data Maturity - The Photo and The Movie Mike Ferguson Managing Director, Intelligent Business Strategies BA4ALL Big Data & Analytics Insight Conference Stockholm, May 2015 About Mike Ferguson Mike
More informationHow Cisco IT Built Big Data Platform to Transform Data Management
Cisco IT Case Study August 2013 Big Data Analytics How Cisco IT Built Big Data Platform to Transform Data Management EXECUTIVE SUMMARY CHALLENGE Unlock the business value of large data sets, including
More informationBig Data overview. Livio Ventura. SICS Software week, Sept 23-25 Cloud and Big Data Day
Big Data overview SICS Software week, Sept 23-25 Cloud and Big Data Day Livio Ventura Big Data European Industry Leader for Telco, Energy and Utilities and Digital Media Agenda some data on Data Big Data
More informationIntroduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data
Introduction to Hadoop HDFS and Ecosystems ANSHUL MITTAL Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data Topics The goal of this presentation is to give
More informationBig Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies
Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies Big Data: Global Digital Data Growth Growing leaps and bounds by 40+% Year over Year! 2009 =.8 Zetabytes =.08
More informationBIG Data Analytics Move to Competitive Advantage
BIG Data Analytics Move to Competitive Advantage where is technology heading today Standardization Open Source Automation Scalability Cloud Computing Mobility Smartphones/ tablets Internet of Things Wireless
More information