Hadoop and Data Warehouse Friends, Enemies or Profiteers? What about Real Time?
|
|
|
- Calvin Hood
- 9 years ago
- Views:
Transcription
1 Hadoop and Data Warehouse Friends, Enemies or Profiteers? What about Real Time? Kai
2 Disclaimer! These opinions are my own and do not necessarily represent my employer
3 Key Messages Big Data is not just Hadoop, concentrate on Business Value! A good Big Data Architecture combines DWH, Hadoop and Real Time! The Integration Layer is getting even more important in the Big Data Era!
4 Agenda Terminology Data Warehouse and Business Intelligence Big Data Processing with Hadoop Fast Data Processing in Real Time
5 Agenda Terminology Data Warehouse and Business Intelligence Big Data Processing with Hadoop Fast Data Processing in Real Time
6 Big Data Architecture Big Data Architecture Hadoop DWH / BI Real Time
7 DWH means analyzing structured data
8 Big Data means analyzing everything Store everything Even without structure Use whatever you need (now or later)
9 What is Big Data? The combined Vs of Big Data Volume (terabytes, petabytes) X Velocity (realtime) Value Variety (social networks, blog posts, logs, sensors, etc.)
10 Real Time Wikipedia Definition: Real time programs must guarantee response within strict time constraints, often referred to as "deadlines. Real time responses are often understood to be in the order of milliseconds, and sometimes microseconds. The term "near real time refers to the time delay introduced, by automated data processing or network transmission. The distinction between the terms "near real time" and "real time" is somewhat nebulous and must be defined for the situation at hand. Hereby, for this talk, I define: Real time == response in nanoseconds microseconds milliseconds Near real time == (response time > one second)
11 Agenda Terminology Data Warehouse and Business Intelligence Big Data Processing with Hadoop Fast Data Processing in Real Time
12 Big Data Architecture Big Data Architecture Hadoop DWH / BI Real Time
13 DWH vs. BI Data Warehouse (DWH) Storage Business Intelligence (BI) Analytics Both terms are often used as synonym, i.e. when someone talks about a DWH, this might include analytics BI can be used without a DWH
14 Typical DWH Process A DWH is Business Case driven : Reporting Dashboards Drill Down Analytics Different DWH Options: Enterprise DWH ( == EDW) Department / Project DWH Embedded BI (into Applications)
15 BI == Reporting + Statistics + Data Discovery DWH BI
16 BI Visualization
17 Products DWH SQL: e.g. MySQL MPP: e.g. Teradata, EMC Greenplum, IBM Netezza Scale very well (almost linear), very high performance, hardware / software costs also increase a lot BI Microsoft Excel BI Tools: e.g. TIBCO Spotfire, Tableau, MicroStrategy Hint: Good BI tools allow data discovery / visualization using different sources, not just DWH are easy to use
18 BI Tool Example: TIBCO Spotfire
19 DWH - Real World Use Case
20 Embedded BI - Real World Use Case
21 Problems of a DWH No flexibility / agility Just structured data Just some (maybe aggregated) history data Just good for already known business cases Low speed ETL is batch, usually takes hours or sometimes even days No proactive reactions possible too late architecture High costs (per GB) Just selected data Too old data is often outsourced to archives
22 DWH vs. Big Data
23 Agenda Terminology Data Warehouse and Business Intelligence Big Data Processing with Hadoop Fast Data Processing in Real Time
24 Big Data Architecture Big Data Architecture Hadoop DWH / BI Real Time
25 Why no longer DWH, but Hadoop? Hadoop was built to solve problems of RDBMS and DWH Benefits of Hadoop: Store and analyze all data all data == not just selected (maybe aggregated) data all data == structured + semi-structured + unstructured be more flexible, adapt to changing business cases Better performance (massively parallel) Ad hoc data discovery also for big data volumes Save money (commodity hardware, open source software)
26 What is Hadoop? Apache Hadoop, an open-source software library, is a framework that allows for the distributed processing of large data sets across clusters of commodity hardware using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.
27 MapReduce Simple example: Input: (very large) text files with lists of strings, such as: 318, N We are interested just in some content: year and temperate (marked in red) The Map Reduce function has to compute the maximum temperature for every year
28 Hadoop Products few Apache Hadoop MapReduce HDFS Ecosystem many Features included
29 Hadoop Ecosystem
30 Hadoop Products few Apache Hadoop MapReduce HDFS Ecosystem + Hadoop Distribution Packaging Deployment-Tooling Support many Features included
31 Hadoop Distributions EMR ( more available)
32 Hadoop Products Apache Hadoop Hadoop Distribution Big Data Suite few MapReduce HDFS Ecosystem + Packaging Deployment-Tooling Support + many Tooling / Modeling Code Generation Scheduling Integration Features included
33 Big Data Integration Suite: TIBCO BusinessWorks
34 Hadoop Real World Use Case: Replace ETL to improve Performance The advantage of their new system is that they can now look at their data [from their log processing system] in anyway they want: Nightly MapReduce jobs collect statistics about their mail system such as spam counts by domain, bytes transferred and number of logins. Benefit: Improved speed compared to typical ETL. When they wanted to find out which part of the world their customers logged in from, a quick [ad hoc] MapReduce job was created and they had the answer within a few hours. Not really possible in your typical ETL system. ( no TIBCO reference)
35 Hadoop Real World Use Case: Storage to reduce Costs Global Parcel Service A lot of data must be stored forever Numbers increase exponentially Goal: As cheap as possible Problem: Queries must still be possible (compliance!) Solution: Commodity servers and Hadoop querying ( no TIBCO reference)
36 DWH or Hadoop? DWH Hadoop Data Structured All data Maturity Established in Enterprise New concepts Tooling Installed, good knowledge and experience New tools, coding required, business can still use SQL-similar queries or same BI tool Costs High (per GB) Low (per GB)
37 DWH plus Hadoop? DWH and Hadoop complement each other very well Store all data in Hadoop (cheap per GB) ETL from Hadoop to DWH (expensive per GB) Create specific reports / dashboards in DWH (leverage existing products and knowledge) Do Ad Hoc (Big) Data Discovery directly in Hadoop, no DWH needed Good BI tools support both, DWH and Hadoop! For example, TIBCO Spotfire has connectors to: RDBMS (e.g. MySQL) MPP (e.g. Teradata, IBM Netezza, Greenplum) Hadoop (e.g. Hive, Impala) In-Memory (e.g. TIBCO ActiveSpaces, SAP HANA)...
38 Recommendation DWH vs. Hadoop vs. NoSQL Short term: Use Hadoop (only) when you can save (a lot of) money or when you can not solve your business problem without Hadoop. A lot of things have to be improved, e.g. governance, security, performance, and tool support. Long term: Hadoop can replace DWH (as you can create a DWH on top of Hadoop with SQL interface as of today)! Be aware: A lot of other options emerged for analyzing big data besides Hadoop, e.g. - Analytical databases with SQL interface (MemSQL, Citus Data) - Log Analytics (Splunk, TIBCO LogLogic) - Graph databases (Neo4j, InfiniteGraph) - Cassandra, MongoDB, you name it...
39 Vendors Strategy... Hadoop vendors push Hadoop as DWH replacement Called e.g. Enterprise Data Hub (Cloudera) or Data Lake (Hortonworks)
40 Vendors Strategy... MPP / DWH vendors add Hadoop support as complementary addon to their DWH Reason (probably): Market pressure! Benefit: One platform (including tooling and support) for DWH and Hadoop ( SQL-for-everything )
41 Example: EMC combines DWH and Hadoop
42 Example: Teradata combines DWH and Hadoop
43 Hadoop evolving from Batch to Near Real Time Hadoop is MapReduce == Batch (== hours, minutes, seconds) Good for complex transformations / computations of big data volumes Not so good for ad hoc data exploration Improvements: Hive Stinger (Hortonworks) etc. Non-MapReduce processing engines added in the meantime (YARN makes it possible) Ad hoc data discovery (== seconds) Hive / Pig with Apache Tez replacing MapReduce under the hood for data processing New Query engines, e.g. Impala (Cloudera) or Apache Drill (MapR) MPP vendors (e.g. Teradata, EMC Greenplum) also add own query engines Offer fast data exploration (without MapReduce) SQL-for-everything Some Hadoop problems remain No good, easy tooling (Hadoop ecosystem) might be solved next years Missing maturity (alpha / beta versions) might be solved next years Commodity hardware no longer sufficient with these new emerging technologies (for instance: SQL-on- Hadoop solutions require a lot of memory) No real time (== ms, ns), but near real time (> 1 sec) too late architecture
44 Agenda Terminology Data Warehouse and Business Intelligence Big Data Processing with Hadoop Fast Data Processing in Real Time
45 Big Data Architecture Big Data Architecture Hadoop DWH / BI Real Time
46 Real Time: The Two-Second Advantage A little bit of the right information, just a little bit beforehand whether it is a couple of seconds, minutes or hours is more valuable than all of the information in the world six months later this is the two-second advantage. Vikek Ranadivé, Founder and CEO of TIBCO
47 The Value of Data decreases over Time $$$$ $$$ $$ Business Event Data Ready for Analysis Analysis Completed Decision Made Event Processing speeds action and increases business value by seizing opportunities while they matter $ Action Taken Time
48 What is Big Data? The combined Vs of Big Data Volume (terabytes, petabytes) Velocity (realtime) X Fast Data Variety (social networks, blog posts, logs, sensors, etc.)
49 Complex Event / Stream Processing / In-Memory Concepts Streams: Monitoring millions of events in a specific time window to react proactively Stateful: Collect, filter and correlate events with state to anticipate outcomes and react proactively Transactional: Highly performant transactional event processing Products vs. Frameworks Products are mature, mission-critical, in production, e.g. TIBCO StreamBase, IBM InfoSphere Streams Open Source Frameworks, e.g. Apache Spark and Apache Storm Future will tell us about performance, tooling, support, etc. Can be combined with Hadoop Are complementary to Products such as TIBCO StreamBase In-Memory Can also be used for big data (Terabytes possible!) Usually complementary, i.e. they can respectively have to be combined with stream processing / complex event processing
50 Stream Processing Architecture (Example: TIBCO StreamBase) Connect to streams Snapshot AND always-live updates TIBCO Live Datamart Orders / Executions Transaction Cost TIBCO StreamBase Active Tables Continuous Query Ad Hoc Query Trading Signal Market Data Continuous Query Processor Alerts Alert Setting Anticipate opportunities, proactive action
51 Example: TIBCO StreamBase Tooling StreamBase Development Studio Visual Development Visual Debugging Feed Simulation Unit Testing StreamBase Live Datamart Real Time Analytics and Visualization Ad hoc queries Alerts and Notifications Web, Mobile and API Integration
52 Some Fast Data Use Cases Algorithmic trading (trading) Fraud detection (finance) Predictive sensor analytics (manufacturing) Continuous network analytics (telecom) Omni-channel sales (retail) Let s take a closer look at one example FAST DATA use cases show up everywhere, not just in trading! 56
53 The future of retail technology is real-time and event driven. - CIO, leading retailer
54 Copyright TIBCO Software Inc. PSYCHOLOGICAL ROUTER 88% Inventory 18% 43% 52% Location 28% Spend 23% MATCH 92% Last Experience 76% Browser Type 68% App Version 85% Nice to see you again! 79%
55 The Event-Driven Retail Reference Architecture REAL-TIME CUSTOMER INTERACTION EVENT-DRIVEN PAYMENTS SENTIMENT ANALYTICS & ALERTING LIVE PROMOTIONS & PRICING PROGRAM, CAMPAIGN & OFFER MANAGEMENT WALLET LOYALTY POINTS EVENT-DRIVEN VIRTUAL CUSTOMER IMAGE EVENT-DRIVEN INVENTORY FABRIC EXTERNA L EXTERNA L CRM INVENTORY WAREHOUS E STORE
56 Retailers want to treat their stores like warehouses... Demand (from the ESB) Inventory (from In-Memory) Action (dynamic rules) Cross Sell Aggression (from correlation rules)
57 Real Time plus Hadoop? Hadoop: Storage Complex computing (MapReduce) Real Time: Immediate (proactive) reactions Monitor streaming data in Real Time Example: TIBCO StreamBase and its Apache Flume connector for reading streaming data from Hadoop / HDFS or to send streaming data to Hadoop / HDFS
58 Real Time plus Hadoop Real World Use Case Use Case: Predict pricing movement in live bets Hadoop: Store all history information about all past bets Use MapReduce to precompute odds for new matches, based on all history data TIBCO StreamBase: Compute new odds in real time to react within a live game after events (e.g. when a team scores a goal) Monitor stream data in real time dashboards
59 Streaming Algorithm???????? WHEN 5 KEY BOOKIES RAISE THE SAME ODDS IN A 5-SECOND WINDOW, BET LESS
60 Reference Architecture: Streaming Betting Analytics GLOBAL, DISTRIBUTED INFRASTRUCTURE Event Processing BETTING LINES SCORES B U S MONITOR AGGREGATE REAL-TIME ANALYTICS CORRELATE B U S Predictive odds analytics Historical odds deviations NEWS HISTORICAL COMPARISON CACHE CACHE CACHE Zero Latency Betting Analytics HADOOP Real-Time Analytics Context: Historical Betting Data, Odds, Outcomes Copyright TIBCO Software StreamBase Inc. LiveView
61 Recap: Big Data Architecture Big Data Architecture Hadoop DWH / BI Real Time
62 Off Topic What about Integration?
63 Off Topic Integration is no talking point in this session However: It gets even more important in the future! The number of different data sources and technologies increases even more than in the past CRM, ERP, Host, B2B, etc. will not disappear DWH, Hadoop cluster, event / streaming server, In-Memory DB have to communicate Cloud, Mobile, Internet of Things are no option, but our future!
64 Recap: Key Messages Big Data is not just Hadoop, concentrate on Business Value! A good Big Data Architecture combines DWH, Hadoop and Real Time! The Integration Layer is getting even more important in the Big Data Era!
65 Questions? Kai
Kai Wähner. The Next-Generation BPM for a Big Data World: Intelligent Business Process Management Suites (ibpms)
The Next-Generation BPM for a Big Data World: Intelligent Business Process Management Suites (ibpms) Kai Wähner [email protected] @KaiWaehner www.kai-waehner.de Xing / LinkedIn Please connect! Kai
#mstrworld. Tapping into Hadoop and NoSQL Data Sources in MicroStrategy. Presented by: Trishla Maru. #mstrworld
Tapping into Hadoop and NoSQL Data Sources in MicroStrategy Presented by: Trishla Maru Agenda Big Data Overview All About Hadoop What is Hadoop? How does MicroStrategy connects to Hadoop? Customer Case
Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect
Big Data & QlikView Democratizing Big Data Analytics David Freriks Principal Solution Architect TDWI Vancouver Agenda What really is Big Data? How do we separate hype from reality? How does that relate
Tap into Hadoop and Other No SQL Sources
Tap into Hadoop and Other No SQL Sources Presented by: Trishla Maru What is Big Data really? The Three Vs of Big Data According to Gartner Volume Volume Orders of magnitude bigger than conventional data
Native Connectivity to Big Data Sources in MicroStrategy 10. Presented by: Raja Ganapathy
Native Connectivity to Big Data Sources in MicroStrategy 10 Presented by: Raja Ganapathy Agenda MicroStrategy supports several data sources, including Hadoop Why Hadoop? How does MicroStrategy Analytics
The Future of Data Management
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class
Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy. Presented by: Jeffrey Zhang and Trishla Maru
Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy Presented by: Jeffrey Zhang and Trishla Maru Agenda Big Data Overview All About Hadoop What is Hadoop? How does MicroStrategy connects to Hadoop?
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics
Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014
Forecast of Big Data Trends Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014 Big Data transforms Business 2 Data created every minute Source http://mashable.com/2012/06/22/data-created-every-minute/
GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION
GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION Syed Rasheed Solution Manager Red Hat Corp. Kenny Peeples Technical Manager Red Hat Corp. Kimberly Palko Product Manager Red Hat Corp.
A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani
A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani Technical Architect - Big Data Syntel Agenda Welcome to the Zoo! Evolution Timeline Traditional BI/DW Architecture Where Hadoop Fits In 2 Welcome to
Information Builders Mission & Value Proposition
Value 10/06/2015 2015 MapR Technologies 2015 MapR Technologies 1 Information Builders Mission & Value Proposition Economies of Scale & Increasing Returns (Note: Not to be confused with diminishing returns
SAP and Hortonworks Reference Architecture
SAP and Hortonworks Reference Architecture Hortonworks. We Do Hadoop. June Page 1 2014 Hortonworks Inc. 2011 2014. All Rights Reserved A Modern Data Architecture With SAP DATA SYSTEMS APPLICATIO NS Statistical
Datenverwaltung im Wandel - Building an Enterprise Data Hub with
Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees
HDP Enabling the Modern Data Architecture
HDP Enabling the Modern Data Architecture Herb Cunitz President, Hortonworks Page 1 Hortonworks enables adoption of Apache Hadoop through HDP (Hortonworks Data Platform) Founded in 2011 Original 24 architects,
AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW
AGENDA What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story Hadoop PDW Our BIG DATA Roadmap BIG DATA? Volume 59% growth in annual WW information 1.2M Zetabytes (10 21 bytes) this
Big Data and Data Science: Behind the Buzz Words
Big Data and Data Science: Behind the Buzz Words Peggy Brinkmann, FCAS, MAAA Actuary Milliman, Inc. April 1, 2014 Contents Big data: from hype to value Deconstructing data science Managing big data Analyzing
TRANSFORM BIG DATA INTO ACTIONABLE INFORMATION
TRANSFORM BIG DATA INTO ACTIONABLE INFORMATION Make Big Available for Everyone Syed Rasheed Solution Marketing Manager January 29 th, 2014 Agenda Demystifying Big Challenges Getting Bigger Red Hat Big
How To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
Big Data Technologies Compared June 2014
Big Data Technologies Compared June 2014 Agenda What is Big Data Big Data Technology Comparison Summary Other Big Data Technologies Questions 2 What is Big Data by Example The SKA Telescope is a new development
April 2016 JPoint Moscow, Russia. How to Apply Big Data Analytics and Machine Learning to Real Time Processing. Kai Wähner. kwaehner@tibco.
April 2016 JPoint Moscow, Russia How to Apply Big Data Analytics and Machine Learning to Real Time Processing Kai Wähner [email protected] @KaiWaehner www.kai-waehner.de LinkedIn / Xing Please connect!
SQLSaturday #399 Sacramento 25 July, 2015. Big Data Analytics with Excel
SQLSaturday #399 Sacramento 25 July, 2015 Big Data Analytics with Excel Presenter Introduction Peter Myers Independent BI Expert Bitwise Solutions BBus, SQL Server MCSE, SQL Server MVP since 2007 Experienced
HDP Hadoop From concept to deployment.
HDP Hadoop From concept to deployment. Ankur Gupta Senior Solutions Engineer Rackspace: Page 41 27 th Jan 2015 Where are you in your Hadoop Journey? A. Researching our options B. Currently evaluating some
Native Connectivity to Big Data Sources in MSTR 10
Native Connectivity to Big Data Sources in MSTR 10 Bring All Relevant Data to Decision Makers Support for More Big Data Sources Optimized Access to Your Entire Big Data Ecosystem as If It Were a Single
Actian SQL in Hadoop Buyer s Guide
Actian SQL in Hadoop Buyer s Guide Contents Introduction: Big Data and Hadoop... 3 SQL on Hadoop Benefits... 4 Approaches to SQL on Hadoop... 4 The Top 10 SQL in Hadoop Capabilities... 5 SQL in Hadoop
Tips and Techniques on how to better Monitor, Manage and Optimize your MicroStrategy System High ROI DW and BI Solutions
Tips and Techniques on how to better Monitor, Manage and Optimize your MicroStrategy System InfoCepts 'LJLWDOO\ VLJQHG E\,QIR&HSWV '1 FQ,QIR&HSWV JQ,QIR&HSWV F 8QLWHG 6WDWHV O 86 R,QIR&HSWV RX,QIR&HSWV
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012. Viswa Sharma Solutions Architect Tata Consultancy Services
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012 Viswa Sharma Solutions Architect Tata Consultancy Services 1 Agenda What is Hadoop Why Hadoop? The Net Generation is here Sizing the
SAS BIG DATA SOLUTIONS ON AWS SAS FORUM ESPAÑA, OCTOBER 16 TH, 2014 IAN MEYERS SOLUTIONS ARCHITECT / AMAZON WEB SERVICES
SAS BIG DATA SOLUTIONS ON AWS SAS FORUM ESPAÑA, OCTOBER 16 TH, 2014 IAN MEYERS SOLUTIONS ARCHITECT / AMAZON WEB SERVICES AWS GLOBAL INFRASTRUCTURE 10 Regions 25 Availability Zones 51 Edge locations WHAT
BIG DATA AND THE ENTERPRISE DATA WAREHOUSE WORKSHOP
BIG DATA AND THE ENTERPRISE DATA WAREHOUSE WORKSHOP Business Analytics for All Amsterdam - 2015 Value of Big Data is Being Recognized Executives beginning to see the path from data insights to revenue
Il mondo dei DB Cambia : Tecnologie e opportunita`
Il mondo dei DB Cambia : Tecnologie e opportunita` Giorgio Raico Pre-Sales Consultant Hewlett-Packard Italiana 2011 Hewlett-Packard Development Company, L.P. The information contained herein is subject
VIEWPOINT. High Performance Analytics. Industry Context and Trends
VIEWPOINT High Performance Analytics Industry Context and Trends In the digital age of social media and connected devices, enterprises have a plethora of data that they can mine, to discover hidden correlations
The Potential of Big Data in the Cloud. Juan Madera Technology Consultant [email protected]
The Potential of Big Data in the Cloud Juan Madera Technology Consultant [email protected] Agenda How to apply Big Data & Analytics What is it? Definitions, Technology and Data Science
The Internet of Things and Big Data: Intro
The Internet of Things and Big Data: Intro John Berns, Solutions Architect, APAC - MapR Technologies April 22 nd, 2014 1 What This Is; What This Is Not It s not specific to IoT It s not about any specific
Data Lake In Action: Real-time, Closed Looped Analytics On Hadoop
1 Data Lake In Action: Real-time, Closed Looped Analytics On Hadoop 2 Pivotal s Full Approach It s More Than Just Hadoop Pivotal Data Labs 3 Why Pivotal Exists First Movers Solve the Big Data Utility Gap
W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract
W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the
Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap
Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap 3 key strategic advantages, and a realistic roadmap for what you really need, and when 2012, Cognizant Topics to be discussed
Using Big Data for Smarter Decision Making. Colin White, BI Research July 2011 Sponsored by IBM
Using Big Data for Smarter Decision Making Colin White, BI Research July 2011 Sponsored by IBM USING BIG DATA FOR SMARTER DECISION MAKING To increase competitiveness, 83% of CIOs have visionary plans that
Luncheon Webinar Series May 13, 2013
Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration
Comprehensive Analytics on the Hortonworks Data Platform
Comprehensive Analytics on the Hortonworks Data Platform We do Hadoop. Page 1 Page 2 Back to 2005 Page 3 Vertical Scaling Page 4 Vertical Scaling Page 5 Vertical Scaling Page 6 Horizontal Scaling Page
Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook
Hadoop Ecosystem Overview CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook Agenda Introduce Hadoop projects to prepare you for your group work Intimate detail will be provided in future
Bringing the Power of SAS to Hadoop. White Paper
White Paper Bringing the Power of SAS to Hadoop Combine SAS World-Class Analytic Strength with Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities Contents Introduction... 1 What
How Big Is Big Data Adoption? Survey Results. Survey Results... 4. Big Data Company Strategy... 6
Survey Results Table of Contents Survey Results... 4 Big Data Company Strategy... 6 Big Data Business Drivers and Benefits Received... 8 Big Data Integration... 10 Big Data Implementation Challenges...
Simplifying Big Data Analytics: Unifying Batch and Stream Processing. John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!!
Simplifying Big Data Analytics: Unifying Batch and Stream Processing John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!! Streaming Analy.cs S S S Scale- up Database Data And Compute Grid
Reference Architecture, Requirements, Gaps, Roles
Reference Architecture, Requirements, Gaps, Roles The contents of this document are an excerpt from the brainstorming document M0014. The purpose is to show how a detailed Big Data Reference Architecture
Big Data and Trusted Information
Dr. Oliver Adamczak Big Data and Trusted Information CAS Single Point of Truth 7. Mai 2012 The Hype Big Data: The next frontier for innovation, competition and productivity McKinsey Global Institute 2012
TIBCO Live Datamart: Push-Based Real-Time Analytics
TIBCO Live Datamart: Push-Based Real-Time Analytics ABSTRACT TIBCO Live Datamart is a new approach to real-time analytics and data warehousing for environments where large volumes of data require a management
Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum
Big Data Analytics with EMC Greenplum and Hadoop Big Data Analytics with EMC Greenplum and Hadoop Ofir Manor Pre Sales Technical Architect EMC Greenplum 1 Big Data and the Data Warehouse Potential All
Customized Report- Big Data
GINeVRA Digital Research Hub Customized Report- Big Data 1 2014. All Rights Reserved. Agenda Context Challenges and opportunities Solutions Market Case studies Recommendations 2 2014. All Rights Reserved.
Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics
In Organizations Mark Vervuurt Cluster Data Science & Analytics AGENDA 1. Yellow Elephant 2. Data Ingestion & Complex Event Processing 3. SQL on Hadoop 4. NoSQL 5. InMemory 6. Data Science & Machine Learning
Ganzheitliches Datenmanagement
Ganzheitliches Datenmanagement für Hadoop Michael Kohs, Senior Sales Consultant @mikchaos The Problem with Big Data Projects in 2016 Relational, Mainframe Documents and Emails Data Modeler Data Scientist
Next-Generation Cloud Analytics with Amazon Redshift
Next-Generation Cloud Analytics with Amazon Redshift What s inside Introduction Why Amazon Redshift is Great for Analytics Cloud Data Warehousing Strategies for Relational Databases Analyzing Fast, Transactional
Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January 2015. Email: [email protected] Website: www.qburst.com
Lambda Architecture Near Real-Time Big Data Analytics Using Hadoop January 2015 Contents Overview... 3 Lambda Architecture: A Quick Introduction... 4 Batch Layer... 4 Serving Layer... 4 Speed Layer...
Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.
Collaborative Big Data Analytics 1 Big Data Is Less About Size, And More About Freedom TechCrunch!!!!!!!!! Total data: bigger than big data 451 Group Findings: Big Data Is More Extreme Than Volume Gartner!!!!!!!!!!!!!!!
Harnessing the Power of the Microsoft Cloud for Deep Data Analytics
1 Harnessing the Power of the Microsoft Cloud for Deep Data Analytics Today's Focus How you can operate your business more efficiently and effectively by tapping into Cloud based data analytics solutions
Data Integration Checklist
The need for data integration tools exists in every company, small to large. Whether it is extracting data that exists in spreadsheets, packaged applications, databases, sensor networks or social media
The Enterprise Data Hub and The Modern Information Architecture
The Enterprise Data Hub and The Modern Information Architecture Dr. Amr Awadallah CTO & Co-Founder, Cloudera Twitter: @awadallah 1 2013 Cloudera, Inc. All rights reserved. Cloudera Overview The Leader
Talend Big Data. Delivering instant value from all your data. Talend 2014 1
Talend Big Data Delivering instant value from all your data Talend 2014 1 I may say that this is the greatest factor: the way in which the expedition is equipped. Roald Amundsen race to the south pole,
Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP
Big-Data and Hadoop Developer Training with Oracle WDP What is this course about? Big Data is a collection of large and complex data sets that cannot be processed using regular database management tools
BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata
BIG DATA: FROM HYPE TO REALITY Leandro Ruiz Presales Partner for C&LA Teradata Evolution in The Use of Information Action s ACTIVATING MAKE it happen! Insights OPERATIONALIZING WHAT IS happening now? PREDICTING
Cloudera Enterprise Data Hub in Telecom:
Cloudera Enterprise Data Hub in Telecom: Three Customer Case Studies Version: 103 Table of Contents Introduction 3 Cloudera Enterprise Data Hub for Telcos 4 Cloudera Enterprise Data Hub in Telecom: Customer
Hadoop and Relational Database The Best of Both Worlds for Analytics Greg Battas Hewlett Packard
Hadoop and Relational base The Best of Both Worlds for Analytics Greg Battas Hewlett Packard The Evolution of Analytics Mainframe EDW Proprietary MPP Unix SMP MPP Appliance Hadoop? Questions Is Hadoop
CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data
Research Report CA Technologies Big Data Infrastructure Management Executive Summary CA Technologies recently exhibited new technology innovations, marking its entry into the Big Data marketplace with
TE's Analytics on Hadoop and SAP HANA Using SAP Vora
TE's Analytics on Hadoop and SAP HANA Using SAP Vora Naveen Narra Senior Manager TE Connectivity Santha Kumar Rajendran Enterprise Data Architect TE Balaji Krishna - Director, SAP HANA Product Mgmt. -
The BIg Picture. Dinsdag 17 september 2013
The BIg Picture Dinsdag 17 september 2013 2 Agenda A short historical overview on BI Current Issues Current trends Future architecture First steps to this architecture 3 MIS/EIS Data Warehouse BI Multidimensional
Extend your analytic capabilities with SAP Predictive Analysis
September 9 11, 2013 Anaheim, California Extend your analytic capabilities with SAP Predictive Analysis Charles Gadalla Learning Points Advanced analytics strategy at SAP Simplifying predictive analytics
Where is... How do I get to...
Big Data, Fast Data, Spatial Data Making Sense of Location Data in a Smart City Hans Viehmann Product Manager EMEA ORACLE Corporation August 19, 2015 Copyright 2014, Oracle and/or its affiliates. All rights
The Future of Data Management with Hadoop and the Enterprise Data Hub
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah Cofounder & CTO, Cloudera, Inc. Twitter: @awadallah 1 2 Cloudera Snapshot Founded 2008, by former employees of Employees
Building Your Big Data Team
Building Your Big Data Team With all the buzz around Big Data, many companies have decided they need some sort of Big Data initiative in place to stay current with modern data management requirements.
Modern Data Architecture for Predictive Analytics
Modern Data Architecture for Predictive Analytics David Smith VP Marketing and Community - Revolution Analytics John Kreisa VP Strategic Marketing- Hortonworks Hortonworks Inc. 2013 Page 1 Your Presenters
The 4 Pillars of Technosoft s Big Data Practice
beyond possible Big Use End-user applications Big Analytics Visualisation tools Big Analytical tools Big management systems The 4 Pillars of Technosoft s Big Practice Overview Businesses have long managed
BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES
BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES Relational vs. Non-Relational Architecture Relational Non-Relational Rational Predictable Traditional Agile Flexible Modern 2 Agenda Big Data
Real Time Big Data Processing
Real Time Big Data Processing Cloud Expo 2014 Ian Meyers Amazon Web Services Global Infrastructure Deployment & Administration App Services Analytics Compute Storage Database Networking AWS Global Infrastructure
EMC Federation Big Data Solutions. Copyright 2015 EMC Corporation. All rights reserved.
EMC Federation Big Data Solutions 1 Introduction to data analytics Federation offering 2 Traditional Analytics! Traditional type of data analysis, sometimes called Business Intelligence! Type of analytics
W H I T E P A P E R. Building your Big Data analytics strategy: Block-by-Block! Abstract
W H I T E P A P E R Building your Big Data analytics strategy: Block-by-Block! Abstract In this white paper, Impetus discusses how you can handle Big Data problems. It talks about how analytics on Big
Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
Self-service BI for big data applications using Apache Drill
Self-service BI for big data applications using Apache Drill 2015 MapR Technologies 2015 MapR Technologies 1 Data Is Doubling Every Two Years Unstructured data will account for more than 80% of the data
Open Source in Financial Services: Meet the challenges of new business models and disruption
Open Source in Financial Services: Meet the challenges of new business models and disruption Speakers Vamsi Chemitiganti, General Manager Financial Services, Hortonworks Josh West, Senior Solutions Architect,
Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth
MAKING BIG DATA COME ALIVE Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth Steve Gonzales, Principal Manager [email protected]
Big Data and Industrial Internet
Big Data and Industrial Internet Keijo Heljanko Department of Computer Science and Helsinki Institute for Information Technology HIIT School of Science, Aalto University [email protected] 16.6-2015
Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84
Index A Amazon Web Services (AWS), 50, 58 Analytics engine, 21 22 Apache Kafka, 38, 131 Apache S4, 38, 131 Apache Sqoop, 37, 131 Appliance pattern, 104 105 Application architecture, big data analytics
From Spark to Ignition:
From Spark to Ignition: Fueling Your Business on Real-Time Analytics Eric Frenkiel, MemSQL CEO June 29, 2015 San Francisco, CA What s in Store For This Presentation? 1. MemSQL: A real-time database for
Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.
Big Data Analytics 1 Priority Discussion Topics What are the most compelling business drivers behind big data analytics? Do you have or expect to have data scientists on your staff, and what will be their
Big Data Multi-Platform Analytics (Hadoop, NoSQL, Graph, Analytical Database)
Multi-Platform Analytics (Hadoop, NoSQL, Graph, Analytical Database) Presented By: Mike Ferguson Intelligent Business Strategies Limited 2 Day Workshop : 25-26 September 2014 : 29-30 September 2014 www.unicom.co.uk/bigdata
Tiber Solutions. Understanding the Current & Future Landscape of BI and Data Storage. Jim Hadley
Tiber Solutions Understanding the Current & Future Landscape of BI and Data Storage Jim Hadley Tiber Solutions Founded in 2005 to provide Business Intelligence / Data Warehousing / Big Data thought leadership
Advanced Big Data Analytics with R and Hadoop
REVOLUTION ANALYTICS WHITE PAPER Advanced Big Data Analytics with R and Hadoop 'Big Data' Analytics as a Competitive Advantage Big Analytics delivers competitive advantage in two ways compared to the traditional
QUEST meeting Big Data Analytics
QUEST meeting Big Data Analytics Peter Hughes Business Solutions Consultant SAS Australia/New Zealand Copyright 2015, SAS Institute Inc. All rights reserved. Big Data Analytics WHERE WE ARE NOW 2005 2007
This Symposium brought to you by www.ttcus.com
This Symposium brought to you by www.ttcus.com Linkedin/Group: Technology Training Corporation @Techtrain Technology Training Corporation www.ttcus.com Big Data Analytics as a Service (BDAaaS) Big Data
Upcoming Announcements
Enterprise Hadoop Enterprise Hadoop Jeff Markham Technical Director, APAC [email protected] Page 1 Upcoming Announcements April 2 Hortonworks Platform 2.1 A continued focus on innovation within
EVERYTHING THAT MATTERS IN ADVANCED ANALYTICS
EVERYTHING THAT MATTERS IN ADVANCED ANALYTICS Marcia Kaufman, Principal Analyst, Hurwitz & Associates Dan Kirsch, Senior Analyst, Hurwitz & Associates Steve Stover, Sr. Director, Product Management, Predixion
Big Data Can Drive the Business and IT to Evolve and Adapt
Big Data Can Drive the Business and IT to Evolve and Adapt Ralph Kimball Associates 2013 Ralph Kimball Brussels 2013 Big Data Itself is Being Monetized Executives see the short path from data insights
