Dashboard Engine for Hadoop
|
|
|
- Francine Greer
- 10 years ago
- Views:
Transcription
1 Matt McDevitt Sr. Project Manager Pavan Challa Sr. Data Engineer June 2015 Dashboard Engine for Hadoop Think Big Start Smart Scale Fast
2 Agenda Think Big Overview Engagement Model Solution Offerings Dashboard Engine Demo Q&A 2015 Think Big, a CONFIDENTIAL Teradata Company 2
3 2015 Think Big, a CONFIDENTIAL Teradata Company 3 3
4 Think Big Overview Founded in 2010, acquired in 2014, International in 2015 First and leading professional services firm exclusively focused on big data End to End Services: Strategy, Design, Implementation, IP/Software, Support and Managed Services Academy to scale delivery capability Extend and integrate open source with UDA Team-based delivery with Solution Center Growing quickly: we re hiring! Think Big Founded 2010 PRESTO 2015 Think Big, a CONFIDENTIAL Teradata Company 4
5 Think Big Engagement Model 2015 Think Big, a CONFIDENTIAL Teradata Company 5
6 Think Big Analytics VELOCITY Methodology New Data Big Data Approach Use Cases Roadmap Big Data Lab Business Analytics New Models New Analytics New Insights New Data Requirements Big Data Program Mgt Solutions Planning and Design Prioritization Capability Backlog Grooming for engineering Data Science Discovery R&D Managed Services Quality Assurance & Test Managed Support Break Fix Sustaining Engineering Data Engineering Engineering Sprint(s) Releases 2015 Think Big, a CONFIDENTIAL Teradata Company 6
7 Think Big Solution Offerings 1. Big Data Strategy Roadmap 2. Data Lake Starter Program 3. Data Lake Optimization 4. Data Lake Managed Services 5. Presto for the Enterprise new as of June 10, Big Data Managed Services 7. Think Big Academy Device Data Manufacturing Operations Omni-Channel Marketing Analytics Financial Services Fraud/Risk Analytics Healthcare personalization Custom Analytics Solution Services Device Data Behavior Analytics IT Threat Detection Public Sector Risk Analysis Gaming Analytics 2015 Think Big, a CONFIDENTIAL Teradata Company 7
8 Data Lake Implementation MAKING BIG DATA COME ALIVE
9 Data Lake Program Offers Data Lake: Starter Program Stand up a Data Lake and build 3 governed batch data ingest streams Includes Services and Subscription Software Frameworks Data Lake: Optimization Add governance to your Data Lake For Data Lakes not originally built by Think Big Data Lake: Dashboard Engine Reporting Install and configure engine with Data Lake to build dashboard analytics for deep dimensional rollup reporting capabilities with Tableau on Hadoop Data Lake: Security Data Security & InfoSec, Cluster Hardening, Perimeter, Connectivity Data Lake: Managed Services Only for Data Lakes that Think Big Designs and Builds On Premise, Public Cloud (AWS) and Private Cloud (Teradata and Altiscale) 2015 Think Big, a CONFIDENTIAL Teradata Company 9
10 Think Big Data Lake Starter Program (8 Week Engagement) Objective: Design, Develop and Deploy Data Lake Ingestion with Governance 2 weeks 2 week 2 week 2 weeks Design Build & Test Integrate & Tune Assess, Mentor & Plan Collaborative workshops with business groups Identification and prioritization of high-value data streams Gap analysis Data Stream Prioritization Develop Ingest workflows Install Metadata and Info Security Services Prepare Cluster for Integration test Develop & Unit Testing Install Ingest & System Test Begin Profiling Data System Integration Testing Learn about Information Security and data wrangling Begin Building DL Reporting Final tuning, assessment and next steps Organization & Training Data Sources Cluster configuration & Integration Info Security Objectives Software Component Installation Data Profiling and Capability Follow-up Roadmap Executive Presentation 2015 Think Big, a CONFIDENTIAL Teradata Company 10
11 Think Big Enterprise Data Lake Perimeter-Authentication-Authorization Sequence Automate Prepare Source Metadata Collect & Manage Apply Structure Evaluate Source Data Ingest Metadata Prepare Data for Ingest Information Sources InfoSec Compress Protect Dashboard Engine Downstream Applications Enterprise Data Lake 2015 Think Big, a CONFIDENTIAL Teradata Company 11
12 API API Statistics Graph Analytics Dashboard Engine Realtime Processing Machine Learning Discovery Zone Kafka Spark Experimental Data Data Lab Msg Queue CDC Raw Data Processing Derived Views Buffer Server Governed Ingestion Data Repository Metadata Repository Security, Archival Loom integrated Metadata, lineage, Wrangling RainStor System of Record, Archive Think Big, a CONFIDENTIAL Teradata Company 12
13 Think Big, a CONFIDENTIAL Teradata Company 13
14 Why a Dashboard Engine? Events Hadoop 2015 Think Big, a CONFIDENTIAL Teradata Company 14
15 ThinkBig Dashboard Engine Strengths Near real-time analytics Easily scales to 100s of simulaneous users Query latency typically under 100 ms Deep dimensional drill-down Works with popular BI tools javascript, jquery Tableau others announced soon 2015 Think Big, a CONFIDENTIAL Teradata Company 15
16 Using Tableau without Dashboard Engine Queryable data limited by size of Server. Doesn t scale as users grow. Middle Tier Server Hadoop Extract 2015 Think Big, a CONFIDENTIAL Teradata Company 16
17 Using Impala without Think Big Dashboard Engine For the time the query is running, most or all of the cluster is dedicated to that one query. Has limitations if the cluster has other loads Has limitations for simultaneous dashboard users Low latencies possible only if all the event data is in RAM at query time Think Big, a CONFIDENTIAL Teradata Company 17
18 18 Dash Board Engine Architecture
19 Think Big s Dashboard Engine for Hadoop Uses the power of Apache Spark to pre-aggregate data Scales as event volume grows. Scales as number of users grows. API 2015 Think Big, a CONFIDENTIAL Teradata Company 19
20 Arrivals-a:SFO-s:CA Arrivals-a:SFO-s:CA Arrivals-a:SFO-s:CA Arrivals-s:CA Arrivals-s:CA Arrivals-s:CA Arrivals Arrivals Arrivals Store cube data 2015 Think Big, a CONFIDENTIAL Teradata Company 20
21 API - Connecting to the Dashboard Engine Aggregate API that understands metrics, dimensions, time ranges. Relational API that understands (some) SQL. Aggregate API SQL API 2015 Think Big, a CONFIDENTIAL Teradata Company 21
22 22 Demo
23 Flight Data Statistics for Demo Running on a 16-node cluster (TD Appliance for Hadoop) Process and store all data in ~ 2 hours Rows Storage space Flight records 160 million 30 GB MOLAP cube 35 billion 2.1 TB 2015 Think Big, a CONFIDENTIAL Teradata Company 23
24 SQL Query to REST API Example Sends SQL queries to the API SELECT FlightData.Date AS "none_date_ok", FlightData.State AS "none_state_nk, SUM(FlightData.Arrivals) AS "sum_arrivals_nk FROM GROUP BY "default"."flightdata" "FlightData" "none_date_ok, "none_state_nk Translated to Aggregate API queries period=day&start= &dimension=state:&metric=arrivals 2015 Think Big, a CONFIDENTIAL Teradata Company 24
25 Example index: List all Airports for a specific State <index name="airportsbystate"> <periods> <period>day</period> </periods> <indexdimensions> <dimension name="state" /> </indexdimensions> <listdimensions> <dimension name="airport" /> </listdimensions> </index> 2015 Think Big, a CONFIDENTIAL Teradata Company 25
26 Aggregate use: Show arrivals for all airports for NY y&start= &end= &dimension=Airport:&dimension=State:NY&metric=Arrivals&head ers=on Day Start Airport State Arrivals ALB NY ART NY BUF NY JFK NY LGA NY ROC NY SWF NY SYR NY Think Big, a CONFIDENTIAL Teradata Company 26
27 Index: List Flight No / Carrier / City / State combinations <index name="listflightnocarriercitystate"> <periods> <period>day</period> </periods> <indexdimensions> </indexdimensions> <listdimensions> <dimension name="state" /> <dimension name="city" /> <dimension name="carrier" /> <dimension name="flightno" /> </listdimensions> </index> 2015 Think Big, a CONFIDENTIAL Teradata Company 27
28 Dimensions use: Show all Flight/Carrier/City/State =day&start= &end= &dimension=State:&dimension=City:&dimension=Carrier:&dime nsion=flightno: "results":[ ["AK","Anchorage, AK","AS","101"], ["AK","Anchorage, AK","AS","102"], ["AK","Anchorage, AK","AS","103"], ["AK","Anchorage, AK","AS","106"], ["AK","Anchorage, AK","AS","108"],... ["AL","Huntsville, AL","DL","1782"], ["AL","Huntsville, AL","DL","2077"],... ["WY","Rock Springs, WY","OO","7413"]] 2015 Think Big, a CONFIDENTIAL Teradata Company 28
29 Index Question Q: Drill down to a list of flights that had caused delay in Colorado done by Delta? A: Create the index below, rerun index creation step, query delay metrics for given state and carrier, while listing flight numbers dimension=flightno: <index name="listflightnobycarrierstate"> </index> <periods> <period>day</period> </periods> <indexdimensions> <dimension name="state" /> <dimension name="carrier" /> </indexdimensions> <listdimensions> <dimension name="flightno" /> </listdimensions> 2015 Think Big, a CONFIDENTIAL Teradata Company 29
30 30 Questions?
31 We are hiring!!! DATA ANALYTICS DATA ENGINEERS DATA SOLUTIONS Think Big International
Data Governance in the Hadoop Data Lake. Kiran Kamreddy May 2015
Data Governance in the Hadoop Data Lake Kiran Kamreddy May 2015 One Data Lake: Many Definitions A centralized repository of raw data into which many data-producing streams flow and from which downstream
Data Governance in the Hadoop Data Lake. Michael Lang May 2015
Data Governance in the Hadoop Data Lake Michael Lang May 2015 Introduction Product Manager for Teradata Loom Joined Teradata as part of acquisition of Revelytix, original developer of Loom VP of Sales
From Spark to Ignition:
From Spark to Ignition: Fueling Your Business on Real-Time Analytics Eric Frenkiel, MemSQL CEO June 29, 2015 San Francisco, CA What s in Store For This Presentation? 1. MemSQL: A real-time database for
The Future of Data Management
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class
A very short talk about Apache Kylin Business Intelligence meets Big Data. Fabian Wilckens EMEA Solutions Architect
A very short talk about Apache Kylin Business Intelligence meets Big Data Fabian Wilckens EMEA Solutions Architect 1 The challenge today 2 Very quickly: OLAP Online Analytical Processing How many beers
Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth
MAKING BIG DATA COME ALIVE Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth Steve Gonzales, Principal Manager [email protected]
Real Time Big Data Processing
Real Time Big Data Processing Cloud Expo 2014 Ian Meyers Amazon Web Services Global Infrastructure Deployment & Administration App Services Analytics Compute Storage Database Networking AWS Global Infrastructure
Beyond Lambda - how to get from logical to physical. Artur Borycki, Director International Technology & Innovations
Beyond Lambda - how to get from logical to physical Artur Borycki, Director International Technology & Innovations Simplification & Efficiency Teradata believe in the principles of self-service, automation
Testing Big data is one of the biggest
Infosys Labs Briefings VOL 11 NO 1 2013 Big Data: Testing Approach to Overcome Quality Challenges By Mahesh Gudipati, Shanthi Rao, Naju D. Mohan and Naveen Kumar Gajja Validate data quality by employing
Ganzheitliches Datenmanagement
Ganzheitliches Datenmanagement für Hadoop Michael Kohs, Senior Sales Consultant @mikchaos The Problem with Big Data Projects in 2016 Relational, Mainframe Documents and Emails Data Modeler Data Scientist
Building a real-time, self-service data analytics ecosystem Greg Arnold, Sr. Director Engineering
Building a real-time, self-service data analytics ecosystem Greg Arnold, Sr. Director Engineering Self Service at scale 6 5 4 3 2 1 ? Relational? MPP? Hadoop? Linkedin data 350M Members 25B 3.5M 4.8B 2M
Simplifying Big Data Analytics: Unifying Batch and Stream Processing. John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!!
Simplifying Big Data Analytics: Unifying Batch and Stream Processing John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!! Streaming Analy.cs S S S Scale- up Database Data And Compute Grid
Addressing Risk Data Aggregation and Risk Reporting Ben Sharma, CEO. Big Data Everywhere Conference, NYC November 2015
Addressing Risk Data Aggregation and Risk Reporting Ben Sharma, CEO Big Data Everywhere Conference, NYC November 2015 Agenda 1. Challenges with Risk Data Aggregation and Risk Reporting (RDARR) 2. How a
Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect
Big Data & QlikView Democratizing Big Data Analytics David Freriks Principal Solution Architect TDWI Vancouver Agenda What really is Big Data? How do we separate hype from reality? How does that relate
Upcoming Announcements
Enterprise Hadoop Enterprise Hadoop Jeff Markham Technical Director, APAC [email protected] Page 1 Upcoming Announcements April 2 Hortonworks Platform 2.1 A continued focus on innovation within
Pulsar Realtime Analytics At Scale. Tony Ng April 14, 2015
Pulsar Realtime Analytics At Scale Tony Ng April 14, 2015 Big Data Trends Bigger data volumes More data sources DBs, logs, behavioral & business event streams, sensors Faster analysis Next day to hours
In-memory computing with SAP HANA
In-memory computing with SAP HANA June 2015 Amit Satoor, SAP @asatoor 2015 SAP SE or an SAP affiliate company. All rights reserved. 1 Hyperconnectivity across people, business, and devices give rise to
Towards Smart and Intelligent SDN Controller
Towards Smart and Intelligent SDN Controller - Through the Generic, Extensible, and Elastic Time Series Data Repository (TSDR) YuLing Chen, Dell Inc. Rajesh Narayanan, Dell Inc. Sharon Aicler, Cisco Systems
CAPTURING & PROCESSING REAL-TIME DATA ON AWS
CAPTURING & PROCESSING REAL-TIME DATA ON AWS @ 2015 Amazon.com, Inc. and Its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent
Data Lake In Action: Real-time, Closed Looped Analytics On Hadoop
1 Data Lake In Action: Real-time, Closed Looped Analytics On Hadoop 2 Pivotal s Full Approach It s More Than Just Hadoop Pivotal Data Labs 3 Why Pivotal Exists First Movers Solve the Big Data Utility Gap
Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes
Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes Highly competitive enterprises are increasingly finding ways to maximize and accelerate
Harnessing the Power of the Microsoft Cloud for Deep Data Analytics
1 Harnessing the Power of the Microsoft Cloud for Deep Data Analytics Today's Focus How you can operate your business more efficiently and effectively by tapping into Cloud based data analytics solutions
Architectural patterns for building real time applications with Apache HBase. Andrew Purtell Committer and PMC, Apache HBase
Architectural patterns for building real time applications with Apache HBase Andrew Purtell Committer and PMC, Apache HBase Who am I? Distributed systems engineer Principal Architect in the Big Data Platform
Bringing Strategy to Life Using an Intelligent Data Platform to Become Data Ready. Informatica Government Summit April 23, 2015
Bringing Strategy to Life Using an Intelligent Platform to Become Ready Informatica Government Summit April 23, 2015 Informatica Solutions Overview Power the -Ready Enterprise Government Imperatives Improve
HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics
HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics ESSENTIALS EMC ISILON Use the industry's first and only scale-out NAS solution with native Hadoop
Big Data Analytics Platform @ Nokia
Big Data Analytics Platform @ Nokia 1 Selecting the Right Tool for the Right Workload Yekesa Kosuru Nokia Location & Commerce Strata + Hadoop World NY - Oct 25, 2012 Agenda Big Data Analytics Platform
Roadmap Talend : découvrez les futures fonctionnalités de Talend
Roadmap Talend : découvrez les futures fonctionnalités de Talend Cédric Carbone Talend Connect 9 octobre 2014 Talend 2014 1 Connecting the Data-Driven Enterprise Talend 2014 2 Agenda Agenda Why a Unified
The Future of Data Management with Hadoop and the Enterprise Data Hub
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah Cofounder & CTO, Cloudera, Inc. Twitter: @awadallah 1 2 Cloudera Snapshot Founded 2008, by former employees of Employees
Analytics on Spark & Shark @Yahoo
Analytics on Spark & Shark @Yahoo PRESENTED BY Tim Tully December 3, 2013 Overview Legacy / Current Hadoop Architecture Reflection / Pain Points Why the movement towards Spark / Shark New Hybrid Environment
Traditional BI vs. Business Data Lake A comparison
Traditional BI vs. Business Data Lake A comparison The need for new thinking around data storage and analysis Traditional Business Intelligence (BI) systems provide various levels and kinds of analyses
Ali Ghodsi Head of PM and Engineering Databricks
Making Big Data Simple Ali Ghodsi Head of PM and Engineering Databricks Big Data is Hard: A Big Data Project Tasks Tasks Build a Hadoop cluster Challenges Clusters hard to setup and manage Build a data
How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time
SCALEOUT SOFTWARE How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time by Dr. William Bain and Dr. Mikhail Sobolev, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T wenty-first
Apache Kylin Introduction Dec 8, 2014 @ApacheKylin
Apache Kylin Introduction Dec 8, 2014 @ApacheKylin Luke Han Sr. Product Manager [email protected] @lukehq Yang Li Architect & Tech Leader [email protected] Agenda What s Apache Kylin? Tech Highlights Performance
TE's Analytics on Hadoop and SAP HANA Using SAP Vora
TE's Analytics on Hadoop and SAP HANA Using SAP Vora Naveen Narra Senior Manager TE Connectivity Santha Kumar Rajendran Enterprise Data Architect TE Balaji Krishna - Director, SAP HANA Product Mgmt. -
Big Data Analytics Roadmap Energy Industry
Douglas Moore, Principal Consultant, Architect June 2013 Big Data Analytics Energy Industry Agenda Why Big Data in Energy? Imagine Overview - Use Cases - Readiness Analysis - Architecture - Development
Real-Time Data Access Using Restful Framework for Multi-Platform Data Warehouse Environment
www.wipro.com Real-Time Data Access Using Restful Framework for Multi-Platform Data Warehouse Environment Pon Prabakaran Shanmugam, Principal Consultant, Wipro Analytics practice Table of Contents 03...Abstract
MySQL and Hadoop: Big Data Integration. Shubhangi Garg & Neha Kumari MySQL Engineering
MySQL and Hadoop: Big Data Integration Shubhangi Garg & Neha Kumari MySQL Engineering 1Copyright 2013, Oracle and/or its affiliates. All rights reserved. Agenda Design rationale Implementation Installation
Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture
Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture Apps and data source extensions with APIs Future white label, embed or integrate Power BI Deploy Intelligent
Oracle Big Data Building A Big Data Management System
Oracle Big Building A Big Management System Copyright 2015, Oracle and/or its affiliates. All rights reserved. Effi Psychogiou ECEMEA Big Product Director May, 2015 Safe Harbor Statement The following
Databricks. A Primer
Databricks A Primer Who is Databricks? Databricks vision is to empower anyone to easily build and deploy advanced analytics solutions. The company was founded by the team who created Apache Spark, a powerful
Oracle Database 12c Plug In. Switch On. Get SMART.
Oracle Database 12c Plug In. Switch On. Get SMART. Duncan Harvey Head of Core Technology, Oracle EMEA March 2015 Safe Harbor Statement The following is intended to outline our general product direction.
Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013
Big Data Use Case How Rackspace is using Private Cloud for Big Data Bryan Thompson May 8th, 2013 Our Big Data Problem Consolidate all monitoring data for reporting and analytical purposes. Every device
BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata
BIG DATA: FROM HYPE TO REALITY Leandro Ruiz Presales Partner for C&LA Teradata Evolution in The Use of Information Action s ACTIVATING MAKE it happen! Insights OPERATIONALIZING WHAT IS happening now? PREDICTING
Building a data analytics platform with Hadoop, Python and R
Building a data analytics platform with Hadoop, Python and R Agenda Me Sanoma Past Present Future 3 18 November 2013 /me @skieft Software architect for Sanoma Managing the data and search team Focus on
EMC Federation Big Data Solutions. Copyright 2015 EMC Corporation. All rights reserved.
EMC Federation Big Data Solutions 1 Introduction to data analytics Federation offering 2 Traditional Analytics! Traditional type of data analysis, sometimes called Business Intelligence! Type of analytics
SEIZE THE DATA. 2015 SEIZE THE DATA. 2015
1 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. BIG DATA CONFERENCE 2015 Boston August 10-13 Predicting and reducing deforestation
Big data blue print for cloud architecture
Big data blue print for cloud architecture -COGNIZANT Image Area Prabhu Inbarajan Srinivasan Thiruvengadathan Muralicharan Gurumoorthy Praveen Codur 2012, Cognizant Next 30 minutes Big Data / Cloud challenges
MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012
MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012 Description: This five-day instructor-led course teaches students how to design and implement a BI infrastructure. The
Production ready hadoop. By Deepak Rao Na,onal Head Datawarehousing Bajaj Finserv
Production ready hadoop By Deepak Rao Na,onal Head Datawarehousing Bajaj Finserv Agenda! Data in today s BFSI world! Modern Data Lake! Use cases & prototyping! Big data impact in BFSI! Thank you!! Defini8on
Databricks. A Primer
Databricks A Primer Who is Databricks? Databricks was founded by the team behind Apache Spark, the most active open source project in the big data ecosystem today. Our mission at Databricks is to dramatically
Artur Borycki. Director International Solutions Marketing
Artur Borycki Director International Solutions Agenda! Evolution of Teradata s Unified Architecture Analytical and Workloads! Teradata s Reference Information Architecture Evolution of Teradata s" Unified
Hadoop & Spark Using Amazon EMR
Hadoop & Spark Using Amazon EMR Michael Hanisch, AWS Solutions Architecture 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda Why did we build Amazon EMR? What is Amazon EMR?
www.objectivity.com Choosing The Right Big Data Tools For The Job A Polyglot Approach
www.objectivity.com Choosing The Right Big Data Tools For The Job A Polyglot Approach Nic Caine NoSQL Matters, April 2013 Overview The Problem Current Big Data Analytics Relationship Analytics Leveraging
How Companies are! Using Spark
How Companies are! Using Spark And where the Edge in Big Data will be Matei Zaharia History Decreasing storage costs have led to an explosion of big data Commodity cluster software, like Hadoop, has made
Case Study: Real-time Analytics With Druid. Salil Kalia, Tech Lead, TO THE NEW Digital
Case Study: Real-time Analytics With Druid Salil Kalia, Tech Lead, TO THE NEW Digital Agenda Understanding the use-case Ad workflow Our use case Experiments with technologies Redis Cassandra Introduction
Big Data & Analytics for Semiconductor Manufacturing
Big Data & Analytics for Semiconductor Manufacturing 半 導 体 生 産 におけるビッグデータ 活 用 Ryuichiro Hattori 服 部 隆 一 郎 Intelligent SCM and MFG solution Leader Global CoC (Center of Competence) Electronics team General
Introducing Oracle Exalytics In-Memory Machine
Introducing Oracle Exalytics In-Memory Machine Jon Ainsworth Director of Business Development Oracle EMEA Business Analytics 1 Copyright 2011, Oracle and/or its affiliates. All rights Agenda Topics Oracle
Cognos Performance Troubleshooting
Cognos Performance Troubleshooting Presenters James Salmon Marketing Manager [email protected] Andy Ellis Senior BI Consultant [email protected] Want to ask a question?
IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!
The Bloor Group IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS VENDOR PROFILE The IBM Big Data Landscape IBM can legitimately claim to have been involved in Big Data and to have a much broader
Data Integration Checklist
The need for data integration tools exists in every company, small to large. Whether it is extracting data that exists in spreadsheets, packaged applications, databases, sensor networks or social media
Safe Harbor Statement
Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment
Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook
Hadoop Ecosystem Overview CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook Agenda Introduce Hadoop projects to prepare you for your group work Intimate detail will be provided in future
Azure Data Lake Analytics
Azure Data Lake Analytics Compose and orchestrate data services at scale Fully managed service to support orchestration of data movement and processing Connect to relational or non-relational data
Maximizing Your Storage Investment with the EMC Storage Inventory Dashboard
Maximizing Your Storage Investment with the EMC Storage Inventory Dashboard Matt Roberts Application Development Practice Lead Copyright 2008 EMC Corporation. All rights reserved. Today s Agenda Complexity
AtScale Intelligence Platform
AtScale Intelligence Platform PUT THE POWER OF HADOOP IN THE HANDS OF BUSINESS USERS. Connect your BI tools directly to Hadoop without compromising scale, performance, or control. TURN HADOOP INTO A HIGH-PERFORMANCE
Real Time Fraud Detection With Sequence Mining on Big Data Platform. Pranab Ghosh Big Data Consultant IEEE CNSV meeting, May 6 2014 Santa Clara, CA
Real Time Fraud Detection With Sequence Mining on Big Data Platform Pranab Ghosh Big Data Consultant IEEE CNSV meeting, May 6 2014 Santa Clara, CA Open Source Big Data Eco System Query (NOSQL) : Cassandra,
Big Analytics in the Cloud. Matt Winkler PM, Big Data @ Microsoft @mwinkle
Big Analytics in the Cloud Matt Winkler PM, Big Data @ Microsoft @mwinkle Part 3: Single Slide JustGiving is a global online social platform for giving that lets you raise money for a cause you care about
Oracle Big Data SQL Technical Update
Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical
Big Data Analytics - Accelerated. stream-horizon.com
Big Data Analytics - Accelerated stream-horizon.com Legacy ETL platforms & conventional Data Integration approach Unable to meet latency & data throughput demands of Big Data integration challenges Based
HDP Hadoop From concept to deployment.
HDP Hadoop From concept to deployment. Ankur Gupta Senior Solutions Engineer Rackspace: Page 41 27 th Jan 2015 Where are you in your Hadoop Journey? A. Researching our options B. Currently evaluating some
Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc. 2011 2014. All Rights Reserved
Hortonworks & SAS Analytics everywhere. Page 1 A change in focus. A shift in Advertising From mass branding A shift in Financial Services From Educated Investing A shift in Healthcare From mass treatment
How To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
NAVIGATING THE BIG DATA JOURNEY
Making big data come alive NAVIGATING THE BIG DATA JOURNEY Big Data and Hadoop: Moving from Strategy to Production London Dublin Mumbai Boston New York Atlanta Chicago Salt Lake City Silicon Valley (650)
QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering
QLIKVIEW INTEGRATION TION WITH AMAZON REDSHIFT John Park Partner Engineering June 2014 Page 1 Contents Introduction... 3 About Amazon Web Services (AWS)... 3 About Amazon Redshift... 3 QlikView on AWS...
Cisco IT Hadoop Journey
Cisco IT Hadoop Journey Srini Desikan, Program Manager IT 2015 MapR Technologies 1 Agenda Hadoop Platform Timeline Key Decisions / Lessons Learnt Data Lake Hadoop s place in IT Data Platforms Use Cases
Native Connectivity to Big Data Sources in MSTR 10
Native Connectivity to Big Data Sources in MSTR 10 Bring All Relevant Data to Decision Makers Support for More Big Data Sources Optimized Access to Your Entire Big Data Ecosystem as If It Were a Single
BIG DATA. Using the Lambda Architecture on a Big Data Platform to Improve Mobile Campaign Management. Author: Sandesh Deshmane
BIG DATA Using the Lambda Architecture on a Big Data Platform to Improve Mobile Campaign Management Author: Sandesh Deshmane Executive Summary Growing data volumes and real time decision making requirements
www.sryas.com Analance Data Integration Technical Whitepaper
Analance Data Integration Technical Whitepaper Executive Summary Business Intelligence is a thriving discipline in the marvelous era of computing in which we live. It s the process of analyzing and exploring
SQL Server PDW. Artur Vieira Premier Field Engineer
SQL Server PDW Artur Vieira Premier Field Engineer Agenda 1 Introduction to MPP and PDW 2 PDW Architecture and Components 3 Data Structures 4 PDW Tools Data Load / Data Output / Administrative Console
Big Data for Investment Research Management
IDT Partners www.idtpartners.com Big Data for Investment Research Management Discover how IDT Partners helps Financial Services, Market Research, and Investment Management firms turn big data into actionable
SAP HANA PLATFORM Top Ten Questions for Choosing In-Memory Databases. Start Here
PLATFORM Top Ten Questions for Choosing In-Memory Databases Start Here PLATFORM Top Ten Questions for Choosing In-Memory Databases. Are my applications accelerated without manual intervention and tuning?.
GROW WITH BIG DATA Third Eye Consulting Services & Solutions LLC.
GROW WITH BIG DATA Third Eye Consulting Services & Solutions LLC. Connected Cars Driving Us to a Better Us - In Real Time What is a Connected Car? Connected Car - Definition A connected car is a car that
Information Builders Mission & Value Proposition
Value 10/06/2015 2015 MapR Technologies 2015 MapR Technologies 1 Information Builders Mission & Value Proposition Economies of Scale & Increasing Returns (Note: Not to be confused with diminishing returns
Real-Time Data Analytics and Visualization
Real-Time Data Analytics and Visualization Making the leap to BI on Hadoop Predictive Analytics & Business Insights 2015 February 9, 2015 David P. Mariani CEO, AtScale, Inc. THE TRUTH ABOUT DATA We think
Dell* In-Memory Appliance for Cloudera* Enterprise
Built with Intel Dell* In-Memory Appliance for Cloudera* Enterprise Find out what faster big data analytics can do for your business The need for speed in all things related to big data is an enormous
Embedded inside the database. No need for Hadoop or customcode. True real-time analytics done per transaction and in aggregate. On-the-fly linking IP
Operates more like a search engine than a database Scoring and ranking IP allows for fuzzy searching Best-result candidate sets returned Contextual analytics to correctly disambiguate entities Embedded
Big Data overview. Livio Ventura. SICS Software week, Sept 23-25 Cloud and Big Data Day
Big Data overview SICS Software week, Sept 23-25 Cloud and Big Data Day Livio Ventura Big Data European Industry Leader for Telco, Energy and Utilities and Digital Media Agenda some data on Data Big Data
Putting Apache Kafka to Use!
Putting Apache Kafka to Use! Building a Real-time Data Platform for Event Streams! JAY KREPS, CONFLUENT! A Couple of Themes! Theme 1: Rise of Events! Theme 2: Immutability Everywhere! Level! Example! Immutable
Real-time Ad-hoc Analytics on S3 with MemSQL
Real-time Ad-hoc Analytics on S3 with MemSQL Satish Cattamanchi 4INFO Sarvesh Gupta Tavant Technologies September, 2015 ABSTRACT Enterprises are witnessing a rapid increase in data volume with growing
Optimized for the Industrial Internet: GE s Industrial Data Lake Platform
Optimized for the Industrial Internet: GE s Industrial Lake Platform Agenda The Opportunity The Solution The Challenges The Results Solutions for Industrial Internet, deep domain expertise 2 GESoftware.com
Investor Presentation. Second Quarter 2015
Investor Presentation Second Quarter 2015 Note to Investors Certain non-gaap financial information regarding operating results may be discussed during this presentation. Reconciliations of the differences
Hadoop & SAS Data Loader for Hadoop
Turning Data into Value Hadoop & SAS Data Loader for Hadoop Sebastiaan Schaap Frederik Vandenberghe Agenda What s Hadoop SAS Data management: Traditional In-Database In-Memory The Hadoop analytics lifecycle
End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ
End to End Solution to Accelerate Data Warehouse Optimization Franco Flore Alliance Sales Director - APJ Big Data Is Driving Key Business Initiatives Increase profitability, innovation, customer satisfaction,
