Internet of Things Data Analytics - Part 1

Size: px
Start display at page:

Download "Internet of Things Data Analytics - Part 1"

Transcription

1 Internet of Things Data Analytics - Part 1 Introduction to Data Analytics Aveek Dutta Assistant Professor Electrical Engineering and Computer Science University of Kansas aveekd@ku.edu

2 Three Elements of IoT Origin of Data (Week 2 - Networks) Data Acquisition (Week 3 - Sensors / Android) Data Interpretation (Week 4 - Analytics)

3 Objectives Attributes of Data (e.g., shape, size, color) Lifecycle approach to data science and analytics From statistics to analytics Differences between Data analyst and BI analyst Apply techniques and tools to analyze Big Data Create statistical models Lead to actionable results Visualization techniques to clearly communicate insights Overview of MapReduce/Hadoop and in-database analytics (Time permitting)

4 What is YOUR definition of BIG DATA? How big is BIG? Where does it come from? Why is it hard to analyze? What is the value?

5 Why learn this? Source : Datascience@Berkeley A recent study by the McKinsey Global Institute concludes, "a shortage of the analytical and managerial talent necessary to make the most of Big Data is a significant and pressing challenge (for the U.S.)." The report estimates that there will be four to five million jobs in the U.S. requiring data analysis skills by 2018, and that large numbers of positions will only be filled through training or retraining. The authors also project a need for 1.5 million more managers and analysts with deep analytical and technical skills "who can ask the right questions and consume the results of analysis of big data effectively."

6

7 Key characteristics of Big Data Data Volume 44x increases from 2009 to zettabytes to 35.2 zettabytes Processing Complexity Changing data structures Use cases warranting additional transformations and analytical techniques Data Structure Greater variety of data structures to mine and analyze

8 CISCO VNI Click here for the full report - highly recommended Provides the numbers for networked systems only VELOCITY VARIETY (changes with time) VOLUME

9 Data Attributes - Temporal Uncertainty Post Analysis - learn from past events Extract models, fit data points Prediction and Forecast based on derived models, Improve models

10 Data Attributes - Dimensions

11 Data Attributes - Relationship Public Domain,

12 Data Attributes - Correlation

13 Definitions, Drivers and Differences

14 5 V s of Big Data And VALUE

15 Industry Implications Volume Communications/ Topology Network Management Velocity Hybrid SoC (CPUGPU-FPGA) Communications HPC / SAN/Fault Tolerance Variety Applications Classification Veracity Crowdsourcing / Trustworthy Recommendation systems

16 Data Structures: Increasingly Unstructured Data containing a defined data type, format, structure More Structured Structured Example: Transaction data and OLAP Textual data files with a discernable pattern, enabling parsing Semi-Structured Example: XML data files that are self describing and defined by an xml schema Textual data with erratic data formats, can be formatted with effort, tools, time Quasi Structured Unstructured Example: Web clickstream data that may contain some inconsistencies in data values and formats Data that has no inherent structure and is usually stored as different types of files. Example: Text documents, PDFs, images and video

17 Four Main Types of Data Structures Quasi-Structured Data Structured Data Semi-Structured Data View Source com/#hl=en&sugexp=kjrmc&cp=8&gs_id=2m&xhr=t&q=data+scientist&pq=big+data&pf=p& sclient=psyb&source=hp&pbx=1&oq=data+sci&aq=0&aqi=g4&aql=f&gs_sm=&gs_upl=&bav =on.2,or.r_gc.r_pw.,cf.osb&fp=d566e0fbd09c8604&biw=1382&bih=651 Unstructured Data The Red Wheelbarrow, by William Carlos Williams

18 Big Data Ecosystem Data Devices 1 Individual Analytic Services Medical Information Brokers Advertising Marketers Employers Law Enforcement 2 Internet Government Data Collectors Websites 3 Data Aggregators Data Users/Buyers 4 Catalog Co-Ops Phone/TV Media Media Archives Credit Bureaus Retail Financial Banks Government List Brokers Delivery Service Private Investigators /Lawyers

19 Sources of Big Data - An Overview Source: Kapow Software, a Kofax company

20 Data Repositories Data Islands Spreadmarts Isolated data marts Spreadsheets and lowvolume DB s for recordkeeping Analyst dependent on data extracts Data Warehouses Analytic Sandbox Centralized data containers in a purpose-built space Data assets gathered from multiple sources and technologies for analysis Supports BI and reporting, but restricts robust analyses Enables high performance analytics using in-db processing Analyst dependent on IT & DBAs for data access and schema changes Reduces costs associated with data replication into "shadow" file systems Analysts must spend significant time to get extracts from multiple sources Analyst-owned rather than DBA owned

21 Business Intelligence vs. Data Science Predictive Analytics & Data Mining (Data Science) High Data Science BUSINESS VALUE Optimization, predictive modeling, forecasting, statistical analysis Structured/unstructured data, many types of sources, very large data sets Common Questions What if..? What s the optimal scenario for our business? What will happen next? What if these trends continue? Why is this happening? Business Intelligence Business Intelligence Low Past Typical Techniques & Data Types TIME Typical Techniques & Data Types Standard and ad hoc reporting, dashboards, alerts, queries, details on demand Structured data, traditional sources, manageable data sets Common Questions What happened last quarter? How many did we sell? Where is the problem? In which situations? Future

22 Profile of a Data Scientist Quantitative Technical Skeptical Curious & Creative Communicative & Collaborative

23 Big Data Analytics: Industry Examples Health Care Reducing Cost of Care Medical Public Services Internet Government Life Sciences Data Collectors Genomic Mapping IT Infrastructure Preventing Pandemics Unstructured Data Analysis Online Services Phone/TV Retail Financial Social Media for Professionals Module 1: Introduction to BDA 23

24 Big Data Analytics: Healthcare Situation Poor police response and problems with medical care, triggered by shooting of a Rutgers student The event drove local doctor to map crime data and examine local health care Dr. Jeffrey Brenner generated his own crime maps from medical Use of Big Data billing records of 3 hospitals Key Outcomes City hospitals & ER s provided expensive care, low quality care Reduced hospital costs by 56% by realizing that 80% of city s medical costs came from 13% of its residents, mainly low-income or elderly Now offers preventative care over the phone or through home visits

25 Big Data Analytics: Public Services Situation Threat of global pandemics has increased exponentially Pandemics spreads at faster rates, more resistant to antibiotics Created a network of viral listening posts Combines data from viral discovery in the field, research in disease hotspots, and social media trends Use of Big Data Using Big Data to make accurate predictions on spread of new pandemics Key Outcomes Identified a fifth form of human malaria, including its origin Identified why efforts failed to control swine flu Proposing more proactive approaches to preventing outbreaks 25

26 Big Data Analytics: Life Sciences Situation Broad Institute (MIT & Harvard) mapping the Human Genome In 13 yrs, mapped 3 billion genetic base pairs; 8 petabytes Use of Big Data Developed 30+ software packages, now shared publicly, along with the genomic data Key Outcomes Using genetic mappings to identify cellular mutations causing cancer and other serious diseases Innovating how genomic research informs new pharmaceutical drugs

27 Big Data Analytics: IT Infrastructure Situation Explosion of unstructured data required new technology to analyze quickly, and efficiently Doug Cutting created Hadoop to divide large processing tasks into smaller tasks across many computers Use of Big Data Analyzes social media data generated by hundreds of thousands of users Key Outcomes New York Times used Hadoop to transform its entire public archive, from 1851 to 1922, into 11 million PDF files in 24 hrs Applications range from social media, sentiment analysis, wartime chatter, natural language processing

28 Big Data Analytics: Online Services Situation Opportunity to create social media space for professionals Collects and analyzes data from over 100 million users Use of Big Data Adding 1 million new users per week Key Outcomes LinkedIn Skills, InMaps, Job Recommendations, Recruiting Established a diverse data scientist group, as founder believes this is the start of Big Data revolution

29 Data Analytics Lifecycle

30 How to Approach Your Analytics Problems Your Thoughts? How do you currently approach your analytics problems? Do you follow a methodology or some kind of framework? How do you plan for an analytic project? Module 2: Data Analytics Lifecycle 30

31 Value of Using the Data Analytics Lifecycle Focus your time Ensure rigor and completeness Enable better transition to members of the cross-functional analytic teams Repeatable Scale to additional analysts Support validity of findings Module 2: Data Analytics Lifecycle 31

32 Data Analytics Lifecycle Do I have enough information to draft an analytic plan and share for peer review? 1 Discovery 2 6 Operationalize Data Prep Do I have enough good quality data to start building the model? 3 5 Model Planning Communicate Results 4 Is the model robust enough? Have we failed for sure? Model Building Do I have a good idea about the type of model to try? Can I refine the analytic plan?

33 Phase 1: Discovery Do I have enough information to draft an analytic plan and share for peer review? 1 Discovery Learn the Business (Problem) Domain Datadownstream Prep Operationalize Determine amount of domain knowledge needed to interpret results Do I have enough good quality data to start building the model? Determine the general analytic problem type (such as clustering, classification) If you don t know, then conduct initial research to learn about the domain area you ll be analyzing Learn from the past (aka literature review) Communicate Model Have there been previous attempts in the organization to solve this problem? Results Planning If so, why did they fail? Why are we trying again? How have things changed? Is the model robust enough? Have we failed for sure? Model Building Do I have a good idea about the type of model to try? Can I refine the analytic plan?

34 Phase 1: Discovery Do I have enough information to draft an analytic plan and share for peer review? 1 Discovery Resources Operationalize Assess available technology, Available data People (team), time (man-hours) Data Prep Do I have enough good quality data to start building the model? Frame the problem..it is the process of stating the analytics problem to be solved Model Communicate State the analytics problem, why it is important, and to whom Results Planning Clearly articulate the current situation and pain points Objectives What is the goal? What are the Model criteria for success? What s good Doenough? I have a good idea about type model Building model What is the failure criterion (when do we just stop trying or settle forthe what weofhave)? Is the robust to try? Can I refine the enough? Have we analytic plan? failed for sure?

35 Phase 1: Discovery Do I have enough information to draft an analytic plan and share for peer review? 1 Discovery Formulate Initial Hypotheses IH, H1, H2, H3, Hn Operationalize Data Prep Do I have enough good quality data to start building the model? Gather and assess hypotheses from stakeholders and domain experts Preliminary data exploration to inform discussions with stakeholders during the hypothesis forming stage Identify Data Sources Begin Learning the Data Communicate Model Results sources for previewing the data and provide high-level Planning Aggregate understanding Review the raw data Determine the structures and tools needed Model Scope the kind of data needed for this kind of problem Building Is the model robust enough? Have we failed for sure? Do I have a good idea about the type of model to try? Can I refine the analytic plan?

36 Phase 2: Data Preparation Prepare Analytic Sandbox Discovery Bandwidth and network Perform ELT (Extract - Load - Transform) Operationalize Data Conditioning Clean and normalize data Discern what you keep vs. what you discard Survey & Visualize Communicate Overview, zoom & filter, details-on-demand Results Do I have enough information to draft an analytic plan and share for peer review? 2 Data Prep Do I have enough good quality data to start building the model? Model Planning Descriptive Statistics Model Building Do I have a good idea about the type of model Useful Tools for this phase: Is the model robust to try? Can I refine the For Data Transformation & Cleansing: SQL, Hadoop, MapReduce, Alpine Miner enough? Have we analytic plan? Visualization: R (base package, ggplot and lattice), GnuPlot, Ggobi/Rggobi, Spotfire, Tableau failed for sure?

37 Phase 3: Model Planning Do I have enough information to draft an analytic plan and share for peer review? Discovery Determine Methods Select methods based on hypotheses, data structure and volume Operationalize Data Prep Do I have enough good quality data to start building the model? Ensure techniques and approach will meet business objectives Variable Selection Inputs from stakeholders and domain experts Communicate Results Capture essence of the predictors, leverage a technique for dimensionality reduction Iterative testing to confirm the most significant variables Model Useful Tools for this phase: R/PostgreSQL, SQL Analytics, Building the model robust Alpine Is Miner, SAS/ACCESS, SPSS/OBDC enough? Have we failed for sure? 3 Model Planning Do I have a good idea about the type of model to try? Can I refine the analytic plan?

38 Phase 4: Model Building Do I have enough information to draft an analytic plan and share for peer review? Discovery Develop data sets for testing, training, and production purposes Get the best environment you can for building models and workflows fast hardware, parallel processing Do I have enough good Need to ensure that the model data is sufficiently robust for the model and analytical quality data to techniques start building Smaller, test sets for validating approach, training set for initial experiments the model? Operationalize Data Prep Communicate Results Model Planning 4 Is the model robust enough? Have we failed for sure? Model Building Do I have a good idea about the type of model to try? Can I refine the analytic plan? Useful Tools for this phase: R, PL/R, SQL, Alpine Miner, SAS Enterprise Miner

39 Phase 5: Communicate Results Do I have enough information to draft an analytic plan and share for peer review? Discovery Do I have enough good quality data to start building the model? Did we succeed? Did we fail? Operationalize 5 Communicate Results Is the model robust enough? Have we failed for sure? Data Prep Interpret the results Compare to IH s from Phase 1 Identify key findings Model Planning Quantify business value Summarizing findings (depends on audience) Model Building Do I have a good idea about the type of model to try? Can I refine the analytic plan?

40 Phase 6: Operationalize Do I have enough information to draft an analytic plan and share for peer review? Discovery 6 Operationalize Communicate Results Is the model robust enough? Have we failed for sure? Run a pilot Assess the benefits Do I have enough good quality data to start building the model? Data Prep Provide final deliverables Implement the model in the production environment Model Define process to update, retrain, and retire Planning the model, as needed Model Building Do I have a good idea about the type of model to try? Can I refine the analytic plan?

41 We should be confident in answering... What is Big-Data? Why is it hard and challenging? What are attributes of data? What are 5 V s of analytics? Structure, source and management of data What are the differences with BI What are the components of Data Analytics Lifecycle Discovery, Data Prep, Model Planning, Model Building, Communicate Results, Operationalize More Reading - Free ebooks on Data Analysis

Big Data Analytics. David Dietrich, EMC Education Services. April 4, 2013

Big Data Analytics. David Dietrich, EMC Education Services. April 4, 2013 Big Data Analytics Harvard-Smithsonian Center for Astrophysics Data Science Training for Librarians April 4, 2013 David Dietrich, EMC Education Services I ll go into a company and say, What data problems

More information

BIG DATA TECHNOLOGY. Hadoop Ecosystem

BIG DATA TECHNOLOGY. Hadoop Ecosystem BIG DATA TECHNOLOGY Hadoop Ecosystem Agenda Background What is Big Data Solution Objective Introduction to Hadoop Hadoop Ecosystem Hybrid EDW Model Predictive Analysis using Hadoop Conclusion What is Big

More information

This Symposium brought to you by www.ttcus.com

This Symposium brought to you by www.ttcus.com This Symposium brought to you by www.ttcus.com Linkedin/Group: Technology Training Corporation @Techtrain Technology Training Corporation www.ttcus.com Big Data Analytics as a Service (BDAaaS) Big Data

More information

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata BIG DATA: FROM HYPE TO REALITY Leandro Ruiz Presales Partner for C&LA Teradata Evolution in The Use of Information Action s ACTIVATING MAKE it happen! Insights OPERATIONALIZING WHAT IS happening now? PREDICTING

More information

How To Write A Data Analysis Project

How To Write A Data Analysis Project Section 1. Data Analytics Lifecycle Overview The Data Analytics Lifecycle is designed specifically for Big Data problems and data science projects. The lifecycle has six phases, and project work can occur

More information

Using Tableau Software with Hortonworks Data Platform

Using Tableau Software with Hortonworks Data Platform Using Tableau Software with Hortonworks Data Platform September 2013 2013 Hortonworks Inc. http:// Modern businesses need to manage vast amounts of data, and in many cases they have accumulated this data

More information

Introduction to Big Data Analytics p. 1 Big Data Overview p. 2 Data Structures p. 5 Analyst Perspective on Data Repositories p.

Introduction to Big Data Analytics p. 1 Big Data Overview p. 2 Data Structures p. 5 Analyst Perspective on Data Repositories p. Introduction p. xvii Introduction to Big Data Analytics p. 1 Big Data Overview p. 2 Data Structures p. 5 Analyst Perspective on Data Repositories p. 9 State of the Practice in Analytics p. 11 BI Versus

More information

How To Create A Data Science System

How To Create A Data Science System Enhance Collaboration and Data Sharing for Faster Decisions and Improved Mission Outcome Richard Breakiron Senior Director, Cyber Solutions Rbreakiron@vion.com Office: 571-353-6127 / Cell: 803-443-8002

More information

Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.

Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved. Big Data Analytics 1 Priority Discussion Topics What are the most compelling business drivers behind big data analytics? Do you have or expect to have data scientists on your staff, and what will be their

More information

Transforming the Telecoms Business using Big Data and Analytics

Transforming the Telecoms Business using Big Data and Analytics Transforming the Telecoms Business using Big Data and Analytics Event: ICT Forum for HR Professionals Venue: Meikles Hotel, Harare, Zimbabwe Date: 19 th 21 st August 2015 AFRALTI 1 Objectives Describe

More information

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES Relational vs. Non-Relational Architecture Relational Non-Relational Rational Predictable Traditional Agile Flexible Modern 2 Agenda Big Data

More information

Are You Ready for Big Data?

Are You Ready for Big Data? Are You Ready for Big Data? Jim Gallo National Director, Business Analytics February 11, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?

More information

Data Refinery with Big Data Aspects

Data Refinery with Big Data Aspects International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 655-662 International Research Publications House http://www. irphouse.com /ijict.htm Data

More information

BIG DATA & ANALYTICS. Transforming the business and driving revenue through big data and analytics

BIG DATA & ANALYTICS. Transforming the business and driving revenue through big data and analytics BIG DATA & ANALYTICS Transforming the business and driving revenue through big data and analytics Collection, storage and extraction of business value from data generated from a variety of sources are

More information

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica

More information

Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing

More information

HSD. W Business Analytics (M.Sc.) IT in Business Analytics. IT Applications in Business Analytics SS2016 / 01 Introduction Thomas Zeutschler

HSD. W Business Analytics (M.Sc.) IT in Business Analytics. IT Applications in Business Analytics SS2016 / 01 Introduction Thomas Zeutschler Hochschule Düsseldorf University of Applied Scienses Fachbereich Wirtschaftswissenschaften W Business Analytics (M.Sc.) IT in Business Analytics IT Applications in Business Analytics SS2016 / 01 Introduction

More information

HDP Enabling the Modern Data Architecture

HDP Enabling the Modern Data Architecture HDP Enabling the Modern Data Architecture Herb Cunitz President, Hortonworks Page 1 Hortonworks enables adoption of Apache Hadoop through HDP (Hortonworks Data Platform) Founded in 2011 Original 24 architects,

More information

DRIVING THE CHANGE ENABLING TECHNOLOGY FOR FINANCE 15 TH FINANCE TECH FORUM SOFIA, BULGARIA APRIL 25 2013

DRIVING THE CHANGE ENABLING TECHNOLOGY FOR FINANCE 15 TH FINANCE TECH FORUM SOFIA, BULGARIA APRIL 25 2013 DRIVING THE CHANGE ENABLING TECHNOLOGY FOR FINANCE 15 TH FINANCE TECH FORUM SOFIA, BULGARIA APRIL 25 2013 BRAD HATHAWAY REGIONAL LEADER FOR INFORMATION MANAGEMENT AGENDA Major Technology Trends Focus on

More information

Big Data & Analytics. The. Deal. About. Jacob Büchler jbuechler@dk.ibm.com Cand. Polit. IBM Denmark, Solution Exec. 2013 IBM Corporation

Big Data & Analytics. The. Deal. About. Jacob Büchler jbuechler@dk.ibm.com Cand. Polit. IBM Denmark, Solution Exec. 2013 IBM Corporation The Big Data & Analytics Deal About Jacob Büchler jbuechler@dk.ibm.com Cand. Polit. IBM Denmark, Solution Exec. 1 Big Data is All Data from Everywhere Big Data Is Becoming The Next Natural Resource We

More information

Understanding Data Warehouse Needs Session #1568 Trends, Issues and Capabilities

Understanding Data Warehouse Needs Session #1568 Trends, Issues and Capabilities Understanding Data Warehouse Needs Session #1568 Trends, Issues and Capabilities Dr. Frank Capobianco Advanced Analytics Consultant Teradata Corporation Tracy Spadola CPCU, CIDM, FIDM Practice Lead - Insurance

More information

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

More information

Big Data and Analytics in Government

Big Data and Analytics in Government Big Data and Analytics in Government Nov 29, 2012 Mark Johnson Director, Engineered Systems Program 2 Agenda What Big Data Is Government Big Data Use Cases Building a Complete Information Solution Conclusion

More information

How Big Data is Different

How Big Data is Different FALL 2012 VOL.54 NO.1 Thomas H. Davenport, Paul Barth and Randy Bean How Big Data is Different Brought to you by Please note that gray areas reflect artwork that has been intentionally removed. The substantive

More information

Big Data e BI voltados a estratégias em governo e definição de políticas e serviços públicos

Big Data e BI voltados a estratégias em governo e definição de políticas e serviços públicos Big Data e BI voltados a estratégias em governo e definição de políticas e serviços públicos Claudio Chauke Executive Partner claudio.chauke@gartner.com 23 Gartner, Inc. and/or its affiliates. All rights

More information

Business Analytics In a Big Data World Ted Malone Solutions Architect Data Platform and Cloud Microsoft Federal

Business Analytics In a Big Data World Ted Malone Solutions Architect Data Platform and Cloud Microsoft Federal Business Analytics In a Big Data World Ted Malone Solutions Architect Data Platform and Cloud Microsoft Federal Information has gone from scarce to super-abundant. That brings huge new benefits. The Economist

More information

Majed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016

Majed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016 Big Data! Majed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016 Big Data: Data Analytical Tools for Decision Support 2 Outline Introduce

More information

UNLEASHING THE VALUE OF THE TERADATA UNIFIED DATA ARCHITECTURE WITH ALTERYX

UNLEASHING THE VALUE OF THE TERADATA UNIFIED DATA ARCHITECTURE WITH ALTERYX UNLEASHING THE VALUE OF THE TERADATA UNIFIED DATA ARCHITECTURE WITH ALTERYX 1 Successful companies know that analytics are key to winning customer loyalty, optimizing business processes and beating their

More information

Are You Ready for Big Data?

Are You Ready for Big Data? Are You Ready for Big Data? Jim Gallo National Director, Business Analytics April 10, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?

More information

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics

More information

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved.

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved. Mike Maxey Senior Director Product Marketing Greenplum A Division of EMC 1 Greenplum Becomes the Foundation of EMC s Big Data Analytics (July 2010) E M C A C Q U I R E S G R E E N P L U M For three years,

More information

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

Chapter 5. Warehousing, Data Acquisition, Data. Visualization Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives

More information

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance IBM s statements regarding its plans, directions, and intent are subject to change or withdrawal without notice

More information

Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce

Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce Analytics in the Cloud Peter Sirota, GM Elastic MapReduce Data-Driven Decision Making Data is the new raw material for any business on par with capital, people, and labor. What is Big Data? Terabytes of

More information

Why Big Data Analytics?

Why Big Data Analytics? An ebook by Datameer Why Big Data Analytics? Three Business Challenges Best Addressed Using Big Data Analytics It s hard to overstate the importance of data for businesses today. It s the lifeline of any

More information

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014 5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for

More information

Data Isn't Everything

Data Isn't Everything June 17, 2015 Innovate Forward Data Isn't Everything The Challenges of Big Data, Advanced Analytics, and Advance Computation Devices for Transportation Agencies. Using Data to Support Mission, Administration,

More information

IBM Big Data Platform

IBM Big Data Platform IBM Big Data Platform Turning big data into smarter decisions Stefan Söderlund. IBM kundarkitekt, Försvarsmakten Sesam vår-seminarie Big Data, Bigga byte kräver Pigga Hertz! May 16, 2013 By 2015, 80% of

More information

Building Analytics and Big Data Capabilities Tom Davenport CDB Annual Conference May 23, 2012

Building Analytics and Big Data Capabilities Tom Davenport CDB Annual Conference May 23, 2012 Building Analytics and Big Data Capabilities Tom Davenport CDB Annual Conference May 23, 2012 A Bright Idea Informatics/Analytics on Small and Big Data It works for: Old companies (GE, P&G, Marriott, Bank

More information

Empowering the Masses with Analytics

Empowering the Masses with Analytics Empowering the Masses with Analytics THE GAP FOR BUSINESS USERS For a discussion of bridging the gap from the perspective of a business user, read Three Ways to Use Data Science. Ask the average business

More information

Demonstration of SAP Predictive Analysis 1.0, consumption from SAP BI clients and best practices

Demonstration of SAP Predictive Analysis 1.0, consumption from SAP BI clients and best practices September 10-13, 2012 Orlando, Florida Demonstration of SAP Predictive Analysis 1.0, consumption from SAP BI clients and best practices Vishwanath Belur, Product Manager, SAP Predictive Analysis Learning

More information

Ramesh Bhashyam Teradata Fellow Teradata Corporation bhashyam.ramesh@teradata.com

Ramesh Bhashyam Teradata Fellow Teradata Corporation bhashyam.ramesh@teradata.com Challenges of Handling Big Data Ramesh Bhashyam Teradata Fellow Teradata Corporation bhashyam.ramesh@teradata.com Trend Too much information is a storage issue, certainly, but too much information is also

More information

Advanced In-Database Analytics

Advanced In-Database Analytics Advanced In-Database Analytics Tallinn, Sept. 25th, 2012 Mikko-Pekka Bertling, BDM Greenplum EMEA 1 That sounds complicated? 2 Who can tell me how best to solve this 3 What are the main mathematical functions??

More information

Cisco Data Preparation

Cisco Data Preparation Data Sheet Cisco Data Preparation Unleash your business analysts to develop the insights that drive better business outcomes, sooner, from all your data. As self-service business intelligence (BI) and

More information

Big Data Analytics Best Practices

Big Data Analytics Best Practices 1 Big Data Analytics Best Practices Marshall Presser Federal Field CTO Greenplum 2 Big Data Makes the Mainstream 3 WHAT DOES IT TAKE? 4 1. New Applications MADlib 5 2. New Skill Sets -- Data Science 6

More information

Agile Business Intelligence Data Lake Architecture

Agile Business Intelligence Data Lake Architecture Agile Business Intelligence Data Lake Architecture TABLE OF CONTENTS Introduction... 2 Data Lake Architecture... 2 Step 1 Extract From Source Data... 5 Step 2 Register And Catalogue Data Sets... 5 Step

More information

Ganzheitliches Datenmanagement

Ganzheitliches Datenmanagement Ganzheitliches Datenmanagement für Hadoop Michael Kohs, Senior Sales Consultant @mikchaos The Problem with Big Data Projects in 2016 Relational, Mainframe Documents and Emails Data Modeler Data Scientist

More information

Getting Value from Big Data with Analytics

Getting Value from Big Data with Analytics Getting Value from Big Data with Analytics Edward Roske, CEO Oracle ACE Director info@interrel.com BLOG: LookSmarter.blogspot.com WEBSITE: www.interrel.com TWITTER: Eroske About interrel Reigning Oracle

More information

Big Data Challenges and Success Factors. Deloitte Analytics Your data, inside out

Big Data Challenges and Success Factors. Deloitte Analytics Your data, inside out Big Data Challenges and Success Factors Deloitte Analytics Your data, inside out Big Data refers to the set of problems and subsequent technologies developed to solve them that are hard or expensive to

More information

Demystifying Big Data Government Agencies & The Big Data Phenomenon

Demystifying Big Data Government Agencies & The Big Data Phenomenon Demystifying Big Data Government Agencies & The Big Data Phenomenon Today s Discussion If you only remember four things 1 Intensifying business challenges coupled with an explosion in data have pushed

More information

The 4 Pillars of Technosoft s Big Data Practice

The 4 Pillars of Technosoft s Big Data Practice beyond possible Big Use End-user applications Big Analytics Visualisation tools Big Analytical tools Big management systems The 4 Pillars of Technosoft s Big Practice Overview Businesses have long managed

More information

How To Use Big Data For Business

How To Use Big Data For Business Big Data Maturity - The Photo and The Movie Mike Ferguson Managing Director, Intelligent Business Strategies BA4ALL Big Data & Analytics Insight Conference Stockholm, May 2015 About Mike Ferguson Mike

More information

How To Get An Advantage From Analytics

How To Get An Advantage From Analytics Business Analytics Gaining the Advantage Clients Collective Intelligence is one of 100 partners Nationally Surfacing Previously Siloed Data Allow Users Direct Access to Data Streamline Business Practices

More information

BIG DATA STRATEGY. Rama Kattunga Chair at American institute of Big Data Professionals. Building Big Data Strategy For Your Organization

BIG DATA STRATEGY. Rama Kattunga Chair at American institute of Big Data Professionals. Building Big Data Strategy For Your Organization BIG DATA STRATEGY Rama Kattunga Chair at American institute of Big Data Professionals Building Big Data Strategy For Your Organization In this session What is Big Data? Prepare your organization Building

More information

Beyond Watson: The Business Implications of Big Data

Beyond Watson: The Business Implications of Big Data Beyond Watson: The Business Implications of Big Data Shankar Venkataraman IBM Program Director, STSM, Big Data August 10, 2011 The World is Changing and Becoming More INSTRUMENTED INTERCONNECTED INTELLIGENT

More information

Introduction to Big Data Analytics

Introduction to Big Data Analytics 1 Introduction to Big Data Analytics Key Concepts Big Data overview State of the practice in analytics Business Intelligence versus Data Science Key roles for the new Big Data ecosystem The Data Scientist

More information

Using Big Data Analytics to

Using Big Data Analytics to Using Big Data Analytics to Improve Government Performance Arun Chandrasekaran Gartner is a registered trademark of Gartner, Inc. or its affiliates. This publication may not be reproduced or distributed

More information

Big Analytics: A Next Generation Roadmap

Big Analytics: A Next Generation Roadmap Big Analytics: A Next Generation Roadmap Cloud Developers Summit & Expo: October 1, 2014 Neil Fox, CTO: SoftServe, Inc. 2014 SoftServe, Inc. Remember Life Before The Web? 1994 Even Revolutions Take Time

More information

How To Understand Business Intelligence

How To Understand Business Intelligence An Introduction to Advanced PREDICTIVE ANALYTICS BUSINESS INTELLIGENCE DATA MINING ADVANCED ANALYTICS An Introduction to Advanced. Where Business Intelligence Systems End... and Predictive Tools Begin

More information

Advanced Big Data Analytics with R and Hadoop

Advanced Big Data Analytics with R and Hadoop REVOLUTION ANALYTICS WHITE PAPER Advanced Big Data Analytics with R and Hadoop 'Big Data' Analytics as a Competitive Advantage Big Analytics delivers competitive advantage in two ways compared to the traditional

More information

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ End to End Solution to Accelerate Data Warehouse Optimization Franco Flore Alliance Sales Director - APJ Big Data Is Driving Key Business Initiatives Increase profitability, innovation, customer satisfaction,

More information

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here

Data Virtualization for Agile Business Intelligence Systems and Virtual MDM. To View This Presentation as a Video Click Here Data Virtualization for Agile Business Intelligence Systems and Virtual MDM To View This Presentation as a Video Click Here Agenda Data Virtualization New Capabilities New Challenges in Data Integration

More information

Making big data simple with Databricks

Making big data simple with Databricks Making big data simple with Databricks We are Databricks, the company behind Spark Founded by the creators of Apache Spark in 2013 Data 75% Share of Spark code contributed by Databricks in 2014 Value Created

More information

Bringing Strategy to Life Using an Intelligent Data Platform to Become Data Ready. Informatica Government Summit April 23, 2015

Bringing Strategy to Life Using an Intelligent Data Platform to Become Data Ready. Informatica Government Summit April 23, 2015 Bringing Strategy to Life Using an Intelligent Platform to Become Ready Informatica Government Summit April 23, 2015 Informatica Solutions Overview Power the -Ready Enterprise Government Imperatives Improve

More information

Oracle Big Data Discovery Unlock Potential in Big Data Reservoir

Oracle Big Data Discovery Unlock Potential in Big Data Reservoir Oracle Big Data Discovery Unlock Potential in Big Data Reservoir Gokula Mishra Premjith Balakrishnan Business Analytics Product Group September 29, 2014 Copyright 2014, Oracle and/or its affiliates. All

More information

OpenChorus: Building a Tool-Chest for Big Data Science

OpenChorus: Building a Tool-Chest for Big Data Science OpenChorus: Building a Tool-Chest for Big Data Science Milind Bhandarkar Chief Scientist, Machine Learning Platforms EMC Greenplum 1 Agenda! Tools for Data Science! Data Science Workflow! Greenplum OpenChorus!

More information

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA OLAP and OLTP AMIT KUMAR BINDAL Associate Professor Databases Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age Information, which is created by data,

More information

Big Data + Open Source + Collaboration= Big Innovation

Big Data + Open Source + Collaboration= Big Innovation Big Data + Open Source + Collaboration= Big Innovation Luke Lonergan Co-founder Greenplum & CTO, EMC Data Computing Division May 17, 2011 1 To make step-function changes, revolutionary changes, seems to

More information

locuz.com Big Data Services

locuz.com Big Data Services locuz.com Big Data Services Big Data At Locuz, we help the enterprise move from being a data-limited to a data-driven one, thereby enabling smarter, faster decisions that result in better business outcome.

More information

Advanced Analytic Dashboards at Lands End. Brenda Olson and John Kruk April 2004

Advanced Analytic Dashboards at Lands End. Brenda Olson and John Kruk April 2004 Advanced Analytic Dashboards at Lands End Brenda Olson and John Kruk April 2004 Presentation Information Presenter: Brenda Olson and John Kruk Company: Lands End Contributors: Lands End EDW/BI Teams Title:

More information

NEWLY EMERGING BEST PRACTICES FOR BIG DATA

NEWLY EMERGING BEST PRACTICES FOR BIG DATA 2000-2012 Kimball Group. All rights reserved. Page 1 NEWLY EMERGING BEST PRACTICES FOR BIG DATA Ralph Kimball Informatica October 2012 Ralph Kimball Big is Being Monetized Big data is the second era of

More information

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2 Volume 6, Issue 3, March 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue

More information

Getting Started Practical Input For Your Roadmap

Getting Started Practical Input For Your Roadmap Getting Started Practical Input For Your Roadmap Mike Ferguson Managing Director, Intelligent Business Strategies BA4ALL Big Data & Analytics Insight Conference Stockholm, May 2015 About Mike Ferguson

More information

Business Intelligence: Effective Decision Making

Business Intelligence: Effective Decision Making Business Intelligence: Effective Decision Making Bellevue College Linda Rumans IT Instructor, Business Division Bellevue College lrumans@bellevuecollege.edu Current Status What do I do??? How do I increase

More information

CHAPTER SIX DATA. Business Intelligence. 2011 The McGraw-Hill Companies, All Rights Reserved

CHAPTER SIX DATA. Business Intelligence. 2011 The McGraw-Hill Companies, All Rights Reserved CHAPTER SIX DATA Business Intelligence 2011 The McGraw-Hill Companies, All Rights Reserved 2 CHAPTER OVERVIEW SECTION 6.1 Data, Information, Databases The Business Benefits of High-Quality Information

More information

BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE

BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE BIG DATA: FIVE TACTICS TO MODERNIZE YOUR DATA WAREHOUSE Current technology for Big Data allows organizations to dramatically improve return on investment (ROI) from their existing data warehouse environment.

More information

Investor Presentation. Second Quarter 2015

Investor Presentation. Second Quarter 2015 Investor Presentation Second Quarter 2015 Note to Investors Certain non-gaap financial information regarding operating results may be discussed during this presentation. Reconciliations of the differences

More information

Big Data Are You Ready? Jorge Plascencia Solution Architect Manager

Big Data Are You Ready? Jorge Plascencia Solution Architect Manager Big Data Are You Ready? Jorge Plascencia Solution Architect Manager Big Data: The Datafication Of Everything Thoughts Devices Processes Thoughts Things Processes Run the Business Organize data to do something

More information

HDP Hadoop From concept to deployment.

HDP Hadoop From concept to deployment. HDP Hadoop From concept to deployment. Ankur Gupta Senior Solutions Engineer Rackspace: Page 41 27 th Jan 2015 Where are you in your Hadoop Journey? A. Researching our options B. Currently evaluating some

More information

MapR: Best Solution for Customer Success

MapR: Best Solution for Customer Success 2015 MapR Technologies 2015 MapR Technologies 1 MapR: Best Solution for Customer Success Best Product High Growth 700+ Customers Premier Investors Apache Open Source 2X 2X Growth In Direct Customers Growth

More information

USING BIG DATA FOR INTELLIGENT BUSINESSES

USING BIG DATA FOR INTELLIGENT BUSINESSES HENRI COANDA AIR FORCE ACADEMY ROMANIA INTERNATIONAL CONFERENCE of SCIENTIFIC PAPER AFASES 2015 Brasov, 28-30 May 2015 GENERAL M.R. STEFANIK ARMED FORCES ACADEMY SLOVAK REPUBLIC USING BIG DATA FOR INTELLIGENT

More information

BIG DATA. Value 8/14/2014 WHAT IS BIG DATA? THE 5 V'S OF BIG DATA WHAT IS BIG DATA?

BIG DATA. Value 8/14/2014 WHAT IS BIG DATA? THE 5 V'S OF BIG DATA WHAT IS BIG DATA? WHAT IS BIG DATA? BIG DATA DR. KLARA NELSON THE UNIVERSITY OF TAMPA "Volumes of data that are unusually large, or types of data that are unstructured" Thomas Davenport, Keeping Up with the Quants, 2013,

More information

Big Data Big Data/Data Analytics & Software Development

Big Data Big Data/Data Analytics & Software Development Big Data Big Data/Data Analytics & Software Development Danairat T. danairat@gmail.com, 081-559-1446 1 Agenda Big Data Overview Business Cases and Benefits Hadoop Technology Architecture Big Data Development

More information

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Datenverwaltung im Wandel - Building an Enterprise Data Hub with Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees

More information

Artur Borycki. Director International Solutions Marketing

Artur Borycki. Director International Solutions Marketing Artur Borycki Director International Solutions Agenda! Evolution of Teradata s Unified Architecture Analytical and Workloads! Teradata s Reference Information Architecture Evolution of Teradata s" Unified

More information

How to Enhance Traditional BI Architecture to Leverage Big Data

How to Enhance Traditional BI Architecture to Leverage Big Data B I G D ATA How to Enhance Traditional BI Architecture to Leverage Big Data Contents Executive Summary... 1 Traditional BI - DataStack 2.0 Architecture... 2 Benefits of Traditional BI - DataStack 2.0...

More information

Extend your analytic capabilities with SAP Predictive Analysis

Extend your analytic capabilities with SAP Predictive Analysis September 9 11, 2013 Anaheim, California Extend your analytic capabilities with SAP Predictive Analysis Charles Gadalla Learning Points Advanced analytics strategy at SAP Simplifying predictive analytics

More information

Data Warehouse design

Data Warehouse design Data Warehouse design Design of Enterprise Systems University of Pavia 21/11/2013-1- Data Warehouse design DATA PRESENTATION - 2- BI Reporting Success Factors BI platform success factors include: Performance

More information

Data Integration Checklist

Data Integration Checklist The need for data integration tools exists in every company, small to large. Whether it is extracting data that exists in spreadsheets, packaged applications, databases, sensor networks or social media

More information

MDM and Data Warehousing Complement Each Other

MDM and Data Warehousing Complement Each Other Master Management MDM and Warehousing Complement Each Other Greater business value from both 2011 IBM Corporation Executive Summary Master Management (MDM) and Warehousing (DW) complement each other There

More information

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop ISSN: 2454-2377, October 2015 Big Data and Hadoop Simmi Bagga 1 Satinder Kaur 2 1 Assistant Professor, Sant Hira Dass Kanya MahaVidyalaya, Kala Sanghian, Distt Kpt. INDIA E-mail: simmibagga12@gmail.com

More information

Big Data and the Data Lake. February 2015

Big Data and the Data Lake. February 2015 Big Data and the Data Lake February 2015 My Vision: Our Mission Data Intelligence is a broad term that describes the real, meaningful insights that can be extracted from your data truths that you can act

More information

TEXT ANALYTICS INTEGRATION

TEXT ANALYTICS INTEGRATION TEXT ANALYTICS INTEGRATION A TELECOMMUNICATIONS BEST PRACTICES CASE STUDY VISION COMMON ANALYTICAL ENVIRONMENT Structured Unstructured Analytical Mining Text Discovery Text Categorization Text Sentiment

More information

3 MUST-HAVES IN PUBLIC SECTOR INFORMATION GOVERNANCE

3 MUST-HAVES IN PUBLIC SECTOR INFORMATION GOVERNANCE EXECUTIVE SUMMARY Information governance incorporates the policies, controls and information lifecycle management processes organizations and government agencies utilize to control cost and risk. With

More information

IDC MaturityScape Benchmark: Big Data and Analytics in Government. Adelaide O Brien Research Director IDC Government Insights June 20, 2014

IDC MaturityScape Benchmark: Big Data and Analytics in Government. Adelaide O Brien Research Director IDC Government Insights June 20, 2014 IDC MaturityScape Benchmark: Big Data and Analytics in Government Adelaide O Brien Research Director IDC Government Insights June 20, 2014 IDC MaturityScape Benchmark: Big Data and Analytics in Government

More information

Luncheon Webinar Series May 13, 2013

Luncheon Webinar Series May 13, 2013 Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration

More information

What's New in SAS Data Management

What's New in SAS Data Management Paper SAS034-2014 What's New in SAS Data Management Nancy Rausch, SAS Institute Inc., Cary, NC; Mike Frost, SAS Institute Inc., Cary, NC, Mike Ames, SAS Institute Inc., Cary ABSTRACT The latest releases

More information

Creating a Business Intelligence Competency Center to Accelerate Healthcare Performance Improvement

Creating a Business Intelligence Competency Center to Accelerate Healthcare Performance Improvement Creating a Business Intelligence Competency Center to Accelerate Healthcare Performance Improvement Bruce Eckert, National Practice Director, Advisory Group Ramesh Sakiri, Executive Consultant, Healthcare

More information

Introducing Oracle Exalytics In-Memory Machine

Introducing Oracle Exalytics In-Memory Machine Introducing Oracle Exalytics In-Memory Machine Jon Ainsworth Director of Business Development Oracle EMEA Business Analytics 1 Copyright 2011, Oracle and/or its affiliates. All rights Agenda Topics Oracle

More information

Data Discovery, Analytics, and the Enterprise Data Hub

Data Discovery, Analytics, and the Enterprise Data Hub Data Discovery, Analytics, and the Enterprise Data Hub Version: 101 Table of Contents Summary 3 Used Data and Limitations of Legacy Analytic Architecture 3 The Meaning of Data Discovery & Analytics 4 Machine

More information