SQLSaturday #399 Sacramento 25 July, 2015. Big Data Analytics with Excel

Similar documents
NZ BI User Group Auckland 18 September, Big Data Analytics with PowerPivot and Power View

Big Data Analytics with PowerPivot and Power View

BIG DATA TRENDS AND TECHNOLOGIES

Microsoft Big Data. Solution Brief

How To Extend An Enterprise Bio Solution

Bringing Big Data to People

Next Generation Microsoft Business Analytics

Bring your data to life with Microsoft Power BI. Peter Myers Bitwise Solutions

Microsoft Big Data Solutions. Anar Taghiyev P-TSP

Modernizing Your Data Warehouse for Hadoop

Designing Self-Service Business Intelligence and Big Data Solutions

SQL Server 2014 Faster Insights from any Data Level 300

SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM

Course 20467: Designing Self-Service Business Intelligence and Big Data Solutions

Microsoft SQL Server 2012 with Hadoop

Native Connectivity to Big Data Sources in MicroStrategy 10. Presented by: Raja Ganapathy

How To Handle Big Data With A Data Scientist

Tap into Hadoop and Other No SQL Sources

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

Big Data on Microsoft Platform

Introducing the Reimagined Power BI Platform. Jen Underwood, Microsoft

Hadoop and Data Warehouse Friends, Enemies or Profiteers? What about Real Time?

The Inside Scoop on Hadoop

Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy. Presented by: Jeffrey Zhang and Trishla Maru

Using Tableau Software with Hortonworks Data Platform

#mstrworld. Tapping into Hadoop and NoSQL Data Sources in MicroStrategy. Presented by: Trishla Maru. #mstrworld

BIG DATA What it is and how to use?

SAP and Hortonworks Reference Architecture

MS 10977B Upgrading Your SQL Server Skills to Microsoft SQL Server 2014

Faster Insights from Any Data Technical White Paper

Updating Your SQL Server Skills to Microsoft SQL Server 2014

Business Analytics In a Big Data World Ted Malone Solutions Architect Data Platform and Cloud Microsoft Federal

Azure Data Lake Analytics

BIG DATA TECHNOLOGY. Hadoop Ecosystem

Big Data Scenario mit Power BI vs. SAP HANA Gerhard Brückl

Creating a universe on Hive with Hortonworks HDP 2.0

10977B: Updating Your SQL Server Skills to Microsoft SQL Server 2014

Big Data: Making Sense of it all!

Assignment # 1 (Cloud Computing Security)

Updating Your SQL Server Skills to Microsoft SQL Server 2014

SQL Server What s New? Christopher Speer. Technology Solution Specialist (SQL Server, BizTalk Server, Power BI, Azure) v-cspeer@microsoft.

Upgrading Your SQL Server Skills to Microsoft SQL Server 2014 va

Big Data Realities Hadoop in the Enterprise Architecture

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc All Rights Reserved

SQL 2012: Bringing Big Data to the Desktop

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Raul F. Chong Senior program manager Big data, DB2, and Cloud IM Cloud Computing Center of Competence - IBM Toronto Lab, Canada

Updating Your SQL Server Skills from Microsoft SQL Server 2008 to Microsoft SQL Server 2014

TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP

Please give me your feedback

Are You Ready for Big Data?

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: Vol. 1, Issue 6, October Big Data and Hadoop

Course 10977: Updating Your SQL Server Skills to Microsoft SQL Server 2014

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

Big Data Are You Ready? Thomas Kyte

Are You Ready for Big Data?

Upgrading Your SQL Server Skills to Microsoft SQL Server 2014

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

QlikView, Creating Business Discovery Application using HDP V1.0 March 13, 2014

Big Data Analytics. with EMC Greenplum and Hadoop. Big Data Analytics. Ofir Manor Pre Sales Technical Architect EMC Greenplum

BIG DATA CHALLENGES AND PERSPECTIVES

Whitepaper: Solution Overview - Breakthrough Insight. Published: March 7, Applies to: Microsoft SQL Server Summary:

Manifest for Big Data Pig, Hive & Jaql

Unified Batch & Stream Processing Platform

Step by Step: Big Data Technology. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 25 August 2015

So What s the Big Deal?

CREATING PACKAGED IP FOR BUSINESS ANALYTICS PROJECTS

Course 10977A: Updating Your SQL Server Skills to Microsoft SQL Server 2014

Big Data Processing: Past, Present and Future

Microsoft Analytics Platform System. Solution Brief

Business Intelligence in Excel 2013 Excel, PowerPivot and Power View. Stéphane Fréchette Friday April 26, 2013

Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce

Implementing Data Models and Reports with Microsoft SQL Server 2012 MOC 10778

Simplifying Big Data Analytics: Unifying Batch and Stream Processing. John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!!

Implement Hadoop jobs to extract business value from large and varied data sets

OPEN MODERN DATA ARCHITECTURE FOR FINANCIAL SERVICES RISK MANAGEMENT

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

Reporting Fundamentals for Programmers

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

Hadoop in the Hybrid Cloud

Upgrading Your SQL Server Skills to Microsoft SQL Server 2014

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

Information Builders Mission & Value Proposition

Updating Your SQL Server Skills to Microsoft SQL Server 2014 (10977) H8B96S

Native Connectivity to Big Data Sources in MSTR 10

HDFS. Hadoop Distributed File System

Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP

Big Data Buzzwords From A to Z. By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012

Transcription:

SQLSaturday #399 Sacramento 25 July, 2015 Big Data Analytics with Excel

Presenter Introduction Peter Myers Independent BI Expert Bitwise Solutions BBus, SQL Server MCSE, SQL Server MVP since 2007 Experienced in designing, developing and maintaining Microsoft database and application solutions since 1997 Focuses on education and mentoring Based in Melbourne, Australia peter.myers@bitwisesolutions.com.au http://www.linkedin.com/in/peterjsmyers

Session Outline Introducing: Big Data Hadoop Azure HDInsight Excel BI with Power Add-ins Big Data Modeling with Power Pivot: Benefits Considerations Resources

Demonstration 1. Creating the Azure HDInsight Cluster

Introducing Big Data Cheap Storage Inexpensive Computing

Introducing Big Data (Continued) Big data is a collection of data sets so large and complex that it becomes awkward to work with using on-hand database management tools. Difficulties include capture, storage, search, sharing, analysis, and visualization. Wikipedia

Introducing Big Data (Continued) Big data solutions deal with the complexities of: VOLUME (Size) VARIETY (Structure) VELOCITY (Speed)

Introducing Big Data (Continued) Petabytes Click stream Wikis/blogs Sensors RFID Devices Social sentiment Audio/video Big Data Log files Terabytes Spatial and GPS coordinates Data market feeds Gigabytes egov feeds Weather Megabytes Text/image Data Complexity: Variety and Velocity

Introducing Big Data Responding to New Questions What s the social sentiment of my product? How do I better predict future outcomes? How do I optimize my services based on patterns of weather, traffic, etc.?

Introducing Hadoop Apache Hadoopis for big data It is a set of open source projects that transform commodity hardware into a service that can: Store petabytes of data reliably Allow huge distributed computations

Introducing Hadoop (Continued) Key attributes: Open source Highly scalable Runs on commodity hardware Redundant and reliable (no data loss) Batch processing centric using a Map-Reduce processing paradigm

Introducing Hadoop Comparison to Traditional RDBMS Data Size Access Updates Structure Integrity Scaling DBA Ratio TRADITIONAL RDBMS HADOOP

Introducing Azure HDInsight Azure HDInsight is Microsoft s Hadoop-based service that enables big data solutions in the cloud Empowers organizations with new insights on previously untouched unstructured data, while connecting to the most widely used BI tools on the planet +

Introducing Azure HDInsight (Continued) Key attributes: 100% Apache Hadoop solution in the cloud Built on Hortonworks Data Platform (HDP) Develop in.net and Java Can be automated with PowerShell and Command Line Can deliver insights through Excel with the Power add-ins

Introducing Azure HDInsight How it Works: 1 Data Storage

Demonstration 1. Storing Web Log Data in Azure Storage

Introducing Azure HDInsight How it Works: 2 Take the Processing to the Data RUNTIME Server Server Server Server

Introducing Azure HDInsight The Big Data Ecosystem Legend Red = Core Hadoop Blue = Data processing Green = Packages Distributed Processing (MapReduce) Purple = Microsoft integration points and value adds Orange = Data Movement Distributed Storage (HDFS)

Demonstration 1. Word count with Pig( Hello World for Hadoop)

Introducing Azure HDInsight Traditional E-Commerce Data Flow

Introducing Azure HDInsight New E-Commerce Big Data Flow

Introducing Azure HDInsight Hadoop Data Flow Data Hadoop Analytics

Excel BI with Power Add-ins Excel: Complete and Powerful Self-Service Tool Access Share Clean Power View Power Map Visualize Mash-up Explore

Demonstrations 1. Using Power Query to query the Pig output 2. Using Power Pivot to load web log data with Hive 3. Using Power View to analyzeweb log data

Big Data Modeling with Power Pivot Benefits Data models can surface big data in an intuitive way to promote rapid exploration, analysis and reporting Big data can be easily integrated with other data sources Self-service BI potential: Power Pivot can load big data by using the Table Import Wizard ODBC direct to HDInsight OLE DB with a SQL Server linked server to Azure HDInsight Power Pivot workbooks can become a data source for: Local Excel reports (within the same workbook) with PivotTables, PivotCharts, CUBE functions and Power View Other analytic and reporting tools (if published to SharePoint on-premises)

Big Data Modeling with Power Pivot Considerations Big data results may be too large for loading into in-memory storage Workarounds, by minimizing the amount of data to retrieve Retrieve a smaller time period of data Decrease the dimensionality, and/or Increase the grain Sample with a random distribution of data Once the big data results are loaded (cached in memory), the data model can deliver high query performance

Demonstration 1. Destroying the Azure HDInsight Cluster

Summary Big data refers to data sets so large and/or complex that they become awkward to work with in conventional ways Hadoop can store petabytes of data reliably and execute huge distributed computations Big data query results often involve significant latency Excel BI includes Power add-ins to query, analyze and visualize Big Data sourced from Azure HDInsight

Resources Microsoft Big Data web site http://www.microsoft.com/en-us/server-cloud/solutions/big-data.aspx Azure HDInsight web site http://azure.microsoft.com/en-us/documentation/services/hdinsight/ Hortonworks tutorials http://hortonworks.com/tutorials Numerous tutorials are available to learn about big data by using the Hortonworks Sandbox

Training Solutions Peter Myers provides training in SQL Server BI and Power BI All training courses in the Microsoft BI Academy program have been specifically designed to enable students to quickly commence developing and maintaining state-of-the-art integrated Business Intelligence (BI) solutions developed by using Microsoft products SQL Server EIM (2 days) SQL Server SSAS (3 days) SQL Server SSRS (3 days) Office Power BI (2 days) Training courses can be delivered remotely, or in-person http://www.bitwisesolutions.com.au

Thank You