NZ BI User Group Auckland 18 September, 2013 Big Data Analytics with PowerPivot and Power View
Presenter Introduction Peter Myers BI Expert Bitwise Solutions BBus, SQL Server MCSE, MCT, SQL Server MVP Experienced in designing, developing and maintaining Microsoft database and application solutions, since 1997 Focuses on education and mentoring Based in Melbourne, Australia peter.myers@bitwisesolutions.com.au http://www.linkedin.com/in/peterjsmyers
Session Objectives To introduce big data To introduce HDInsight To introduce self-service Business Intelligence (BI): Excel 2013 PowerPivot Excel 2013 Power View To introduce the new self-service BI features in Excel 2013 To demonstrate how to create a self-service big data BI solution with Excel 2013 To provide relevant resources for further investigation
Session Outline Introducing: Big Data and Hadoop HDInsight PowerPivot in Excel 2013 Power View in Excel 2013 Big Data Modeling with PowerPivot: Benefits Considerations Demonstrations Resources
Introducing Big Data Big data is a collection of data sets so large and complex that it becomes awkward to work with using on-hand database management tools. Difficulties include capture, storage, search, sharing, analysis, and visualization. Wikipedia
Introducing Big Data Continued Big data solutions deal with complexities of: VOLUME (Size) VARIETY (Structure) VELOCITY (Speed)
Introducing Big Data Continued Petabytes Terabytes Gigabytes Megabytes Data Complexity: Variety and Velocity
Introducing Big Data Responding to New Questions What s the social sentiment of my product? How do I better predict future outcomes? How do I optimize my services based on patterns of weather, traffic, etc.?
Introducing Hadoop Apache Hadoopis for big data It is a set of open source projects that transform commodity hardware into a service that can: Store petabytes of data reliably Allow huge distributed computations Key attributes: Open source Highly scalable Runs on commodity hardware Redundant and reliable (no data loss) Batch processing centric using a Map-Reduce processing paradigm
Introducing Hadoop How it Works
Introducing Hadoop How it Works RUNTIME Server Server Server Server
Introducing HDInsight HDInsight is Microsoft s 100% Apache compatible Hadoop distribution Available as a Windows Azure service presently available in developer preview Empowers organizations with new insights on previously untouched unstructured data, while connecting to the most widely used BI tools on the planet
Introducing PowerPivot PowerPivot empowers business users to create selfservice, BI solutions in Excel A client add-in extends Excel s capabilities to support creating data models Achieved with a client-side version of SQL Server Analysis Services, known as the xvelocity in-memory analytics engine Can efficiently store data volumes far greater than what Excel worksheets can achieve A separate window is used to load, explore, relate, and enrich data with calculations Can import and relate data from corporate, local, and ad hoc data stores The Excel 2013 add-in is automatically installed, but not enabled
Introducing Power View Power View is an interactive data exploration, visualization, and presentation experience Highly visual design experience Rich meta-driven interactivity Presentation-ready at all times It delivers intuitive ad-hoc reporting for business users Reports can be based on tabular data models, including PowerPivot It is now available inside Excel 2013, and with new features: Pie charts Maps KPIs Hierarchies Drill down/drill up Report styles, themes and text resizing Backgrounds and background images Hyperlinks Printing
Big Data Modeling with PowerPivot Benefits Data models can surface big data in an intuitive way to promote rapid exploration, analysis and reporting Big data can be easily integrated with other data sources Self-service BI potential: PowerPivot can load big data by using the Table Import Wizard ODBC direct to HDInsight OLE DB with a SQL Server linked server to HDInsight PowerPivot workbooks can become a data source for: Local Excel reports (within the same workbook) with PivotTables, PivotCharts, CUBE functions and Power View Other analytic and reporting tools (once published to SharePoint)
Big Data Modeling with PowerPivot Considerations Big data results may be too large for loading into in-memory storage Workaround: Minimize the amount of data to retrieve and store Retrieve a smaller time period of data Decrease the dimensionality, and/or Increase the grain Sample with a random distribution of data Once the big data results are loaded (cached in memory), the data model can deliver high query performance
Demonstrations 1. Creating an HDInsight Solution 2. Creating a PowerPivot Workbook Based on HDInsight 3. Analyzing Big Data with Excel Reports
Resources Microsoft Big Data http://www.microsoft.com/bigdata Windows Azure HDInsight https://www.hadooponazure.com HDInsight Services for Windows Includes an excellent set of BI specific resources in the section named Using HDInsight with other BI Technologies http://social.technet.microsoft.com/wiki/contents/articles/6204.hadoop-basedservices-for-windows-en-us.aspx Blog: Big Data for Everyone: Using Microsoft s Familiar BI Tools with Hadoop http://blogs.msdn.com/b/microsoft_business_intelligence1/archive/2012/02/24 /big-data-for-everyone-using-microsoft-s-familiar-bi-tools-with-hadoop.aspx
Thank You