Understanding Microsoft s BI Tools The purpose of this document is to provide a high level understanding of what tools Microsoft has to support the concepts of data warehousing, business intelligence, and reporting. There are 3 areas to view their support of these processes. The tooling, overlap, and usage will be noted below.
Contents Data at Rest...3 SQL Server Data Engine Columnstore Indexes...4 Parallel Data Warehouse (PDW)...4 SQL Server Analysis Services Multidimensional Models...5 Tabular Models...5 Power Pivot Power Pivot for Excel...6 Power Pivot for SharePoint...7 Upgrading Power Pivot...7 Data in Motion...8 SQL Server Integration Services...9 SSAS Multidimensional Model Processing...9 Data Visualization... 11 SQL Server Reporting Services SSRS Standalone...12 SSRS Integrated with SharePoint...12 SSRS Report Builder...12 PerformancePoint Services...13 Excel...13 Power View...14 Power Map...14 Other Tools... 15 Data Mining...16 HDInsight...16 Power Q & A...16 BizTalk...9 xvelocity In-Memory Analytics Engine Data Refresh... 10 SQL Server Service Broker... 10 Power Query... 10
Data at Rest Data at rest refers to data storage techniques used to provide or host data for use by visualization tools.
SQL Server Data Engine This is the most obvious storage or data at rest component. This is a relational database management system (RDBMS) with various optimizations used to support data warehouse and data mart solutions. Columnstore Indexes This capability uses the xvelocity In-Memory Analytics Engine to greatly improve query performance against large tables within a data warehouse or data mart. Parallel Data Warehouse (PDW) The PDW is an appliance that is used to provide massively parallel processing (MMP) solutions in the multiple terabyte to petabyte sizes. The appliance starts at a half rack and can be expanded from there. PolyBase has the ability to use a single query language (T-SQL) to query relational and Hadoop stores. PolyBase is currently only available in PDW. Azure SQL Database is a relational database engine built on Windows Azure. It is truly implemented as Platform as a Service allowing users to create databases on a shared server instance. This supports the core relational database functionality and is continually increasing in capability. Currently Azure SQL Databases are limited in size to 150 GB. However, these databases have high availability built in and most server level management is handled by Microsoft, thus reducing overall cost of ownership. 4
SQL Server Analysis Services (SSAS) SQL Server Analysis Services supports two types of data storage. Multidimensional Models The most familiar storage is called multidimensional models. This covers the traditional cube storage techniques. Multidimensional models support large amounts of data which can be highly indexed and aggregated for efficient querying and extensive analysis. The largest cube in the world has exceeded 20 terabytes in size which, due to compression, consumes a data mart in excess of 250 terabytes. Multidimensional models are intended to be used against data marts or data warehouses designed around the Kimball dimensional model star schemas with dimensions and facts. Tabular Models Tabular models were introduced in SQL Server 2012 and are a direct descendant of the Power Pivot capability that was introduced previously in Excel. Tabular models are an in-memory data store also driven by the xvelocity In-Memory Analytics Engine. They can scale to the amount of memory supported on the server. These models require less design time and can perform much better than multidimensional models. They are limited in size by the amount of memory that is available, but compression is usually 10:1 or better so quite a bit of data can be stored in this way. 5
Power Pivot Power Pivot is the original in-memory storage introduced by Microsoft as Vertipaq. Vertipaq is now called the xvelocity In-Memory Analytics Engine. What makes Power Pivot and the SSAS Tabular Models so powerful is their ability to mash up data easily. Users can open the Power Pivot window and pull data from a variety of data sources including relational databases, SSRS reports, SharePoint lists, text files, OData feeds, and even other Power Pivot models. Once the data is pulled into PowerPivot, relationships between previously unrelated data sources can be made and analysis done in a similar fashion to SSAS Multidimensional Models. This data is loaded in memory and stored with the Excel spreadsheet when done. There are two flavors of Power Pivot Power Pivot for Excel and Power Pivot for SharePoint. Power Pivot for Excel Power Pivot for Excel was originally delivered as a SQL Server Add-In for Excel 2010. It is still available for Excel 2010 as a separate add-in. With Excel 2013, Power Pivot is built into Excel and the advanced features of Power Pivot can be accessed by enabling it in Excel s option menu with no further downloads required. For all things in-memory, Excel s Power Pivot is the place to start. Once a model is created here it can be shared in SharePoint and eventually deployed to SSAS Tabular. 6
Power Pivot (cont.) Power Pivot for SharePoint Power Pivot capabilities in SharePoint are available in SharePoint 2010 and SharePoint 2013. Office 365 with Power BI also supports Power Pivot, but is limited to smaller models (up to 250 MB). When an Excel workbook with Power Pivot data is uploaded to SharePoint and SharePoint s Power Pivot support is enabled, the Power Pivot data is moved to SharePoint s SSAS instance. This SSAS instance is entirely controlled by the SharePoint farm and is not available for general use or access. However, now the Power Pivot data is available for others to use. When deployed to SharePoint, Power Pivot tables can have scheduled refreshes and Power Pivot data can be used as a standalone data source. Upgrading Power Pivot The last piece of the Power Pivot scenario is that a Power Pivot model can be imported into a SSAS Tabular model with one click. This is done by creating a new Tabular model project in Visual Studio (2010 or above) with the latest SQL Server Data Tools and selecting Import from Power Pivot. Once this is complete, the Power Pivot model that started in Excel will now be managed in SSAS Tabular. 7
Data in Motion Data has to get to a place of rest in some way. Microsoft has some basic tools for moving data.
SQL Server Integration Services SSIS is the enterprise data movement and transformation tool in the Microsoft stack. It is capable of moving data from a wide variety of sources to a wide variety of destinations. Along the way, fairly complex transformations and reconfigurations of the data is possible. SSIS operates as a batch movement tool. This means it is not the best tool for any real-time data movement, but is very efficient at moving large amounts of data. It is the ideal tool to load data marts, data warehouses, or operational data stores. SSAS Multidimesional Model Processing The SSAS Multidimensional Models are loaded by processing the objects. Processing essentially executes a number of dynamically created queries against the data sources to load the dimensions and measure groups within an SSAS database. BizTalk BizTalk is another method to move data to data marts or warehouses. This tool focuses on individual messages and sending that data to the appropriate destination. BizTalk should be considered if real time is a requirement and there are other uses for BizTalk within the enterprise. 9
xvelocity In-Memory Analytics Engine Data Refresh SQL Server Service Broker All xvelocity tools, Power Pivot and SSAS Tabular models, use a concept called refresh to load the models. In Power Pivot, the refreshes are done at the table level. This means that in all cases, the data in the table is removed and reloaded from the source. The level of refresh granularity in this case is at the table level. Therefore, large tables will take longer to refresh. In SSAS Tabular, there is the capability to use partitions to limit the scope of the refresh to the more volatile data segments within a table. This will allow for a much more efficient refresh scenario and is one more reason to upgrade a PowerPivot model to a Tabular model. SQL Server Service Broker is an asynchronous messaging system allowing data to be moved in a more real time fashion. When considering real time capabilities within a data mart or data warehouse, Service Broker is a valid option and much easier to maintain and implement than BizTalk over time. Power Query Power Query is an Excel add-in that brings data movement and transformation capabilities into Excel. Power Query natively supports more data sources than Power Pivot does including Hadoop and HDInsight. More importantly, Power Query also supports shaping data and using a number of steps to shape the data. This is similar in practice to using SSIS to transform data through a series of tasks or packages. Each time the data is refreshed in Excel it runs the data through the same shaping steps. 10
Data Visualization Data visualization takes many forms including reports, dashboards, spreadsheets, etc. Microsoft has three core toolsets used to visualize data: SQL Server Reporting Services, SharePoint, and Office. Many of the individual tools live in multiple locations. The focus on this section will be at the tool level.
SQL Server Reporting Services (SSRS) SQL Server Reporting Services can be used as a standalone product or integrated into SharePoint. In both cases, the product is primarily used to create pixel-perfect reporting for the purpose of printing. There is a significant amount of controls that can be used to visualize the data ranging from tables to charts to maps. When deployed as an HTML report, SSRS supports a level of interactivity including show/hide capabilities and drill through reporting using pass through parameters. SSRS Standalone The standalone implementation of SSRS has a lightweight reporting portal that can be used to expose reports. This portal is not easily branded or customized. It supports basic report organization and security. SSRS Integrated with SharePoint When SSRS is integrated into SharePoint, the reporting capability is built into a portal that fully supports security and customization. SharePoint also has support for SSRS allowing users to create reports for use as webparts for more complex dashboarding. SSRS Report Builder Report builder is an Office-like design experience for building reports. It is supported in both standalone and integrated modes. It is very effective when used with SharePoint as it allows users to create reports in a library and Report Builder understands the report location as well as any shared components used within the report such as Shared Data Sources. 12
Performance Point Services PerformancePoint Services (PPS) has been embedded into the Enterprise Edition of SharePoint since SharePoint 2010. PPS is used to design and deploy dashboards within SharePoint. The Dashboard Builder allows users to build reports, charts, KPIs, and scorecards. Once created, these can be deployed to a PerformancePoint Dashboard or to a PPS webpart for use within a SharePoint page. PPS supports a high level of interactivity when used with SQL Server Analysis Services, so much so that usage with other data sources is not recommended for most visualizations with the exception of KPI targets. PerformancePoint also includes one of the most sought after visualizations in the industry, the Decomposition Tree, which allows meaningful drill capability across SSAS data. Excel Excel 2013 is truly the first version of Excel that is a fully functional BI client tool. While always used to analyze data, the latest version of Excel has gone way beyond a spreadsheet. At its most basic level, Excel provides pivot tables, pivot charts, conditional formatting, sparklines, and OLAP tools. These features allow users to create sophisticated dashboards and reports within Excel for use within SharePoint or Office365. Excel also has added data slicers, timeline slicers, and Office Apps which expand the visualization functionality within Excel to produce compelling dashboards. 13
Power View Power View was created by the SQL Server Reporting Services team to allow for simple (all within 2 clicks) and compelling visualizations. It was released originally in SharePoint 2010 with the SSRS add-in. Microsoft has also made this available in Excel 2013 making this functionality easily available to the standard Excel user. At the current time, Power View only supports data stored in Power Pivot, SSAS Tabular, and SSAS Multidimensional models. The timeline view and mapping are some of the key visualization elements available in Power View. One drawback to using Power View is that it is built in Silverlight which means it cannot be viewed on ipad or SurfaceRT unless deployed through Office 365 which uses HTML5. Power Map Power Map is a 3D visualization in Excel with Office 365. It allows users to create compelling stories over space (geography) and time. Once completed, these visualizations can be saved as movies for viewing on websites and in meetings. 14
Other Tools There are some other tools that do not fit into this scenario.
Data Mining Data Mining is currently available in SSAS multidimensional models and in Excel as a free add-in. Microsoft has made a number of data mining algorithms freely available in these tools. While not frequently used or updated, these can be very valuable to customers who want to do some basic data mining without paying for tools such as SAS. HDInsight HDInsight is Microsoft s Hadoop-based service in Windows Azure. This allows users to implement big data solutions using HDFS without incurring setup and maintenance on site. Power Q & A Power Q & A is a part of Office 365 with Power BI. This tool allows users to ask natural language questions and search the BI solution for answers. An example is which country did we sell the most widgets in last year? Power Q &A will start returning results as the question is being entered and visualize it with Power View. 16
Want to learn more about Microsoft BI? Read about Microsoft BI Strategies and learn ways to utilize these tools. Or contact Magenic and see how we can help you. This can be a paragraph about Magenic and probably specifically about Magenic s BI practices. Maybe a little about Steve Hughes? I dunno, but its a good space for it. Lorem ipsum dolor sit amet, consectetur adipiscing elit. Donec ut nisl vulputate, malesuada urna at, tristique mi. Donec sodales pulvinar orci vitae suscipit. Proin eget tempor nibh. Nullam eu metus ut metus sollicitudin condimentum sit amet ut sapien. Curabitur at tortor augue. Morbi vitae mauris ut ante. Tweet this! Reading Magenic s Understanding Microsoft s BI Tools