Test Validations for Next Generation Business Intelligence



Similar documents
Data Warehousing and Data Mining in Business Applications

Testing Big data is one of the biggest

SAS Business Intelligence Online Training

Advanced Analytics. The Way Forward for Businesses. Dr. Sujatha R Upadhyaya

IST722 Data Warehousing

A Knowledge Management Framework Using Business Intelligence Solutions

Microsoft Business Intelligence

Management Accountants and IT Professionals providing Better Information = BI = Business Intelligence. Peter Simons peter.simons@cimaglobal.

End to End Microsoft BI with SQL 2008 R2 and SharePoint 2010

PUSH INTELLIGENCE. Bridging the Last Mile to Business Intelligence & Big Data Copyright Metric Insights, Inc.

The Microsoft Business Intelligence 2010 Stack Course 50511A; 5 Days, Instructor-led

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

Intelligent BI Testing. Key to Reliable Information. Data to Impact.

70-467: Designing Business Intelligence Solutions with Microsoft SQL Server

MDM for the Enterprise: Complementing and extending your Active Data Warehousing strategy. Satish Krishnaswamy VP MDM Solutions - Teradata

Introduction to Business Intelligence

MDM and Data Warehousing Complement Each Other

MS 50511A The Microsoft Business Intelligence 2010 Stack

BUSINESS INTELLIGENCE. Keywords: business intelligence, architecture, concepts, dashboards, ETL, data mining

Vendor briefing Business Intelligence and Analytics Platforms Gartner 15 capabilities

Business Intelligence: Effective Decision Making

Master Data Management and Data Warehousing. Zahra Mansoori

Oracle BI Application: Demonstrating the Functionality & Ease of use. Geoffrey Francis Naailah Gora

Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram

The IBM Cognos Platform

SAS BI Course Content; Introduction to DWH / BI Concepts

SQL Server 2012 End-to-End Business Intelligence Workshop

Enhancing Decision Making

Making Business Intelligence Relevant for Mid-sized Companies. Improving Business Results through Performance Management

Oracle Business Intelligence Suite Enterprise Edition

Real time Business Intelligence

Oracle Business Intelligence Suite Enterprise Edition Overview and Benefits

8. Business Intelligence Reference Architectures and Patterns

Analytical CRM to Operational CRM Operational CRM to Analytical CRM Applications

Establish and maintain Center of Excellence (CoE) around Data Architecture

State of Louisiana Department of Revenue. Development/implementation of LDR s First Data Mart RFP Official Responses to Written Inquiries

3/17/2009. Knowledge Management BIKM eclassifier Integrated BIKM Tools

Presented by: Jose Chinchilla, MCITP

Intelligent Business Processes

What s New with Informatica Data Services & PowerCenter Data Virtualization Edition

LEARNING SOLUTIONS website milner.com/learning phone

Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers

Enterprise Information Integration (EII) A Technical Ally of EAI and ETL Author Bipin Chandra Joshi Integration Architect Infosys Technologies Ltd

Metadata Strategies: your guide through the data jungle Achim Granzen EMEA Technology Strategist

Descriptive to Predictive to Prescriptive Analytics: Move Up the Value Chain. Suren Nathan CTO

Building a Data Warehouse

QAD Business Intelligence

Introduction to SAS Risk Management

HYPERION MASTER DATA MANAGEMENT SOLUTIONS FOR IT

Importance or the Role of Data Warehousing and Data Mining in Business Applications

IT FUSION CONFERENCE. Build a Better Foundation for Business

Creating a Business Intelligence Competency Center to Accelerate Healthcare Performance Improvement

Hexaware E-book on Predictive Analytics

Data Warehouse Overview. Srini Rengarajan

California Enterprise Architecture Framework. Business Intelligence (BI) Reference Architecture (RA)

Using Predictions to Power the Business. Wayne Eckerson Director of Research and Services, TDWI February 18, 2009

HROUG. The future of Business Intelligence & Enterprise Performance Management. Rovinj October 18, 2007

The Role of the Analyst in Business Analytics. Neil Foshay Schwartz School of Business St Francis Xavier U

Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University

How I Transitioned from an E-Business Suite Development to an Oracle Business Intelligence Developer

BI, Analytics and Big Data A Modern-Day Perspective

Real-Time Data Analytics and Visualization

Justice Data Warehousing and Court Business Intelligence. Technical Introduction. Harris County Courts

National Health Reform Enterprise Data Warehouse (NHR EDW) Program. RFT Industry Brief

QlikView Business Discovery Platform. Algol Consulting Srl

10 Biggest Causes of Data Management Overlooked by an Overload

OLAP Theory-English version

SimCorp Solution Guide

<Insert Picture Here> Oracle Retail Data Model Overview

Oracle Business Intelligence 11g Business Dashboard Management

How to Enhance Traditional BI Architecture to Leverage Big Data

Data Warehouse: Introduction

Data warehouse Architectures and processes

AMSTERDAM BOSTON HEIDELBERG LONDON NEW YORK OXFORD PARIS SAN DIEGO SAN FRANCISCO SINGAPORE SYDNEY TOKYO

Migrating a Discoverer System to Oracle Business Intelligence Enterprise Edition

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Four Methods to Monetize Service Assurance Monitoring Data

SQL Server 2005 Features Comparison

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

Analance Data Integration Technical Whitepaper

Using Open Source Middleware for the Business Intelligence. Licensed under Creative Commons Att. Nc Nd 2.5 license

Getting Value from Big Data with Analytics

Republic Polytechnic School of Information and Communications Technology C355 Business Intelligence. Module Curriculum

Cis330. Mostafa Z. Ali

VIEWPOINT. High Performance Analytics. Industry Context and Trends

Welcome To Today s Webinar: Dynamics Insights SM for Microsoft Dynamics AX

Integrating SAP and non-sap data for comprehensive Business Intelligence

Getting Real Real Time Data Integration Patterns and Architectures

Welcome to online seminar on. Oracle Agile PLM BI. Presented by: Rapidflow Apps Inc. January, 2011

The SAS Transformation Project Deploying SAS Customer Intelligence for a Single View of the Customer

Master Data Management: More than a single view of the enterprise? Tony Fisher President and CEO

A Service-oriented Architecture for Business Intelligence

Cost-Effective Business Intelligence with Red Hat and Open Source

Transcription:

Test Validations for Next Generation Business Intelligence International Software Testing Conference 2012 Anusha Jaya Murthy Yerraguntla Infosys Limited (NASDAQ: INFY) 1

Abstract Traditional BI approaches like OLAP and data mining are proving to be insufficient to make rapid analytical decisions. This practice is turning out to be less efficient in the current economic scenario especially where real time data based decisions are to be made. This paper would give a briefly touch upon traditional BI and its limitations. It would focus upon the key facets of Next Generation BI, which has become an emerging trend in the field of Analytics. It would touch upon the additional validations to be done in the Next Gen BI from a testing stand point. 2

What is traditional Business Intelligence? This is a form of making business decisions using the data residing in the warehouse and data marts. It comprises of analysis spread across three layers Source, Analytics and Reporting. Key features: Analysis hinges upon Historical data Analysis performed only on the structured and organized data Data refresh in the user reports happen at a pre-defined frequency Source layer Analytics layer Reporting layer Fig 1 Layers of BI 3

What are the short comings of traditional Business Intelligence? Not efficient to deliver required information/data to Business on demand. This resulted in delayed and deferred decisions. Business reports would get updated only at the pre-defined frequency of the data loads. Could not help decision makers being pro-active to the changing requirements. They could only be reactive, since the opportunities were lost even before analysis, in most cases. The out come of the decisions made is not in line with the management expectations. Proved to be inefficient in managing unstructured data: in terms of analysis and its integration with structured data. 4

What are the solutions to these shortcomings? Real time BI A means to make business decisions using the real time data as against historical data Predictive Analytics - A means to predict the future business trends based on the real time and historical data. Textual Analytics - A means to analyze voluminous and un-organized data and make business decisions out of it. CDC on Demand -5A means to capture the most recent data as and when needed. This is

What is a Next Generation Business intelligence? A comprehensive package which combines multiple features listed below : Real time BI Predictive Analytics Textual Analytics CDC on Demand 6

Next Generation Business Intelligence flow Sourc e Analytics layer BI layer Reports Structured Source 1 2 4 Standard ETL Predictive 5 Textual EDW Unstructured Source 1 Data Categorizatio n 2 META DATA MANAGEMENT Legend Real time data Structured source systems Fig 2 Next Generation BI flow Pro-active cache Un structured systems Micro batch ETL / Change Data capture / Queue validations 7

QA validations for Predictive Analytics The early validation ensures that appropriate subjects have been identified. Validate if the Predictive scores and goals are computed accurately. This involves validation of formulae used by the models. Validate the effectiveness of the model by doing `back testing i.e. comparison of forecasts vs. actuals. The outcome of this testing would help the designers modify the predictive model. 8

QA validations for Textual Analytics Validate the metadata of the unstructured text Validate the mechanism of converting unstructured to structured data (tools, algorithms, business rules) Performance testing while handling voluminous data Ensure that `Blather is not keyed in as input to the analytics process. 9

QA validations for Change data capture mechanism (CDC) Validate the data captured in DW by comparing against the source for the updated/inserted data ensuring that the latest change is captured. Initial Data Modified Data Validate the Failover and Recoverability Fig 3 - Differential data capture 10

Key validations in traditional BI Source layer Analytics layer Schema checks ETL checks Data Quality checks Data Model validations Security validations Cube Validations Report layer Report layout validations Performance validations Data accuracy validations Fig 4 - Layer wise validations 11

Summary of the additional skillset needed BI Phase Source Layer Analytics Layer Reporting Layer Additional QA facet for Next Gen BI CDC on Demand (triggered by the business event) Recovery/Fail over Validation Predictive Analytics: Back testing for Predictive Models Predictive score validation Text Analytics: Meta data validation of unstructured data User experience validation: Validation of real time alerts and notifications Validate whether the data flow meets the SLA. Table 1 Additional QA scenarios for Next Gen BI 12

Role to be played by QA to validate Next Gen BI Need to transform from QA into DA i.e. from a Quality Analyst to a Data Analyst. Should append technical flavor to the current skillset. Should be able to put theoretical business domain knowledge into practice. Should be able to comprehend Statistical inferences of Predictive models. Knowledge on mathematical simulation models such as Monte-Carlo would become imperative for analysis. 13

Questions??? 14

Thank you 15