Enterprise Solutions. Data Warehouse & Business Intelligence Chapter-8



Similar documents
Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University

Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006

Open Source Business Intelligence Intro

Understanding Data Warehousing. [by Alex Kriegel]

Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers

SAS BI Course Content; Introduction to DWH / BI Concepts

Hexaware E-book on Predictive Analytics

QlikView Business Discovery Platform. Algol Consulting Srl

Data Search. Searching and Finding information in Unstructured and Structured Data Sources

Data Mart/Warehouse: Progress and Vision

Whitepaper. Data Warehouse/BI Testing Offering YOUR SUCCESS IS OUR FOCUS. Published on: January 2009 Author: BIBA PRACTICE

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

Analytics Industry Trends Survey. Research conducted and written by:

Advanced Data Management Technologies

Armanino McKenna LLP Welcomes You To Today s Webinar:

Cúram Business Intelligence and Analytics Guide

Business Intelligence In SAP Environments

A Critical Review of Data Warehouse

MDM and Data Warehousing Complement Each Other

Getting it Right: How to Find the Right BI Package for the Right Situation Norma Waugh. RMOUG Training Days February 15-17, 2011

Reporting and Business Intelligence Tools. Prasad Veeramachaneni DBMS Consulting 10 October 2010 Tutorial Session Session T09

POLAR IT SERVICES. Business Intelligence Project Methodology

OLAP Theory-English version

Comparative Analysis of the Main Business Intelligence Solutions

LEARNING SOLUTIONS website milner.com/learning phone

Data Warehousing and Data Mining in Business Applications

TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS

BIG DATA COURSE 1 DATA QUALITY STRATEGIES - CUSTOMIZED TRAINING OUTLINE. Prepared by:

A Model-based Software Architecture for XML Data and Metadata Integration in Data Warehouse Systems

Breadboard BI. Unlocking ERP Data Using Open Source Tools By Christopher Lavigne

Part 22. Data Warehousing

Data warehouse and Business Intelligence Collateral

SAS Business Intelligence Online Training

Business Intelligence for Financial Services: A Case Study

Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach

Business Intelligence. Advanced visualization. Reporting & dashboards. Mobile BI. Packaged BI

Management Consulting Systems Integration Managed Services WHITE PAPER DATA DISCOVERY VS ENTERPRISE BUSINESS INTELLIGENCE

Introduction to Datawarehousing

Tips and Techniques on how to better Monitor, Manage and Optimize your MicroStrategy System High ROI DW and BI Solutions

Fluency With Information Technology CSE100/IMT100

DECISION SUPPORT SYSTEMS OR BUSINESS INTELLIGENCE. WHICH IS THE BEST DECISION MAKER?

White Paper. Comparison of Business Intelligence Stacks: Microsoft SQL Server Reporting Services and SAP Business Objects July 7, 2010

Open Source Business Intelligence

IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH

Data Warehouse Overview. Srini Rengarajan

BENEFITS OF AUTOMATING DATA WAREHOUSING

Introduction to Business Intelligence

Business Analytics and Data Visualization. Decision Support Systems Chattrakul Sombattheera

Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole

Understanding and Evaluating the BI Platform by Cindi Howson

Data W a Ware r house house and and OLAP II Week 6 1

Online Courses. Version 9 Comprehensive Series. What's New Series

Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring

Nothing in this job description restricts management's right to assign or reassign duties and responsibilities to this job at any time.

Overview. DW Source Integration, Tools, and Architecture. End User Applications (EUA) EUA Concepts. DW Front End Tools. Source Integration

Extend your analytic capabilities with SAP Predictive Analysis

Data Testing on Business Intelligence & Data Warehouse Projects

Data Integration and ETL Process

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server

An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of

IST722 Data Warehousing

MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

The Benefits of Data Modeling in Business Intelligence

Why include analytics as part of the School of Information Technology curriculum?

JOURNAL OF OBJECT TECHNOLOGY

SAP BusinessObjects Information Steward

Establish and maintain Center of Excellence (CoE) around Data Architecture

Business Intelligence Platform Capability Matrix

Alteryx Strategic Analytics Solving Complex Analytic Challenges with a Simple Solution

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

DATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS

Data Warehousing Systems: Foundations and Architectures

14. Data Warehousing & Data Mining

CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University

DATA WAREHOUSING AND OLAP TECHNOLOGY

CASE PROJECTS IN DATA WAREHOUSING AND DATA MINING

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

SAP BusinessObjects SOLUTIONS FOR ORACLE ENVIRONMENTS

How to Enhance Traditional BI Architecture to Leverage Big Data

ORACLE BUSINESS INTELLIGENCE SUITE ENTERPRISE EDITION PLUS

REENGINEERING HR APPRAISAL LEGACY SYSTEM TO BI PLATFORM

Lection 3-4 WAREHOUSING

Business Intelligence Solutions. Cognos BI 8. by Adis Terzić

Dashboard Reporting Business Intelligence

Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram

Optimizing the Performance of the Oracle BI Applications using Oracle Datawarehousing Features and Oracle DAC

How To Model Data For Business Intelligence (Bi)

Transcription:

Enterprise Solutions Data Warehouse & Business Intelligence Chapter-8

Learning Objectives Concepts of Data Warehouse Business Intelligence, Analytics & Big Data Tools for DWH & BI

Concepts of Data Warehouse A Data Warehouse is a subject-oriented, integrated, time-variant and non-volatile collection of data in support of management's decision making process, that is, they are built to analyse a particular subject area or domain or a subject area within a domain - for example, "sales" within retail banking would be a particular subject area Data Warehouses integrates data from multiple data sources or collection of data from various sources of the organization Obviously, the data in a DWH is historical and hence time variant By the very fact that it is historical, the data is also permanent and non-volatile and should never be deleted or modified

Data Warehouse Architecture Enterprise DWH are extremely complex entities and designs and there is no generic architecture However all DWHs have the following components or layers as in the representation below-

Data Warehouse Architecture Data Source Layer obviously, represents the different data sources that feed data into the DWH. These data sources may be of any types & format Data Extraction Layer is the layer where data gets pulled from the data source into the DWH with some data cleansing though without any major Data Transformation Staging Area - is where the data sits prior to being cleansed and transformed into a DWH or Data Mart quality ETL Layer - is where data gains its "intelligence", as logic is applied to transform the data from a transactional nature to an analytical nature Data Storage Layer is the final storage for the cleansed and transformed data Data Logic Layer - is where business rules are stored Data Presentation Layer - Reporting Tools are used here and this layer gives shape to the format in which information is presented to the user Metadata Layer Metadata is Data about Information stored in the DWH system System Operations Layer - includes information on how the DWH system operates, such as - ETL job status, access history, system performance, etc

Components of a Data Warehouse Data Integrity - Data integrity refers to the validity of data - data is consistent and correct. Conceptual Data Model - identifies the highest-level relationships between the different entities. Logical Data Model - A logical data model describes the data in as much detail as possible, without regard to how they will be physical implemented in the database. Physical data Model - Physical data model represents how the model will be built in the database. Data Mart - Data marts are small slices of the data warehouse - they have a more limited audience and/or data content Fact Table A table that stores quantitative information for analysis Dimensional Model - Dimension Data Modelling is used for creating Summary Information for example, summarization of Sales by Day, Week, Month & Year or by Regions Aggregation is a summary of detail data that is stored with or referred to by a cube ETL - Extraction, Transformation, and Loading is an ETL process to extract data from different types of systems, transform it into a structure that's more appropriate for reporting and analysis and finally loads it into the database and or cube(s) OLAP - On-Line Analytical Processing - OLAP provides end users a quick way of slicing and dicing the data

Data Cleansing Data cleansing is a key process in DWH Data Cleansing is the process of altering data in a given storage to ensure that it is accurate and correct - deleting old, incomplete or duplicated data There are many ways to pursue data cleansing in various software and data storage architectures - mostly focused on careful review of data sets and the protocols associated with any particular data storage technology Data Cleansing issues are similar to problems that which archivists, database admin staff and others face What is dirty data? Are data anomalies that create wrong outputs Dirty data is created when reality is different from what is captured and stored Typical steps in Data Cleansing- Data Analysis Defining Transformation Workflows and Mapping Rules Depends on the number of data sources, their degree of heterogeneity and the dirtyness of the data Verification of the correctness and effectiveness of a transformation Transformation - Execution of the transformation steps either by running the ETL Backflow of cleaned data - After (single-source) errors are removed, the cleaned data should also replace the dirty data in the original sources

Business Intelligence & Big Data BI is a set of sophisticated Analytical Techniques & Tools used in identifying, extracting and analysing 'hard' business data - such as sales revenue by products or departments or associated costs and incomes Objectives of a BI action include - understanding of a firm's internal and external strengths and weaknesses, understanding of the relationships between different data for better decision making, detection of opportunities for innovation and cost reduction and optimal deployment of resources BI is accomplished through the use of special softwares and helps companies organize and analyse data to make better decisions the data may be internal or from external data sources BI therefore is not one piece of software - generally include data mining tools, operational dashboards, reporting tools, search and query tools, analytics processing softwares, content viewer ISVs in the BI space include- SAP, Oracle, IBM, Microsoft, Information Builders, MicroStrategy and SAS. Some of the smaller niche players are Actuate Corporation, Alteryx, Logi Analytics, QlikTech and Tableau

Business Intelligence & Big Data Big Data is a popular term used to describe the exponential growth and availability of data, both structured and unstructured Big Data is defined in terms of Volume, Velocity, Variety, Complexity and Variability Why Big Data? The hopeful vision is that organizations will be able to take data from any source, harness relevant data and analyse it to find answers that resolve key issues such as product development strategies & cost rationalization It is intended that by combining big data and highpowered analytics, it may be possible to - recalculate entire risk portfolios in minutes, or identify root causes of failures

DEH & BI Products DWH & BI Products are categorized as- Data Modelling Data Mining OLAP Tools ETL Tools BI Tools Reporting Tools The major companies in the DWH/BI space are IBM, SAS, Oracle, TIBCO, Microstrategy, SAP, InformationBuilder, etc There are also smaller and niche product companies Some of the popular tools in the market are Erwin, Rational & Power Designer, Oracle Designer for Data Modelling IBM Cognos, IBM SPSS, SAS Enterprise Miner, TIBCO, etc for Data Mining BO, Cognos, Microstrategy, Hyperion - OLAP Informatica, Cognos, BO, Websphere ETL BI Cognos, Netweaver, BO, Siebel It is to be remembered that DWH & BI space is replete with tools from a slew of companies and most large organizations use multiple (seemingly ) redundant tool-set in their operations