INFORMATION TECHNOLOGY STANDARD



Similar documents
POLAR IT SERVICES. Business Intelligence Project Methodology

Data warehouse and Business Intelligence Collateral

Data Warehousing and Business Intelligence (DW/BI)

LEARNING SOLUTIONS website milner.com/learning phone

Microsoft Data Warehouse in Depth

MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

Designing Business Intelligence Solutions with Microsoft SQL Server 2012 Course 20467A; 5 Days

Whitepaper Data Governance Roadmap for IT Executives Valeh Nazemoff

Course 10777A: Implementing a Data Warehouse with Microsoft SQL Server 2012

Course Outline. Module 1: Introduction to Data Warehousing

Microsoft. Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server

Implementing a Data Warehouse with Microsoft SQL Server 2012

COURSE 20463C: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

Implementing a Data Warehouse with Microsoft SQL Server

SQL Server 2012 Business Intelligence Boot Camp

Implementing a Data Warehouse with Microsoft SQL Server 2012

Implementing a Data Warehouse with Microsoft SQL Server 2014

BENEFITS OF AUTOMATING DATA WAREHOUSING

Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777

Implement a Data Warehouse with Microsoft SQL Server 20463C; 5 days

Implementing a Data Warehouse with Microsoft SQL Server

Bringing agility to Business Intelligence Metadata as key to Agile Data Warehousing. 1 P a g e.

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Business Intelligence

Implementing a Data Warehouse with Microsoft SQL Server MOC 20463

COURSE OUTLINE MOC 20463: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

Knowledge Base Data Warehouse Methodology

Implementing a Data Warehouse with Microsoft SQL Server 2012 (70-463)

Microsoft Training and Certification Guide. Current as of December 31, 2013

Implementing a SQL Data Warehouse 2016

Implementing a Data Warehouse with Microsoft SQL Server

East Asia Network Sdn Bhd

HYPERION MASTER DATA MANAGEMENT SOLUTIONS FOR IT

The Role of the BI Competency Center in Maximizing Organizational Performance

US Department of Education Federal Student Aid Integration Leadership Support Contractor January 25, 2007

INFORMATION TECHNOLOGY STANDARD

BIG DATA COURSE 1 DATA QUALITY STRATEGIES - CUSTOMIZED TRAINING OUTLINE. Prepared by:

Paper DM10 SAS & Clinical Data Repository Karthikeyan Chidambaram

SAS BI Course Content; Introduction to DWH / BI Concepts

CLASS SPECIFICATION. Business Intelligence Supervisor

Implementing a Data Warehouse with Microsoft SQL Server

Course 20463:Implementing a Data Warehouse with Microsoft SQL Server

Establish and maintain Center of Excellence (CoE) around Data Architecture

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning

MicroStrategy Course Catalog

Turnkey Hardware, Software and Cash Flow / Operational Analytics Framework

Jet Enterprise Frequently Asked Questions Pg. 1 03/18/2011 JEFAQ - 02/13/ Copyright Jet Reports International, Inc.

US Department of Education Federal Student Aid Integration Leadership Support Contractor June 1, 2007

RFP Attachment C Classifications

Implementing a Data Warehouse with Microsoft SQL Server 2012

Introduction. Architecture Re-engineering. Systems Consolidation. Data Acquisition. Data Integration. Database Technology

IBM Cognos Training: Course Brochure. Simpson Associates: SERVICE associates.co.uk

Nothing in this job description restricts management's right to assign or reassign duties and responsibilities to this job at any time.

COMMONWEALTH OF PENNSYLVANIA DEPARTMENT S OF PUBLIC WELFARE, INSURANCE AND AGING

Building a Data Warehouse

State of Louisiana Department of Revenue. Development/implementation of LDR s First Data Mart RFP Official Responses to Written Inquiries

Data Warehouse (DW) Maturity Assessment Questionnaire

SQL Server 2005 Features Comparison

IBM Cognos Performance Management Solutions for Oracle

B.Sc (Computer Science) Database Management Systems UNIT-V

The IBM Cognos Platform

Nothing in this job description restricts management's right to assign or reassign duties and responsibilities to this job at any time.

Beta: Implementing a Data Warehouse with Microsoft SQL Server 2012

Enterprise Solutions. Data Warehouse & Business Intelligence Chapter-8

Understanding Data Warehousing. [by Alex Kriegel]

Outline Business Intelligence Roadmap: The Complete Project Lifecycle for Decision-Support Applications

A McKnight Associates, Inc. White Paper: Effective Data Warehouse Organizational Roles and Responsibilities

Agile Business Intelligence Data Lake Architecture

70-467: Designing Business Intelligence Solutions with Microsoft SQL Server

AV-005: Administering and Implementing a Data Warehouse with SQL Server 2014

Microsoft Training and Certification Guide. Current as of March 16, 2015

Business Intelligence. Advanced visualization. Reporting & dashboards. Mobile BI. Packaged BI

Online Courses. Version 9 Comprehensive Series. What's New Series

SQL Server 2012 Gives You More Advanced Features (Out-Of-The-Box)

QAD Business Intelligence Release Notes

Foundations of Business Intelligence: Databases and Information Management

Optimos Enterprise Helpdesk Automation Solution Case Study

IMPLEMENTING A BUSINESS INTELLIGENCE (BI) PROJECT FOR STRATEGIC PLANNING AND DECISION MAKING SUPPORT

Contents. visualintegrator The Data Creator for Analytical Applications. Executive Summary. Operational Scenario

Structure of the presentation

Analance Data Integration Technical Whitepaper

SQL Server 2008 Business Intelligence

Oracle Warehouse Builder 10g

SACRAMENTO CITY UNIFIED SCHOOL DISTRICT Position Description. DEPARTMENT: Technology Services SALARY: Range 13 Salary Schedule A

BUSINESSOBJECTS DATA INTEGRATOR

Master Data Management

CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University

Integrating SAP and non-sap data for comprehensive Business Intelligence

TDWI strives to provide course books that are content-rich and that serve as useful reference documents after a class has ended.

EAI vs. ETL: Drawing Boundaries for Data Integration

ASYST Intelligence South Africa A Decision Inc. Company

Practical meta data solutions for the large data warehouse

QAD Business Intelligence

GENWARE COMPUTER SYSTEMS AUDITING SOLUTION FOR COGNOS BUSINESS INTELLIGENCE

Analance Data Integration Technical Whitepaper

Vertical Data Warehouse Solutions for Financial Services

IT Automation: Evaluate Job Scheduling and Run Book Automation Solutions

For Sales Kathy Hall

EXPLORING THE CAVERN OF DATA GOVERNANCE

Appendix 2-A. Application and System Development Requirements

Transcription:

COMMONWEALTH OF PENNSYLVANIA DEPARTMENT OF PUBLIC WELFARE INFORMATION TECHNOLOGY STANDARD Name Of Standard: Data Warehouse Standards Domain: Enterprise Knowledge Management Number: Category: STD-EKMS001 Data Warehouse Date Issued: 08/30/02 Issued By Direction Of: Date Revised: 04/02/07 James Weaver, Dir of Div of Tech Engineering Abstract: The purpose of this Standard is to establish enterprise-wide standards and guidance for Data Warehousing and its usage. Data Warehousing is a process for building decision support systems and a knowledge-based application environment in support of both everyday tactical decision making and long-term business strategy. Data Warehouses and Data Warehouse applications are designed primarily to support the decision-making process by providing the decision makers with access to accurate and consolidated information from a variety of sources. The primary objective of Data Warehousing is to collate information from disparate sources and place the information in a format conducive to the decision-making process. This objective necessitates a set of activities more complex than merely collecting data and reporting against it. Data Warehousing requires both business and technical expertise and typically involves the following activities: - Accurately identifying information to be placed in the Warehouse; Identifying and prioritizing subject category areas to be included in the Data Warehouse; Managing the scope of each subject area implemented into the Warehouse on an iterative basis; Defining the correct level of consolidation in support of business decision making; Extracting, cleansing, aggregating, transforming, and validating the data to ensure accuracy and consistency; -Defining and capturing metadata on all elements on the Data Warehouse; -Defining record retention criteria and policies for all records on the Data Warehouse; Developing a scaleable architecture to serve as the Data Warehouse s technical and application foundation, and identifying and selecting the hardware/software/ middleware components to implement it; Data Warehouse LifeCycle standards.doc Page 1 of 7

Establishing a refresh program that is consistent with business needs, timing and cycles; Providing powerful and user-friendly tools at the desktop to access the data in the warehouse; Educating users about the realm of possibilities available to them through Data Warehousing; Establishing Data Warehouse support mechanisms and training users to effectively utilize the desktop tools; Establishing processes for maintaining, enhancing, and ensuring the ongoing success and applicability of the Warehouse. General: These standards apply to all developers, both Commonwealth employees and contractors, who develop against the DPW Enterprise Data Warehouse. This policy ensures that all develop and implemented applications will facilitate enterprise-wide interoperability and standardization. Standard: Data Warehouse Technology Components: Since Data Warehousing encompasses many technologies, it is not limited to one specialized area. Typically, the technical aspects of Data Warehousing are divided into the following areas: Extract, Transform and Load (ETL) tools address the process of extracting, transforming, and loading data from various sources into the Data Warehouse using either custom-developed utilities or existing marketplace products. The DPW s standard tool for this technology is Power Center by Informatica. Data Warehouse repository tools address products used for physical storage of Data Warehouse information. These tools range from products currently available in the marketplace; tools used in the Commonwealth; and those recommended for deployment. The DPW Enterprise Data Warehouse is housed in an Oracle database. Enterprise reporting tools address the end-user reporting tools to be used to satisfy reporting requirements. The standard reporting tools for the DPW Enterprise Data Warehouse are from Cognos. Architecture standards address the various architecture patterns and available modeling techniques. ErWin is the DPW s standard tool for modeling. Security standards Security standards address authentication and access to the Data Warehouse for agency employees, and approval and audit processes by authorized agency members, subject to agency/enterprise security and privacy policies. The DPW Enterprise Data Warehouse uses Netegrity Site Minder for authentication and the security features in the Cognos, Oracle and Informatica tools for authorization. Data Warehouse Development: The development of data warehouse subject areas or the addition of elements to an already defined data warehouse subject follows a defined development lifecycle (DWSDLC). Although the phases in a DWSLC are similar to the phases in a typical Systems Development Life Cycle (SDLC) used to develop traditional operational systems the tasks and methodology for completing the tasks in each phase are different. That is because the goals and the data structures of operational system, that supports the Data Warehouse LifeCycle standards.doc Page 2 of 7

running of a business, and a data warehouse, that supports analysis of the business, are different. The following describes the phases and tasks and deliverables to be completed in the DWSDLC. PHASE 1 PLANNING This phase encompasses developing the overall scope and effort it will take to implement the subject area into the data warehouse. Defining the project scope. Creating the project plan. Review and establish the technical infrastructure. Defining the necessary resources. Defining the tasks and deliverables. Initial capacity plan Architecture review The deliverable from this phase is a Project Work Plan developed in Microsoft Project. PHASE 2 GATHERING DATA REQUIREMENTS AND MODELING The tasks for this phase are as follows: Define all user business requirements including o What business questions are to be answered o Record retention strategy o Performance metrics o Criticality of data The deliverable for this phase is General System Design Document. PHASE 3 DETAILED SYSTEM DESIGN This phase translates the user-oriented General System Design into a technical, computer-oriented detailed systems design. Data structures and processing are designed to the level of detail necessary to begin coding and includes the following tasks. Refine Data Requirements Develop Proof of Concept/Prototype Define Appropriate Reporting Applications The Deliverables to be produced in this phase are the DSD (Detailed Systems Design) document, the Requirements Traceability Matrix, and a Logical Data Model. PHASE 4 DATABASE DESIGN AND DEVELOPMENT This phase covers the database design and denormalization, and includes the following tasks. Designing the database, including fact tables, relationship tables and lookup tables. Denormalizing the data. Identifying keys. Data Warehouse LifeCycle standards.doc Page 3 of 7

Creating index strategies. Creating database objects The Deliverables to be produced in this phase is the Data Base Request Form. All DPW Database standards must be followed. This document must be submitted to the Enterprise Knowledge Management Section 2 weeks before a formal review is held. That time will be used to review and supply the submitted required changes before the formal meeting. PHASE 5 DATABASE MAPPING AND TRANSFORMATION This phase is started in conjunction with the database design phase. This phase encompasses identifying the sources of the data, determining the transformation logic and mapping the source data to the target data warehouse table design. Defining the sources systems. Determining file layouts. Developing transformation specifications. Mapping source to target data. Defining the test plan and queries that will be used to validate the data. The deliverables to be produced in this phase are: Metadata Document Extract programs which will be developed in accordance with the System Development Life Cycle standards in place of all application program development Test plan PHASE 6 DATA EXTRACTION AND LOAD This phase covers the actual extraction of the source data, running of the transformation logic, and loading the development database. Developing procedures to extract and move the data. Developing procedures to load the data. Developing extract programs. Developing the transformation logic Develop monthly load programs if necessary Testing the extract, transformation and loading procedures. The Deliverables to be produced in this phase is the Data Extract and Load Procedures Document. PHASE 7 DEVELOPMENT OF BUSINESS INTELLIGENCE APPLICATIONS Data Warehouse LifeCycle standards.doc Page 4 of 7

This phase addresses the development of the reports and cubes that will answer the business questions defined in the Requirements Phase. The development can begin as soon as the database has been defined and created. Developing summary logic Developing OLAP cube Developing Drill-thru reports Developing BI reporting applications Unit testing of all BI applications developed The Deliverables to be produced in this phase are the BI applications themselves and their related metadata and an updated Requirements Traceability Matrix. PHASE 8 TEST PLAN This phase includes the configuring the interface tools that will be used to access that data warehouse and determining the reports and queries that will be used by the testers for data validation and approval of BI reports and cubes. Configuring the interface tools. Defining the test scenarios for user data validation. Defining and develop the reports and queries to be created for data validation Defining test queries for validation of the BI cubes and reports. The deliverable out of this phase is a comprehensive user test plan that is developed by both the developers and the testers PHASE 9 DATA VALIDATION AND TESTING This phase has a few components. It addresses both the user acceptance testing and validation of the data that was loaded into the data warehouse and User acceptance testing of the BI cubes and reports. Standard data validation processes are included throughout the data extraction, transformation, load development and BI report and cube development phases. The test conditions identified in Phase VI should be used during this phase. Once the data has been validated the BI reports and cubes are created and their results are validated by the developers. Then the applications go into User Acceptance testing. UAT of loaded data o Run data validations o Reviewing all errors o Determine how to and correct data that is in error. o Re-Run reports and validate results. UAT of BI cubes and reports o Create cubes and reports Data Warehouse LifeCycle standards.doc Page 5 of 7

o Run queries to validate the cubes and reports The deliverables from this phase are: Test Report User Sign Off PHASE 10 PRE-IMPLEMENTATION/ AUTOMATING PROCESS This phase addresses the automation of the extract, transformation and load process. It can be started anytime after Phase V. Creating operational procedures if applicable. Creating scripts to automate the various processes. Providing information to the help desk. The Deliverables to be produced in this phase are: Operational Procedures Backup and Recovery Procedures Database Scripts PHASE 11 TRAINING This phase focuses on developing training programs for the user community. Users must be trained in the scope and content of the warehouse, the starter set of reports that were developed, the capabilities of the Business Intelligent System, and the interface tools. This phase can start anytime after phase 4. Creating procedures for User Support. Designing training programs or contracting out for those services. The deliverables from this phase are: User Support Procedures Training Plan All training material PHASE 12 ROLLOUT THE DATA WAREHOUSE ADDITIONS This phase includes the necessary tasks to deploy the data warehouse to the user community. Installing components necessary to connect users to the data warehouse. Installing middleware and end user software. Implementing the extract programs and scripts. Implementing the cubes and reports The deliverables for this phase are: Data Warehouse LifeCycle standards.doc Page 6 of 7

Element Transfer Request Form Completed User Request Forms Exemptions from this Standard: There will be no exemptions to this standard. Refresh Schedule: All standards and referenced documentation identified in this standard will be subject to review and possible revision annually or upon request by the DPW Information Technology Standards Team. Standard Revision Log: Change Date Version Change Description Author and Organization 08/30/02 1.0 Initial Creation Unknown 04/02/07 2.0 Revised content and format Arlene DiMarco DTE/EKMS Data Warehouse LifeCycle standards.doc Page 7 of 7