Data Mart/Warehouse: Progress and Vision



Similar documents
Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Fluency With Information Technology CSE100/IMT100

Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

DATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS

SAS BI Course Content; Introduction to DWH / BI Concepts

Data Mining for Successful Healthcare Organizations

CHAPTER SIX DATA. Business Intelligence The McGraw-Hill Companies, All Rights Reserved

Data Warehousing and Data Mining in Business Applications

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP ( 28

Data Warehousing and OLAP Technology for Knowledge Discovery

Data Warehousing: A Technology Review and Update Vernon Hoffner, Ph.D., CCP EntreSoft Resouces, Inc.

Business Intelligence System for Monitoring, Analysis and Forecasting of Socioeconomic Development of Russian Territories

Enterprise Solutions. Data Warehouse & Business Intelligence Chapter-8

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES

Turkish Journal of Engineering, Science and Technology

OLAP Theory-English version

Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA

Week 3 lecture slides

Implementing Business Intelligence at Indiana University Using Microsoft BI Tools

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina

Datawarehousing and Business Intelligence

DATA MINING AND WAREHOUSING CONCEPTS

Technology-Driven Demand and e- Customer Relationship Management e-crm

Part 22. Data Warehousing

SAS Business Intelligence Online Training

14. Data Warehousing & Data Mining

The Role of the BI Competency Center in Maximizing Organizational Performance

A Knowledge Management Framework Using Business Intelligence Solutions

An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of

Business Intelligence

An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies

Knowledge Discovery and Data. Data Mining vs. OLAP

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

POLAR IT SERVICES. Business Intelligence Project Methodology

Business Intelligence Solutions for Gaming and Hospitality

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

Data Warehouse: Introduction

Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University

IAF Business Intelligence Solutions Make the Most of Your Business Intelligence. White Paper November 2002

Data warehouse and Business Intelligence Collateral

Self-Service Business Intelligence: The hunt for real insights in hidden knowledge Whitepaper

B.Sc (Computer Science) Database Management Systems UNIT-V

Importance or the Role of Data Warehousing and Data Mining in Business Applications

BUILDING OLAP TOOLS OVER LARGE DATABASES

Week 13: Data Warehousing. Warehousing

DSS based on Data Warehouse

Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers

Data Warehousing Systems: Foundations and Architectures

Data Warehousing Concepts

DATA WAREHOUSING AND OLAP TECHNOLOGY

8. Business Intelligence Reference Architectures and Patterns

Business Intelligence Solutions. Cognos BI 8. by Adis Terzić

Business Intelligence and Healthcare

UC Berkeley Data Warehouse Roadmap. Data Warehouse Architecture

OLAP. Business Intelligence OLAP definition & application Multidimensional data representation

An Overview of Database management System, Data warehousing and Data Mining

IBM Cognos Training: Course Brochure. Simpson Associates: SERVICE associates.co.uk

Foundations of Business Intelligence: Databases and Information Management

Data Warehousing and Data Mining

Foundations of Business Intelligence: Databases and Information Management

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery

Open Source Business Intelligence Intro

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Talend Metadata Manager. Reduce Risk and Friction in your Information Supply Chain

CHAPTER 4 Data Warehouse Architecture

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

Dashboards PRESENTED BY: Quaid Saifee Director, WIT Inc.

The Role of Data Warehousing Concept for Improved Organizations Performance and Decision Making

Integrating SAP and non-sap data for comprehensive Business Intelligence

Data Warehousing. Jens Teubner, TU Dortmund Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1

MDM and Data Warehousing Complement Each Other

Subject Description Form

A Design and implementation of a data warehouse for research administration universities

SQL Server 2012 Business Intelligence Boot Camp

Course MIS. Foundations of Business Intelligence

Dimensional Modeling for Data Warehouse

Data Mining Solutions for the Business Environment

ANALYTICS CENTER LEARNING PROGRAM

MIS636 AWS Data Warehousing and Business Intelligence Course Syllabus

SENG 520, Experience with a high-level programming language. (304) , Jeff.Edgell@comcast.net

Business Intelligence, Analytics & Reporting: Glossary of Terms

5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2

2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000

Turnkey Hardware, Software and Cash Flow / Operational Analytics Framework

Data Warehousing, OLAP, and Data Mining

Data Migration. How CXAIR can be used to improve the efficiency and accuracy of data migration. A CXAIR White Paper.

An Approach for Facilating Knowledge Data Warehouse

DECISION SUPPORT SYSTEMS OR BUSINESS INTELLIGENCE. WHICH IS THE BEST DECISION MAKER?

Chapter 6 - Enhancing Business Intelligence Using Information Systems

Justice Data Warehousing and Court Business Intelligence. Technical Introduction. Harris County Courts

Transcription:

Data Mart/Warehouse: Progress and Vision Institutional Research and Planning University Information Systems

What is data warehousing? A data warehouse: is a single place that contains complete, accurate data from multiple sources, making data analysis easier. can streamline complex administrative functions and support the decision-making process. is specifically designed for querying and reporting. is separate from operational/legacy systems (e.g. SIS).

How data warehousing works Information is collected from a variety of different sources and formats. The data is cleaned transformed into a common format. The cleaned data is added to the warehouse, and made available through data marts. Data is frozen in time for comparative and forecasting purposes. Users can then write queries or make reports using the data warehouse, instead of linking separate databases

IRP Data Mart/Warehouse SIS (completed) HRS data IRP data selection and design Data Warehouse FRS data UIS Software/hardware support and tools Reports Queries President Administration Provost Deans Chairs

Use of data warehousing Reporting Consistency and standardization of data. Users only have to learn how to use one database to write queries and reports and display data in graphical form. Internal Enrollment, retention & graduation rates, etc. External IPEDS, US News & World Report, Princeton Review

Data Driven Decision Making Monitor data Data warehousing helps monitor important university processes. Rapid queries Users can generate ad-hoc reports on the fly without needing to spend time writing and rewriting the code needed to extract data from multiple sources Trend Analysis Administrators can easily analyze admissions and enrollment trends using historical data. Daily Decision Making Increase the efficiency of the daily functioning of departments and services

Benefits of Data Warehouse Competitive advantage Administrators can respond to changing conditions based on data. Consistent public image Reputation for quality and excellence Enhance public image Facilitates accreditation By using the data warehouse to drill down into the college level and department level, more detailed reports can be generated for accreditation

IRP s Data Mart Current data SIS Enrollment Degrees Admissions Graduation Retention Databases to be added Research expenditures (2 years) Course distribution (2 semesters w/ ability to go back further) Faculty file to 1995 (in progress) Staff file (in process of being cleaned) Budget file (in conceptual design stage) Athletics (in progress) Faculty Load (in development) Student services (in the future) Housing (in the future)

Data source analysis diagram NJIT Student Information System (SIS) Enrollment & Degree Data Other Data Commission of Higher Education (CHE) Official Data Student Information Data Warehouse (SIDW)

Architecture model Source Systems Application (Distributed) Focus Program - Enrollment Files - Degree Files NJIT-SIS Connx Program - Other Files CHE (Government) SURE File Security Service - Log on Administration - Authentication - Authorization ROLAP Service - Query Management - Data Analyze - Aggregate Navigate End User Application Client Advanced Analysis Client Back Room Data Staging Area T3 connection E-Mail E-Mail Base Level Presentation Repository Front Room Source Models Target Models Source Target Mappings & Cleansing Staging Files - Dimension sets - Facts Metadata Catalog Source Models Report Library Target Models Code Definitions Official Data Definitions Data Staging Services - Load - Transformation - Cleansing - Export - Metadata exchange Cleaned Data Bulk Loader Data Warehouse Data Staging Administrator Client Database Administrator Client Backup / Replicate

Appendix Current IRP Data Mart Screens

Screenshot: Predefined query

Screenshot: Enrollment data query

Screenshot: Enrollment cross-tab

Screenshot: Admissions data query

Screenshot: Admissions cross-tab

Screenshot: Graduation rate query

Screenshot: Graduation rate cross-tab

Screenshot: Retention rate query

Screenshot: Retention rate cross-tab

Screenshot: Student life query

Screenshot: Miscellaneous queries

Components of Data Warehouse

Extract, Transform and Load (ETL)

DW Summary: Characteristics Subject-orientation integrated non-volatile (i.e. not updated) time variant (kept for long periods, for forecasting and trend analysis) summarized large volume not normalized metadata data sources

Data Warehouse Process

Data Warehouse Tools Intelligent Agents and Agencies - tools work and think for user. Query Facilities and Managed Query environments. Statistical Analysis - One of the biggest surprises in the data warehousing marketplace is the resurgence of interest in traditional statistical analysis, and the concomitant resurrection of the popularity to products like SAS and SPSS.

Data Warehouse Tools Data Discovery - A large class of tools formerly classified as decision support, artificial intelligence and expert systems. They now make use of neural networks, fuzzy logic, decision trees, and other tools from advanced mathematics to allow a user to sift through massive amounts of raw data to discover new, interesting, insightful, and in many cases useful things about the organization, its operations, and its markets. There are many different data discovery tools/products on the market.

Data Warehouse Tools OLAP - On-line Analytical Processing: often uses multi-dimensional spreadsheet tools allowing users to look at information from many different angles. Users are able to slice and dice reports and to look at the same kinds of information at different levels at the same time. Typical OLAP application might allow a product manager to view sales figures for a given product at the national level, see them broken down by division, drill down to see territories within a division, check sales numbers for each store within a territory, and then compare them against sales of stores from another territory.

Data Warehouse Tools Data Mining Provides for: Knowledge discovery in databases Knowledge extraction Data archeology Data exploration Data pattern processing Data dredging Information harvesting

Data Mart Timeline: Project Planning/Management/Policy Development Environment Software Installation Data Inventory & Quality Assessment Database Design Application Design Data Scrubbing & Loading Application Development Testing Documentation Rollout Complete Project Month1 Month2 Month3 Month4 Month5 Month6