Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization



Similar documents
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

Fluency With Information Technology CSE100/IMT100

Foundations of Business Intelligence: Databases and Information Management

DATA WAREHOUSING AND OLAP TECHNOLOGY

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

Course MIS. Foundations of Business Intelligence

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA

Foundations of Business Intelligence: Databases and Information Management

14. Data Warehousing & Data Mining

Data Warehousing and Data Mining in Business Applications

Data Mining for Successful Healthcare Organizations

Data Mart/Warehouse: Progress and Vision

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

OLAP Theory-English version

CHAPTER SIX DATA. Business Intelligence The McGraw-Hill Companies, All Rights Reserved

Foundations of Business Intelligence: Databases and Information Management

Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT Learning Objectives

Database Marketing, Business Intelligence and Knowledge Discovery

Foundations of Business Intelligence: Databases and Information Management

Part 22. Data Warehousing

CHAPTER 4 Data Warehouse Architecture

Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina

Decision Support and Business Intelligence Systems. Chapter 1: Decision Support Systems and Business Intelligence

An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of

Chapter 3 Data Warehouse - technological growth

DATA ANALYSIS USING BUSINESS INTELLIGENCE TOOL. A Thesis. Presented to the. Faculty of. San Diego State University. In Partial Fulfillment

5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA

Introduction to Data Mining

Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006

A Review of Data Mining Techniques

Data Warehouse Overview. Srini Rengarajan

Course Syllabus For Operations Management. Management Information Systems

Turkish Journal of Engineering, Science and Technology

1. What are the uses of statistics in data mining? Statistics is used to Estimate the complexity of a data mining problem. Suggest which data mining

Lection 3-4 WAREHOUSING

Students who successfully complete the Health Science Informatics major will be able to:

Data Warehousing Systems: Foundations and Architectures

B.Sc (Computer Science) Database Management Systems UNIT-V

Data Warehouse Snowflake Design and Performance Considerations in Business Analytics

Datawarehousing and Business Intelligence

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES

A Knowledge Management Framework Using Business Intelligence Solutions

Importance or the Role of Data Warehousing and Data Mining in Business Applications

ORACLE TAX ANALYTICS. The Solution. Oracle Tax Data Model KEY FEATURES

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data

Applied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA

Chapter 6. Foundations of Business Intelligence: Databases and Information Management

INTERACTIVE DECISION SUPPORT SYSTEM BASED ON ANALYSIS AND SYNTHESIS OF DATA - DATA WAREHOUSE

BUILDING OLAP TOOLS OVER LARGE DATABASES

When to consider OLAP?

Databases in Organizations

Knowledgent White Paper Series. Developing an MDM Strategy WHITE PAPER. Key Components for Success

Week 3 lecture slides

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP ( 28

The Role of Data Warehousing Concept for Improved Organizations Performance and Decision Making

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April ISSN

Data Warehousing and OLAP Technology for Knowledge Discovery

Master of Science in Health Information Technology Degree Curriculum

Principles of Data Mining by Hand&Mannila&Smyth

BENEFITS OF AUTOMATING DATA WAREHOUSING

CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University

ANALYTICS CENTER LEARNING PROGRAM

Chapter 1 Databases and Database Users

Improving Decision Making and Managing Knowledge

An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies

Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley. Chapter 1 Outline

Software Development Training Camp 1 (0-3) Prerequisite : Program development skill enhancement camp, at least 48 person-hours.

Integrating SAP and non-sap data for comprehensive Business Intelligence

Business Intelligence: Effective Decision Making

Chapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem:

Chapter 6 - Enhancing Business Intelligence Using Information Systems

An Overview of Database management System, Data warehousing and Data Mining

The Masters of Science in Information Systems & Technology

Concepts of Database Management Seventh Edition. Chapter 9 Database Management Approaches

DATA WAREHOUSE E KNOWLEDGE DISCOVERY

MDM and Data Warehousing Complement Each Other

A Model-based Software Architecture for XML Data and Metadata Integration in Data Warehouse Systems

Data Warehouse: Introduction

BUSINESS INTELLIGENCE AS SUPPORT TO KNOWLEDGE MANAGEMENT

Delivering Business Intelligence With Microsoft SQL Server 2005 or 2008 HDT922 Five Days

Data Warehousing and Data Mining

Microsoft Business Intelligence

Foundations of Business Intelligence: Databases and Information Management

Data Warehousing Concepts

RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE

Information Management course

Data Mining Solutions for the Business Environment

A Technical Review on On-Line Analytical Processing (OLAP)

Class 2. Learning Objectives

Week 13: Data Warehousing. Warehousing

Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing

Building a Data Warehouse

Business Intelligence

HYPERION MASTER DATA MANAGEMENT SOLUTIONS FOR IT

Transcription:

Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department

Information Sharing a Principle Component of the National Strategy for Homeland Security Vignette Network of systems that provide knowledge integration and distribution Horizontal and vertical information sharing Improved communications Mining of data stored in Web-enabled warehouse 5-2

Data, Information, Knowledge Data Items that are the most elementary descriptions of things, events, activities, and transactions May be internal or external Information Organized data that has meaning and value Knowledge Processed data or information that conveys understanding or learning applicable to a problem or activity 5-3

Data Raw data collected manually or by instruments Quality is critical Quality determines usefulness Contextual data quality Intrinsic data quality Accessibility data quality Representation data quality Often neglected or casually handled Problems exposed when data is summarized 5-4

5-5

Data Cleanse data When populating warehouse Data quality action plan Best practices for data quality Measure results Data integrity issues Uniformity Version Completeness check Conformity check Genealogy or drill-down Data Integration Access needed to multiple sources Often enterprise-wide Disparate and heterogeneous databases XML becoming language standard 5-6

External Data Sources Web Intelligent agents Document management systems Content management systems Commercial databases Sell access to specialized databases 5-7

Database Management Systems Software program Supplements operating system Manages data Queries data and generates reports Data security Combines with modeling language for construction of DSS 5-8

Database Models 5-9 Hierarchical Top down, like inverted tree Fields have only one parent, each parent can have multiple children Fast Network Relationships created through linked lists, using pointers Children can have multiple parents Greater flexibility, substantial overhead Relational Flat, two-dimensional tables with multiple access queries Examines relations between multiple tables Flexible, quick, and extendable with data independence Object oriented Data analyzed at conceptual level Inheritance, abstraction, encapsulation

5-10

Database Models, continued 5-11 Multimedia Based Multiple data formats JPEG, GIF, bitmap, PNG, sound, video, virtual reality Requires specific hardware for full feature availability Document Based Document storage and management Intelligent Intelligent agents and ANN Inference engines

Data Warehouse 5-12 Subject oriented Scrubbed so that data from heterogeneous sources are standardized Time series; no current status Nonvolatile Read only Summarized Not normalized; may be redundant Data from both internal and external sources is present Metadata included Data about data Business metadata Semantic metadata

Architecture May have one or more tiers Determined by warehouse, data acquisition (back end), and client (front end) One tier, where all run on same platform, is rare Two tier usually combines DSS engine (client) with warehouse More economical Three tier separates these functional parts 5-13

5-14

5-15

5-16 Migrating Data Business rules Stored in metadata repository Applied to data warehouse centrally Data extracted from all relevant sources Loaded through data-transformation tools or programs Separate operation and decision support environments Correct problems in quality before data stored Cleanse and organize in consistent manner

5-17 Data Warehouse Design Dimensional modeling Retrieval based Implemented by star schema Central fact table Dimension tables Grain Highest level of detail Drill-down analysis

5-18 Data Warehouse Development Data warehouse implementation techniques Top down Bottom up Hybrid Federated Projects may be data centric or application centric Implementation factors Organizational issues Project issues Technical issues Scalable Flexible

Data Marts Dependent Created from warehouse Replicated Functional subset of warehouse Independent Scaled down, less expensive version of data warehouse Designed for a department or SBU Organization may have multiple data marts Difficult to integrate 5-19

Business Intelligence and Analytics Business intelligence Acquisition of data and information for use in decision-making activities Business analytics Models and solution methods Data mining Applying models and methods to data to identify patterns and trends 5-20

OLAP 5-21 Activities performed by end users in online systems Specific, open-ended query generation SQL Ad hoc reports Statistical analysis Building DSS applications Modeling and visualization capabilities Special class of tools DSS/BI/BA front ends Data access front ends Database front ends Visual information access systems

5-22 Data Mining Organizes and employs information and knowledge from databases Statistical, mathematical, artificial intelligence, and machine-learning techniques Automatic and fast Tools look for patterns Simple models Intermediate models Complex Models

Data Mining Data mining application classes of problems Classification Clustering Association Sequencing Regression Forecasting Others Hypothesis or discovery driven Iterative Scalable 5-23

5-24 Tools and Techniques Data mining Statistical methods Decision trees Case based reasoning Neural computing Intelligent agents Genetic algorithms Text Mining Hidden content Group by themes Determine relationships

Knowledge Discovery in Databases Data mining used to find patterns in data Identification of data Preprocessing Transformation to common format Data mining through algorithms Evaluation 5-25

Data Visualization Technologies supporting visualization and interpretation Digital imaging, GIS, GUI, tables, multidimensions, graphs, VR, 3D, animation Identify relationships and trends Data manipulation allows real time look at performance data 5-26

Multidimensionality Data organized according to business standards, not analysts Conceptual Factors Dimensions Measures Time Significant overhead and storage Expensive Complex 5-27

Analytic systems Real-time queries and analysis Real-time decision-making Real-time data warehouses updated daily or more frequently Updates may be made while queries are active Not all data updated continuously Deployment of business analytic applications 5-28 GIS Systems Computerized system for managing and manipulating data with digitized maps Geographically oriented Geographic spreadsheet for models Software allows web access to maps Used for modeling and simulations

5-29

Web Analytics/Intelligence Web analytics Application of business analytics to Web sites Web intelligence Application of business intelligence techniques to Web sites 5-30