Databases and Information Management



Similar documents
Foundations of Business Intelligence: Databases and Information Management

5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2

TIM 50 - Business Information Systems

Course MIS. Foundations of Business Intelligence

Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT Learning Objectives

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management

Chapter 6. Foundations of Business Intelligence: Databases and Information Management

Chapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem:

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data

n Assignment 4 n Due Thursday 2/19 n Business paper draft n Due Tuesday 2/24 n Database Assignment 2 posted n Due Thursday 2/26

Foundations of Business Intelligence: Databases and Information Management

INFO Koffka Khan. Tutorial 6

Foundations of Business Intelligence: Databases and Information Management

Technology in Action. Alan Evans Kendall Martin Mary Anne Poatsy. Eleventh Edition. Copyright 2015 Pearson Education, Inc.

Data Hierarchy. Traditional File based Approach. Hierarchy of Data for a Computer-Based File

Data. Data and database. Aniel Nieves-González. Fall 2015

Introduction to Computing. Lectured by: Dr. Pham Tran Vu

Demystified CONTENTS Acknowledgments xvii Introduction xix CHAPTER 1 Database Fundamentals CHAPTER 2 Exploring Relational Database Components

CHAPTER SIX DATA. Business Intelligence The McGraw-Hill Companies, All Rights Reserved

Chapter 14: Databases and Database Management Systems

Module 3: File and database organization

Database Management. Technology Briefing. Modern organizations are said to be drowning in data but starving for information p.

B.Sc (Computer Science) Database Management Systems UNIT-V

When to consider OLAP?

Topics. Database Essential Concepts. What s s a Good Database System? Using Database Software. Using Database Software. Types of Database Programs

Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University

Database Design Basics

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

1 File Processing Systems

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

1. INTRODUCTION TO RDBMS

Database Resources. Subject: Information Technology for Managers. Level: Formation 2. Author: Seamus Rispin, current examiner

CHAPTER 7: DATABASE APPLICATIONS AND PRIVACY IMPLICATIONS

Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

History of Database Systems

Relational Databases for the Business Analyst

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Physical Database Design Process. Physical Database Design Process. Major Inputs to Physical Database. Components of Physical Database Design

CSE 132A. Database Systems Principles


DATA MINING AND WAREHOUSING CONCEPTS

7. Databases and Database Management Systems

Benefits of Normalisation in a Data Base - Part 1

Business Intelligence Tutorial

Business Intelligence: Effective Decision Making

How To Write A Diagram

Introduction. A. Bellaachia Page: 1

- Suresh Khanal. Microsoft Excel Short Questions and Answers 1

David M. Kroenke and David J. Auer Database Processing 11 th Edition Fundamentals, Design, and Implementation. Chapter Objectives

Data and Information Management in Public Health

CHAPTER 6 DATABASE MANAGEMENT SYSTEMS. Learning Objectives

University Data Warehouse Design Issues: A Case Study

Course Syllabus For Operations Management. Management Information Systems

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

DATABASE MANAGEMENT SYSTEM

Data W a Ware r house house and and OLAP II Week 6 1

Database Design. Marta Jakubowska-Sobczak IT/ADC based on slides prepared by Paula Figueiredo, IT/DB

Lecture 9 : Business Intelligence and Information Systems for Decision Making

Chapter Managing Knowledge in the Digital Firm

Correct Answer: J Explanation. Explanation/Reference: According to these references, this answer looks correct.

CHAPTER 5. Data and Knowledge Management

Fluency With Information Technology CSE100/IMT100

Concepts of Database Management Eighth Edition. Chapter 1 Introduction to Database Management

Relational Database Basics Review

Chapter 1: Introduction. Database Management System (DBMS) University Database Example

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA

JOURNAL OF OBJECT TECHNOLOGY

14 Databases. Source: Foundations of Computer Science Cengage Learning. Objectives After studying this chapter, the student should be able to:

Data Warehouse: Introduction

SQL Server 2012 Business Intelligence Boot Camp

ETL PROCESS IN DATA WAREHOUSE

VBA and Databases (see Chapter 14 )

SQL Server Training Course Content

10. Creating and Maintaining Geographic Databases. Learning objectives. Keywords and concepts. Overview. Definitions

Dimodelo Solutions Data Warehousing and Business Intelligence Concepts

MASTER DATA MANAGEMENT TEST ENABLER

ECS 165A: Introduction to Database Systems

Computer Information Systems

Oracle 10g PL/SQL Training

C HAPTER 4 INTRODUCTION. Relational Databases FILE VS. DATABASES FILE VS. DATABASES

DBMS / Business Intelligence, SQL Server

DATABASES AND DATABASE USERS

Facilitating Efficient Data Management by Craig S. Mullins

full file at

Overview. Physical Database Design. Modern Database Management McFadden/Hoffer Chapter 7. Database Management Systems Ramakrishnan Chapter 16

Extended RBAC Based Design and Implementation for a Secure Data Warehouse

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

- Eliminating redundant data - Ensuring data dependencies makes sense. ie:- data is stored logically

CHAPTER 5: BUSINESS ANALYTICS

Transcription:

Databases and Information Management Reading: Laudon & Laudon chapter 5 Additional Reading: Brien & Marakas chapter 3-4 COMP 5131 1

Outline Database Approach to Data Management Database Management Systems Improving Business Performance and Decision Making Data Warehouse Data Marts Business Intelligence Managing Data Resources COMP 5131 2

Database Approach Database Collection of related files containing records on people, places, or things Prior to digital databases, business used file cabinets with paper files Entity Generalized category representing person, place, thing on which we store and maintain information Example SUPPLIER, PART Attributes Specific characteristics of each entity,example SUPPLIER name, address PART description, unit price, supplier COMP 5131 3

Relational Database Organize Data into 2D Tables Tables Relations with columns and rows One table for each entity Example CUSTOMER, SUPPLIER, PART, SALES Fields (columns) store data representing an attribute Rows store data for separate records Key field: Uniquely identifies each record Primary key: One field in each table Cannot be duplicated Provides unique identifier for all information in any row COMP 5131 4

Relational Database Relational Database Table A relational database organizes data in the form of two-dimensional tables. Illustrated here is a table for the entity SUPPLIER showing how it represents the entity and its attributes. Supplier_Number is the key field. COMP 5131 5

Relational Database Part Table Data for the entity PART have their own separate table. Part_Number is the primary key and Supplier_Number is the foreign key, enabling users to find related information from the SUPPLIER table about the supplier for each part. COMP 5131 6

Relational Database Establishing Relationships Entity-relationship diagram used to clarify table relationships in a relational database Relational database tables may have: One-to-one relationship One-to-many relationship Many-to-many relationship Requires creating a table (join table, Intersection relation) that links the two tables to join information A Simple Entity Relationship Diagram Relationship between supplier and Part COMP 5131 7

Relational Database Sample Order Report The shaded areas show which data came from the SUPPLIER, LINE_ITEM, and ORDER tables. The database does not maintain data on Extended Price or Order Total because they can be derived from other data in the tables. COMP 5131 8

Relational Database Final Database Design with Sample Records The final design of the database for suppliers, parts, and orders has four tables. The LINE_ITEM table is a join table that eliminates the many-to-many relationship between ORDER and PART. COMP 5131 9

Relational Database Entity-Relationship Diagram for the Database with four Tables This diagram shows the relationship between the entities SUPPLIER, ART, LINE_ITEM, and ORDER. COMP 5131 10

Relational Database Normalization Process of streamlining complex groups of data to Minimize redundant data elements Minimize awkward many-to-many relationships Increase stability and flexibility Referential Entity Rules Used by relational databases to ensure that relationships between coupled tables remain consistent Example When one table has a foreign key that points to another table, you may not add a record to the table with foreign key unless there is a corresponding record in the linked table COMP 5131 11

Database Management Systems DBMS Specific type of software for creating, storing, organizing, and accessing data from a database Separates the logical and physical views of the data Logical view How end users view data Physical view How data are actually structured and organized Examples of DBMS Microsoft Access, DB2, Oracle Database, Microsoft SQL Server, MySQL (Open Source) COMP 5131 12

Database Management Systems HRD Database with Multiple Views Combine tables to deliver data Users Requirement Two tables share a common data element COMP 5131 13

Database Management Systems Operations of a Relational DBMS Select Creates a subset of all records meeting stated criteria Join Combines relational tables to present the ser with more information than is available from individual tables Project Creates a subset consisting of columns in a table Permits user to create new tables containing only desired information COMP 5131 14

Database Management Systems Three Basic Operations of a Relational DBMS The select, project, and join operations enable data from two different tables to be combined and only selected attributes to be displayed. COMP 5131 15

Database Management Systems Capabilities of DBMS Data Definition Capabilities Specify Structure of Contents of Database Data Directory Automated or manual file storing definitions of data elements and their characteristics Query and Data Reporting Data manipulation language Structured query language (SQL) Microsoft Access query-building tools Report generation, example Crystal Reports COMP 5131 16

Database Management Systems Access Data Directory Features COMP 5131 17

Database Management Systems Example of SQL Query COMP 5131 18

Database Management Systems An Access Query COMP 5131 19

Database Management Systems An Access Query COMP 5131 20

Database Management Systems Object-Oriented Database DBMS designed for structured data rows/columns Not suitable for graphics-based or multimedia applications Object-oriented Database OODBMS Stores data and procedures that act on those data as objects to be retrieved and shared Usage Manage multimedia components, Java applets for Web Relatively slow compared to relational DBMS Hybrid Object-relational DBMS Provide capabilities of both types Databases Improves Performance, Better Decisions Tools Data warehousing Multidimensional data analysis Data mining Utilizing Web interfaces to databases COMP 5131 21

Using Database to Improve Performance Data Warehouse Database that stores current and historical data that may be of interest to decision makers Consolidates and standardizes data from many systems, operational and transactional databases Data can be accessed but not altered Data Marts Subset of data warehouses that is highly focused and isolated for a specific population of users Can be constructed more quickly at lower cost Example Company might develop Marketing and Sales Data Mart to deal with customer information COMP 5131 22

Using Database to Improve Performance Components of Data Warehouse The data are combined with data from external sources and reorganized into a central database designed for management reporting and analysis. The information directory provides users with information about the data available in the warehouse. COMP 5131 23

Using Database to Improve Performance Business Intelligence Tools for consolidating, analyzing, and providing access to large amounts of data to improve decision making Software for database reporting and querying Tools for multidimensional data analysis (online analytical processing) Data Mining COMP 5131 24

Using Database to Improve Performance Data Mining Finds hidden patterns, relationships in large databases and infers rules from them to predict future behavior Types of Information Associations Occurrences linked to single event Example Chips with Coke for 65% but 85% when promotion for Coke Sequences Events linked over time Example House purchasing followed by new refrigerator 65% within 2 weeks, oven 45% within one month Classifications Patterns describing a group an item belongs to Example Characteristics of customers who are likely to leave, campaign Clusters Discovering as yet unclassified groupings Forecasting Uses series of values to forecast future values COMP 5131 25

Using Database to Improve Performance Data Mining Applications for all functional areas of business Government, Scientific Applications Usage Patterns in Customer Data Identifying profitable customers or for one-to-one marketing campaigns Predictive Analysis Using data mining techniques, historical data, and assumptions about future conditions to predict outcomes of events, such as the probability a customer will respond to an offer or purchase a specific product Privacy Concerns Usage Create detailed data image about each individual COMP 5131 26

Case Study DNA Databases Crime Fighting Weapon or Threat to Privacy? Questions What are the benefits of DNA databases? What problems do DNA databases pose? Who should be included in a national DNA database? Should it be limited to convicted felons? Explain your answer. Who should be able to use DNA databases? COMP 5131 27

Using Database to Improve Performance Databases and the web Information from Internal Databases Customers View Product Catalog, Place Order Request from HTML Commands SQL for DBMS Processing (database server) Software make this possible Web server Application servers or CGI Database server Advantage of using web to access internal databases Much less training to employees Few or no changes in internal databases Savings over redesigning and rebuilding legacy systems COMP 5131 28

Managing Data Resources Policies and Procedures for Data Management Information Policy Organization s rules Sharing, Disseminating, Acquiring, Classifying, Inventorying information Example Right to change/view sensitive employee data Data Administration Database design and management group responsible for defining and organizing the structure and content of the database, and maintaining the database Specific policies and procedures for data management Responsibilities Developing information policy, defining and organizing structure and content of database, planning for data, data directory development, Overseeing logical database design COMP 5131 29

Managing Data Resources Ensuring Data Quality Poor Data Quality Major problem for successful customer management relationship About 20% of US mail and packages are returned because of incorrect names or addresses Why Data Quality Problems? Redundant and inconsistent data produced by multiple systems Data input errors Major data quality problems Data Quality Audit Structured survey of the accuracy and completeness of data Data Cleansing Detects and corrects incorrect, incomplete, improperly formatted, and redundant data Specialized data cleansing software Automatically survey data files, correct errors in the data, integrate data into company wide format COMP 5131 30