Overview of Database Management



Similar documents
Overview of Data Management

Distributed Database Systems. Prof. Dr. Carl-Christian Kanne

Chapter 1: Introduction

Chapter 1: Introduction. Database Management System (DBMS) University Database Example

CS2Bh: Current Technologies. Introduction to XML and Relational Databases. Introduction to Databases. Why databases? Why not use XML?

CSE 132A. Database Systems Principles

1 File Processing Systems

History of Database Systems

Introduction. Introduction: Database management system. Introduction: DBS concepts & architecture. Introduction: DBS versus File system

Introduction: Database management system

Database Management. Chapter Objectives

Overview of Database Management Systems

INTRODUCTION DATABASE MANAGEMENT SYSTEMS

ECS 165A: Introduction to Database Systems

Introduction to Databases

Principles of Database. Management: Summary

Chapter 1: Introduction. Database Management System (DBMS)

Module 4 Creation and Management of Databases Using CDS/ISIS

Chapter 1: Introduction

Overview. Introduction to Database Systems. Motivation... Motivation: how do we store lots of data?

What is a database? COSC 304 Introduction to Database Systems. Database Introduction. Example Problem. Databases in the Real-World

DATABASE MANAGEMENT SYSTEM

DATABASES AND DATABASE USERS

Database System Architecture & System Catalog Instructor: Mourad Benchikh Text Books: Elmasri & Navathe Chap. 17 Silberschatz & Korth Chap.

COMP5138 Relational Database Management Systems. Databases are Everywhere!

Week 1 Part 1: An Introduction to Database Systems. Databases and DBMSs. Why Use a DBMS? Why Study Databases??

ICOM 6005 Database Management Systems Design. Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001

Topics. Introduction to Database Management System. What Is a DBMS? DBMS Types

Introduction to database management systems

Database System Concepts

Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley. Chapter 1 Outline

æ A collection of interrelated and persistent data èusually referred to as the database èdbèè.

Database Design and Programming

Chapter 1 Databases and Database Users

Contents RELATIONAL DATABASES

Introduction to Object-Oriented and Object-Relational Database Systems

DBMS Questions. 3.) For which two constraints are indexes created when the constraint is added?

Database Systems. Lecture 1: Introduction

Database Management Systems

Basic Concepts of Database Systems

Files. Files. Files. Files. Files. File Organisation. What s it all about? What s in a file?

14 Databases. Source: Foundations of Computer Science Cengage Learning. Objectives After studying this chapter, the student should be able to:

DATABASE SYSTEM CONCEPTS AND ARCHITECTURE CHAPTER 2

CSE 233. Database System Overview

Chapter 2: DDBMS Architecture

ISM 318: Database Systems. Objectives. Database. Dr. Hamid R. Nemati

Databases and BigData

VALLIAMMAI ENGNIEERING COLLEGE SRM Nagar, Kattankulathur

Module 3: File and database organization

Data Hierarchy. Traditional File based Approach. Hierarchy of Data for a Computer-Based File

Database Concepts. Database & Database Management System. Application examples. Application examples

Relational Database Basics Review

Introduction to Database Systems

1. INTRODUCTION TO RDBMS

Lesson 8: Introduction to Databases E-R Data Modeling


CHAPTER 6 DATABASE MANAGEMENT SYSTEMS. Learning Objectives

Chapter 2. Data Model. Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel

10. Creating and Maintaining Geographic Databases. Learning objectives. Keywords and concepts. Overview. Definitions

Course Notes on A Short History of Database Technology

Course Notes on A Short History of Database Technology

Outline. Distributed DBMSPage 4. 1

Chapter 2 Database System Concepts and Architecture

B.Sc (Computer Science) Database Management Systems UNIT-V

Introduction to Database Systems. Chapter 1 Introduction. Chapter 1 Introduction

BCA. Database Management System

Core Syllabus. Version 2.6 B BUILD KNOWLEDGE AREA: DEVELOPMENT AND IMPLEMENTATION OF INFORMATION SYSTEMS. June 2006

Data and Databases. Technology Guides T3.1

Distributed Database Design (Chapter 5)

Study Notes for DB Design and Management Exam 1 (Chapters 1-2-3) record A collection of related (logically connected) fields.

Database System Concepts, Implementations and Organizations-A Detailed Survey

Object Oriented Databases. OOAD Fall 2012 Arjun Gopalakrishna Bhavya Udayashankar

B.Com(Computers) II Year RELATIONAL DATABASE MANAGEMENT SYSTEM Unit- I

Relational Database Systems 2 1. System Architecture

COURSE NAME: Database Management. TOPIC: Database Design LECTURE 3. The Database System Life Cycle (DBLC) The database life cycle contains six phases;

Database Fundamentals

DATA WAREHOUSING AND OLAP TECHNOLOGY

Carnegie Mellon Univ. Dept. of Computer Science Database Applications. Outline. We ll learn: Faloutsos CMU SCS

Data Models and Database Management Systems (DBMSs) Dr. Philip Cannata

3. Relational Model and Relational Algebra

COMPONENTS in a database environment

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

The Relational Model. Ramakrishnan&Gehrke, Chapter 3 CS4320 1

Database Systems Introduction Dr P Sreenivasa Kumar

Database System. Session 1 Main Theme Introduction to Database Systems Dr. Jean-Claude Franchitti

Chapter 1 File Organization 1.0 OBJECTIVES 1.1 INTRODUCTION 1.2 STORAGE DEVICES CHARACTERISTICS

Concepts of Database Management Seventh Edition. Chapter 9 Database Management Approaches

Distributed Databases in a Nutshell

In-memory databases and innovations in Business Intelligence

Database Design. Marta Jakubowska-Sobczak IT/ADC based on slides prepared by Paula Figueiredo, IT/DB

Introductory Concepts

Bridge from Entity Relationship modeling to creating SQL databases, tables, & relations

Objectives of Lecture 1. Labs and TAs. Class and Office Hours. CMPUT 391: Introduction. Introduction

Introduction to Database Systems. Module 1, Lecture 1. Instructor: Raghu Ramakrishnan UW-Madison

Database Management Systems: Relational, Object-Relational, and Object-Oriented Data Models COT/4-02-V1.1

Author: Abhishek Taneja

Transcription:

Overview of Database Management M. Tamer Özsu David R. Cheriton School of Computer Science University of Waterloo CS 348 Introduction to Database Management Fall 2012 CS 348 Overview of Database Management Fall 2012 1 / 29

What is Data? ANSI definition of data: 1 A representation of facts, concepts, or instructions in a formalized manner suitable for communication, interpretation, or processing by humans or by automatic means. 2 Any representation such as characters or analog quantities to which meaning is or might be assigned. Generally, we perform operations on data or data items to supply some information about an entity. Volatile vs persistent data Our concern is primarily with persistent data CS 348 Overview of Database Management Fall 2012 2 / 29

Early Data Management Ancient History Data are not stored on disk One data set per program. High data redundancy PROGRAM 1! Data! Management! DATA SET 1! PROGRAM 2! Data! Management! DATA SET 2! PROGRAM 3! Data! Management! DATA SET 3! CS 348 Overview of Database Management Fall 2012 3 / 29

File Processing More Recent History Data are stored in files with interface between programs and files. Various access methods exist (e.g., sequential, indexed, random). One file corresponds to one or several programs. PROGRAM 1! Data! Management! PROGRAM 2! Data! Management! PROGRAM 3! Data! Management! File! System! Services! FILE 1! FILE 2! Redundant Data! CS 348 Overview of Database Management Fall 2012 4 / 29

Database Approach PROGRAM 1 DBMS Query Processor Transaction Mgr Integrated Database PROGRAM 2 CS 348 Overview of Database Management Fall 2012 5 / 29

What is a Database? Definition (Database) A large and persistent collection of (more-or-less similar) pieces of information organized in a way that facilitates efficient retrieval and modification. The structure of the database is determined by the abstract data model that is used Examples: a file cabinet a library system a personnel management system Definition (Database Management System (DBMS)) A program (or set of programs) that manages details related to storage and access for a database. CS 348 Overview of Database Management Fall 2012 6 / 29

Data Model Formalism that defines what the structure of the data are =) schema within a file between files File systems can at best specify data organization within one file Alternatives for business data Hierarchical; network Relational Object-oriented (more recently object-relational) CS 348 Overview of Database Management Fall 2012 7 / 29

Schema and Instance Definition (Schema) A schema is a description of the data interface to the database (i.e., how the data is organized). Definition (Instance) A database instance is a database (real data) that conforms to a given schema. A schema can have many instances. CS 348 Overview of Database Management Fall 2012 8 / 29

Example Relational Schema EMP(ENO, ENAME, TITLE) PROJ(PNO, PNAME, BUDGET) WORKS(ENO, PNO, RESP, DUR) Instance EMP ENO ENAME TITLE E1 J. Doe Elect. Eng E2 M. Smith Syst. Anal. E3 A. Lee Mech. Eng. E4 J. Miller Programmer E5 B. Casey Syst. Anal. E6 L. Chu Elect. Eng. E7 R. Davis Mech. Eng. E8 J. Jones Syst. Anal. PROJ PNO PNAME BUDGET P1 Instrumentation 150000 P2 Database Develop. 135000 P3 CAD/CAM 250000 P4 Maintenance 310000 WORKS ENO PNO RESP DUR E1 P1 Manager 12 E2 P1 Analyst 24 E2 P2 Analyst 6 E3 P3 Consultant 10 E3 P4 Engineer 48 E4 P2 Programmer 18 E5 P2 Manager 24 E6 P4 Manager 48 E7 P3 Engineer 36 E8 P3 Manager 40 CS 348 Overview of Database Management Fall 2012 9 / 29

Application of Databases Original More recent inventory control payroll banking and financial systems reservation systems computer aided design (CAD) software development (CASE, SDE/SSE) telecommunication systems e-commerce dynamic/personalized web content CS 348 Overview of Database Management Fall 2012 10 / 29

Application of Databases (cont d) Common Circumstances: There is lots of data (mass storage) Data is formatted Requirements: persistence and reliability efficient and concurrent access Issues: many files with different structure shared files or replicated data need to exchange data (translation programs) CS 348 Overview of Database Management Fall 2012 11 / 29

Why Database Technology? Data constitute an organizational asset =) Integrated control Reduction of redundancy Avoidance of inconsistency Data integrity Sharability Improved security Programmer productivity =) Data Independence Programmers do not have to deal with data organization CS 348 Overview of Database Management Fall 2012 12 / 29

Programmer Productivity High data independence Total System Cost New system development Maintenance Time CS 348 Overview of Database Management Fall 2012 13 / 29

Brief History of Data Management: Ancient 2000 BC: Sumerian Records 350 BC: Syllogisms (Aristotle) 296 BC: Library of Alexandria 1879: Modern Logic (Frege) 1884: U.S. Census (Hollerith) 1941: Model Theory (Tarski) CS 348 Overview of Database Management Fall 2012 14 / 29

Brief History of Data Management: 1950s First generation 50 s and 60 s batch processing sequential files and tape input on punched cards Second generation (60 s) disk enabled random access files new access methods (ISAM, hash files) mostly batch with some interactivity independent applications with separate files growing applications base CS 348 Overview of Database Management Fall 2012 15 / 29

Brief History of Data Management: 1960s (cont d) As the application base grows, we end up with many shared files a multitude of file structures a need to exchange data among applications This causes a variety of problems redundancy: multiple copies inconsistency: independent updates inaccuracy: concurrent updates incompatibility: multiple formats insecurity: proliferation inauditability: poor chain of responsibility inflexibility: changes are difficult to apply CS 348 Overview of Database Management Fall 2012 16 / 29

Brief History of Data Management: 1960s (cont d) Hierarchical data model IBM s Information Management System (IMS): concurrent access only allows 1:N parent-child relationships (i.e. a tree) hierarchy can be exploited for efficiency queries navigate up and down trees one record at a time data access language embedded in business processing language difficult to express some queries Network data model Charles Bachman s Integrated Data Store (IDS) model standardized by Conference On DAta SYstems Languages (CODASYL) data organized as collections of sets of records separation of physical data representation from users view of data pointers between records represent relationships set types encoded as lists queries navigate between records still one record at a time CS 348 Overview of Database Management Fall 2012 17 / 29

Database Management System Idea Abstracts common functions and creates a uniform well defined interface for applications accessing data. 1 Data Model all data stored in a well defined way 2 Access control only authorized people get to see/modify it 3 Concurrency control multiple concurrent applications access data 4 Database recovery nothing gets accidentally lost 5 Database maintenance CS 348 Overview of Database Management Fall 2012 18 / 29

Brief History of Data Management: 1970s Edgar Codd proposes relational data model (1970) firm mathematical foundation! declarative queries Charles Bachman wins ACM Turing award (1973) The Programmer as Navigator Peter Chen proposes E-R model (1976) Transaction concepts (Jim Gray and others) IBM s System R and UC Berkeley s Ingres systems demonstrate feasibility of relational DBMS (late 1970s) CS 348 Overview of Database Management Fall 2012 19 / 29

Three Level Schema Architecture Definition (Schema) A schema is a description of the data interface to the database (i.e., how the data is organized). 1 External schema (view): what the application programs and user see. May differ for different users of the same database. 2 Conceptual schema: description of the logical structure of all data in the database. 3 Physical schema: description of physical aspects (selection of files, devices, storage algorithms, etc.) CS 348 Overview of Database Management Fall 2012 20 / 29

Three-level Schema Architecture (cont.) Users/ Applications External Schema External view External view External view Conceptual Schema Conceptual view Internal Schema DBMS Internal view Database CS 348 Overview of Database Management Fall 2012 21 / 29

Data Independence Idea Applications do not access data directly but, rather through an abstract data model provided by the DBMS. Two kinds of data independence: Physical: applications immune to changes in storage structures Logical: applications immune to changes in data organization Note One of the most important reasons to use a DBMS! CS 348 Overview of Database Management Fall 2012 22 / 29

Interfacing to the DBMS Data Definition Language (DDL): for specifying schemas may have different DDLs for external schema, conceptual schema, internal schema information is stored in the data dictionary, or catalog Data Manipulation Language (DML): for specifying queries and updates navigational (procedural) non-navigational (declarative) CS 348 Overview of Database Management Fall 2012 23 / 29

Types of Database Users End user: Accesses the database indirectly through forms or other query-generating applications, or Generates ad-hoc queries using the DML. Application developer: Designs and implements applications that access the database. Database administrator (DBA): Manages conceptual schema. Assists with application view integration. Monitors and tunes DBMS performance. Defines internal schema. Loads and reformats database. Is responsible for security and reliability. CS 348 Overview of Database Management Fall 2012 24 / 29

Transactions When multiple applications access the same data, undesirable results occur. Example: withdraw(ac,1000) Bal := getbal(ac) if (Bal>1000) <give-money> setbal(ac,bal-1000) withdraw(ac,500) Bal := getbal(ac) if (Bal>500) <give-money> setbal(ac,bal-500) Idea Every application may think it is the sole application accessing the data. The DBMS should guarantee correct execution. CS 348 Overview of Database Management Fall 2012 25 / 29

Transactions (cont d) Definition (Transaction) An application-specified atomic and durable unit of work. Properties of transactions ensured by the DBMS: Atomic: a transaction occurs entirely, or not at all Consistency: Isolated: Durable: each transaction preserves the consistency of the database concurrent transactions do not interfere with each other once completed, a transaction s changes are permanent CS 348 Overview of Database Management Fall 2012 26 / 29

Brief History of Data Management: 1980s Development of commercial relational technology IBM DB2, Oracle, Informix, Sybase Edgar Codd wins ACM Turing award (1981) SQL standardization efforts through ANSI and ISO Object-oriented DBMSs (late 1980 s to middle of 1990 s) persistent objects object id s, methods, inheritence navigational interface reminicent of hierarchical model CS 348 Overview of Database Management Fall 2012 27 / 29

Brief History of Data Management: 1990s-Present Continued expansion of SQL and system capabilities New application areas: the Internet On-Line Analytic Processing (OLAP) data warehousing embedded systems multimedia XML data streams Jim Gray wins ACM Turing award (1998) Relational DBMSs incorporate objects (late 1990s) CS 348 Overview of Database Management Fall 2012 28 / 29

Summary Using a DBMS to manage data helps: to remove common code from applications to provide uniform access to data to guarantee data integrity to manage concurrent access to protect against system failure to set access policies for data CS 348 Overview of Database Management Fall 2012 29 / 29