CSE 233. Database System Overview



Similar documents
CSE 132A. Database Systems Principles

Chapter 1: Introduction

Chapter 1: Introduction. Database Management System (DBMS)

Chapter 1: Introduction. Database Management System (DBMS) University Database Example

Introduction to database management systems

Database System Concepts

Chapter 1: Introduction

Lesson 8: Introduction to Databases E-R Data Modeling

ECS 165A: Introduction to Database Systems

1 File Processing Systems

DATABASE MANAGEMENT SYSTEMS. Question Bank:

What is a database? COSC 304 Introduction to Database Systems. Database Introduction. Example Problem. Databases in the Real-World

Core Syllabus. Version 2.6 B BUILD KNOWLEDGE AREA: DEVELOPMENT AND IMPLEMENTATION OF INFORMATION SYSTEMS. June 2006

CS2Bh: Current Technologies. Introduction to XML and Relational Databases. Introduction to Databases. Why databases? Why not use XML?

Contents RELATIONAL DATABASES

Introduction to Databases

Database Systems. Session 1 Main Theme Introduction to Database Systems Dr. Jean-Claude Franchitti

Chapter 1 Databases and Database Users

Database Concepts. Database & Database Management System. Application examples. Application examples

Chapter 2 Database System Concepts and Architecture

Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley. Chapter 1 Outline

Database System. Session 1 Main Theme Introduction to Database Systems Dr. Jean-Claude Franchitti

Overview of Data Management

Topics in basic DBMS course

Database Management Systems

VALLIAMMAI ENGNIEERING COLLEGE SRM Nagar, Kattankulathur

æ A collection of interrelated and persistent data èusually referred to as the database èdbèè.

Concepts of Database Management Seventh Edition. Chapter 9 Database Management Approaches

Author: Abhishek Taneja

Overview of Database Management

Demystified CONTENTS Acknowledgments xvii Introduction xix CHAPTER 1 Database Fundamentals CHAPTER 2 Exploring Relational Database Components

CS2Bh: Current Technologies. Introduction to XML and Relational Databases. The Relational Model. The relational model

not necessarily strictly sequential feedback loops exist, i.e. may need to revisit earlier stages during a later stage

TIM 50 - Business Information Systems

DATABASE SYSTEM CONCEPTS AND ARCHITECTURE CHAPTER 2

5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2

Database Management. Chapter Objectives

Database Systems. Lecture 1: Introduction

Introductory Concepts

SQL Databases Course. by Applied Technology Research Center. This course provides training for MySQL, Oracle, SQL Server and PostgreSQL databases.

INTRODUCTION DATABASE MANAGEMENT SYSTEMS

Databases What the Specification Says

Database Management Systems. Chapter 1

Introduction. Introduction: Database management system. Introduction: DBS concepts & architecture. Introduction: DBS versus File system

COIS Databases

Chapter 10 Practical Database Design Methodology and Use of UML Diagrams

Introduction: Database management system

Principles of Database. Management: Summary

Database System Architecture & System Catalog Instructor: Mourad Benchikh Text Books: Elmasri & Navathe Chap. 17 Silberschatz & Korth Chap.

CSE 530A Database Management Systems. Introduction. Washington University Fall 2013

Databases. DSIC. Academic Year

We know how to query a database using SQL. A set of tables and their schemas are given Data are properly loaded

Database Design and Programming

CSE 544 Principles of Database Management Systems. Magdalena Balazinska (magda) Winter 2009 Lecture 1 - Class Introduction

Databases and BigData

COMP5138 Relational Database Management Systems. Databases are Everywhere!

Introduction to SQL ( )

Database 10g Edition: All possible 10g features, either bundled or available at additional cost.

DBMS / Business Intelligence, SQL Server

Chapter 3. Database Environment - Objectives. Multi-user DBMS Architectures. Teleprocessing. File-Server

Diploma Of Computing

Course Notes on A Short History of Database Technology

Course Notes on A Short History of Database Technology

The Relational Model. Why Study the Relational Model? Relational Database: Definitions

Computer Information Systems

CSE 544 Principles of Database Management Systems. Magdalena Balazinska (magda) Fall 2007 Lecture 1 - Class Introduction

GUJARAT TECHNOLOGICAL UNIVERSITY, AHMEDABAD, GUJARAT. COURSE CURRICULUM COURSE TITLE: DATABASE MANAGEMENT (Code: ) Information Technology

DBMS Questions. 3.) For which two constraints are indexes created when the constraint is added?


SQL is capable in manipulating relational data SQL is not good for many other tasks

How To Improve Performance In A Database

Databases and Information Management

Object Oriented Databases. OOAD Fall 2012 Arjun Gopalakrishna Bhavya Udayashankar

Introduction to Database Systems. Chapter 1 Introduction. Chapter 1 Introduction

Bridge from Entity Relationship modeling to creating SQL databases, tables, & relations

Data-intensive HPC: opportunities and challenges. Patrick Valduriez

CSE 544 Principles of Database Management Systems. Magdalena Balazinska (magda) Spring 2006 Lecture 1 - Class Introduction

Foundations of Business Intelligence: Databases and Information Management

3. Relational Model and Relational Algebra

Foundations of Business Intelligence: Databases and Information Management

Basic Concepts of Database Systems

Bachelor of Science in Business Administration - Information Systems and Technology Major

Oracle USF

1. INTRODUCTION TO RDBMS

Modern Databases. Database Systems Lecture 18 Natasha Alechina

M.S. Computer Science Program

BCA. Database Management System

Course Notes on Databases and Database Management Systems

Bachelor Degree in Informatics Engineering Master courses

Overview of Database Management Systems

Introduction to Object-Oriented and Object-Relational Database Systems

Data. Data and database. Aniel Nieves-González. Fall 2015

Relational Database Concepts

The Relational Model. Why Study the Relational Model? Relational Database: Definitions. Chapter 3

Foundations of Information Management

Transcription:

CSE 233 Database System Overview 1

Data Management An evolving, expanding field: Classical stand-alone databases (Oracle, DB2, SQL Server) Computer science is becoming data-centric: web knowledge harvesting, crowd sourcing, cloud computing, scientific databases, networks, data mining, streaming sensor monitoring, social networks, bioinformatics, geographic information systems, digital libraries, data-driven business processes Classical database concepts and algorithms continue to provide the core technology 2

What is a database? Persistent data Query and update language for accessing and modifying data Query optimization Transactions and concurrency control What kind of data? Emphasis: many instances of similarly structured data Examples: Airline reservations: database (large set of similar records) Computerized library: information retrieval Medication advisor: expert system 3

Top Level Goals of a Database System Provide users with a meaning-based view of data shield from irrelevant detail abstract view Support operations on data queries, updates Provide data control integrity, protection concurrency, recovery 4

Database System Tailored to specific application Database Management System Generalized DB system used in variety of application environments common approach to data organization data storage data access data control e.g. Ingres/Postgres, DB2, Oracle, SQL Server, MySQL, etc. 5

Levels of Abstraction Logical level: describes data stored in database in terms close to the application type customer = record customer_id : string; customer_name : string; customer_street : string; customer_city : integer; end; Physical level: describes how the data is stored. View level: customized, restructured information. Views can also hide information (such as an employee s salary) for security purposes. 6

Basic Architecture of a Database System Data Independence logical and physical levels are independent 7

Data Models A collection of concepts and tools for describing the data relationships, semantics, constraints + A language for querying and modifying the data Relational model Entity-Relationship data model (mainly for database design, no query language) Object-based data models (Object-oriented and Object-relational) Semi-structured data model (XML) Other older models: Network model Hierarchical model 8

Schemas and Instances Similar to types and values of variables in programming languages Schema the logical structure of the database Example: The database consists of information about a set of customers and accounts and the relationship between them) Analogous to type of a variable in a program Instance the actual content of the database at a particular point in time Analogous to the value of a variable 9

Example: Entity-Relationship Model Models an application as a collection of entities and relationships Entity: a thing or object in the enterprise that is distinguishable from other objects Described by a set of attributes Relationship: an association among several entities Represented diagrammatically by an entity-relationship diagram: 10

Example: Relational Model Schema 11

Example: Relational Model Instance 12

Data Definition Language (DDL) Specification language for defining the database schema Example: create table account ( account-number char(10), balance integer) DDL compiler generates a set of tables stored in a data dictionary Data dictionary contains metadata (i.e., data about data) Database schema Integrity constraints Domain constraints Referential integrity (references constraint in SQL) Assertions Authorization information 13

Data Manipulation Language (DML) Language for accessing and modifying data DML also known as query/update language Two classes of languages Procedural user specifies what data is required and how to get that data Declarative (nonprocedural) user specifies what data is required without specifying how to get it SQL is the most widely used query language primarily declarative 14

Database Architecture Different architectures for different settings: Centralized Parallel (multi-processor) cloud computing/map-reduce Client-server Distributed 15

Centralized DBMS Site 1 Site 2 Site 5 Site 4 Site 3 16

Distributed DBMS Site 1 Site 2 Site 5 Site 4 Site 3 17

Client/Server DBMS SQL requests Answers Client Server 18

Map-reduce 19

Core database issues Data models, query languages Database design Query processing Storage management Transaction management Concurrency control 20

Beyond the Core Deductive databases Temporal databases Multimedia databases Geographic information systems Data warehouses Real-time and active databases XML databases Database-driven Web applications/services Data analytics (aka Big Data) 21

Databases at UCSD Prof. Alin Deutsch Prof. Yannis Papakonstantinou Prof. Victor Vianu Database group Web site: http://db.ucsd.edu/ papers, seminars, bragging. Intersections with other CSE groups storage multimedia machine learning IR/ data mining networks 22

Database Theory CSE 233

Relational db: theory practice Frege: FO logic Tarski: algebra for FO Codd: relational databases

Databases: implemented logic! FO lies at the core of modern database systems Databases = FO on every desk! Relational query languages are based on FO: SQL, QBE More powerful query languages (all the way to XML) are based on extensions of FO

Why is FO so successful as a query language? easy to use syntactic variants SQL, QBE efficient implementation via relational algebra amenable to analysis and simplification potential for perfect scaling to large databases very fast response can be achieved using parallel processing

Journey of a Query SQL FO Relational Algebra Query Rewriting Query Execution Plan select from where 13 (P Q) 14 (P S) Q R 14 Execution Q R P S Physical Level

Journey of a Query SQL FO select from where Relational Algebra Query Rewriting Query Execution Plan 13 (P Q) 14 (P S) Q R 14 Execution Q R P S Physical Level

Most spectacular: theoretical potential for perfect scaling! perfect scaling: given sufficient resources, performance does not degrade as the database becomes larger key: parallel processing cost: number of processors polynomial in the size of the database (FO AC0) role of algebra: operations highlight parallelism

Outline FO (aka CALC), relational algebra Static analysis for query processing Dependency theory Extending FO with recursion: Datalog and fixpoint logics Expressiveness and complexity Ehrenfeucht-Fraisse games, 0/1 laws The quest for a language for PTIME Highly expressive languages

Incomplete information Complex objects Languages for XML Selected research topics Other topics (if time)