External Sorting. Chapter 13. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1



Similar documents
External Sorting. Why Sort? 2-Way Sort: Requires 3 Buffers. Chapter 13

Overview of Storage and Indexing

Overview of Storage and Indexing. Data on External Storage. Alternative File Organizations. Chapter 8

Sorting revisited. Build the binary search tree: O(n^2) Traverse the binary tree: O(n) Total: O(n^2) + O(n) = O(n^2)

Lecture 1: Data Storage & Index

Chapter 13: Query Processing. Basic Steps in Query Processing

6 March Array Implementation of Binary Trees

Unit Storage Structures 1. Storage Structures. Unit 4.3

Binary Heaps * * * * * * * / / \ / \ / \ / \ / \ * * * * * * * * * * * / / \ / \ / / \ / \ * * * * * * * * * *

Storing Data: Disks and Files. Disks and Files. Why Not Store Everything in Main Memory? Chapter 7

Physical Data Organization

COS 318: Operating Systems

Storage in Database Systems. CMPSCI 445 Fall 2010

Big Data and Scripting. Part 4: Memory Hierarchies

1) The postfix expression for the infix expression A+B*(C+D)/F+D*E is ABCD+*F/DE*++

CPSC 211 Data Structures & Implementations (c) Texas A&M University [ 313]

Analysis of Algorithms I: Binary Search Trees

Introduction to IR Systems: Supporting Boolean Text Search. Information Retrieval. IR vs. DBMS. Chapter 27, Part A

AP Computer Science AB Syllabus 1

DATABASE DESIGN - 1DL400

Query Processing C H A P T E R12. Practice Exercises

File System & Device Drive. Overview of Mass Storage Structure. Moving head Disk Mechanism. HDD Pictures 11/13/2014. CS341: Operating System

Outline. Principles of Database Management Systems. Memory Hierarchy: Capacities and access times. CPU vs. Disk Speed

Outline BST Operations Worst case Average case Balancing AVL Red-black B-trees. Binary Search Trees. Lecturer: Georgy Gimel farb

Binary Heaps. CSE 373 Data Structures

Binary Heap Algorithms

File Management. Chapter 12

Operating Systems. Virtual Memory

University of Massachusetts Amherst Department of Computer Science Prof. Yanlei Diao

Previous Lectures. B-Trees. External storage. Two types of memory. B-trees. Main principles

CS473 - Algorithms I

Data Structures and Algorithm Analysis (CSC317) Intro/Review of Data Structures Focus on dynamic sets

Converting a Number from Decimal to Binary

Seeking Fast, Durable Data Management: A Database System and Persistent Storage Benchmark

CSE 326, Data Structures. Sample Final Exam. Problem Max Points Score 1 14 (2x7) 2 18 (3x6) Total 92.

Tables so far. set() get() delete() BST Average O(lg n) O(lg n) O(lg n) Worst O(n) O(n) O(n) RB Tree Average O(lg n) O(lg n) O(lg n)

EFFICIENT EXTERNAL SORTING ON FLASH MEMORY EMBEDDED DEVICES

Databases and Information Systems 1 Part 3: Storage Structures and Indices

Analyze Database Optimization Techniques

SQL Query Evaluation. Winter Lecture 23

Questions 1 through 25 are worth 2 points each. Choose one best answer for each.

Data storage Tree indexes

Elena Baralis, Silvia Chiusano Politecnico di Torino. Pag. 1. Physical Design. Phases of database design. Physical design: Inputs.

CIS 631 Database Management Systems Sample Final Exam

DBMS / Business Intelligence, SQL Server

External Memory Geometric Data Structures

In-Memory Databases Algorithms and Data Structures on Modern Hardware. Martin Faust David Schwalb Jens Krüger Jürgen Müller

Binary Search Trees. A Generic Tree. Binary Trees. Nodes in a binary search tree ( B-S-T) are of the form. P parent. Key. Satellite data L R

1Z0-117 Oracle Database 11g Release 2: SQL Tuning. Oracle

Data Structures. Algorithm Performance and Big O Analysis

Big Data Technology Map-Reduce Motivation: Indexing in Search Engines

The Tower of Hanoi. Recursion Solution. Recursive Function. Time Complexity. Recursive Thinking. Why Recursion? n! = n* (n-1)!

ICOM 6005 Database Management Systems Design. Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001

Introduction to Algorithms March 10, 2004 Massachusetts Institute of Technology Professors Erik Demaine and Shafi Goldwasser Quiz 1.

IBM DB2: LUW Performance Tuning and Monitoring for Single and Multiple Partition DBs

Chapter 13. Disk Storage, Basic File Structures, and Hashing

DB2 for Linux, UNIX, and Windows Performance Tuning and Monitoring Workshop

Algorithm Analysis [2]: if-else statements, recursive algorithms. COSC 2011, Winter 2004, Section N Instructor: N. Vlajic

Oracle EXAM - 1Z Oracle Database 11g Release 2: SQL Tuning. Buy Full Product.

Exploring the Efficiency of Big Data Processing with Hadoop MapReduce

Indexing Techniques in Data Warehousing Environment The UB-Tree Algorithm

How To Create A Tree From A Tree In Runtime (For A Tree)

Benchmarking Cassandra on Violin

B-Trees. Algorithms and data structures for external memory as opposed to the main memory B-Trees. B -trees

Closest Pair Problem

Lecture 6: Binary Search Trees CSCI Algorithms I. Andrew Rosenberg

6. Storage and File Structures

Data Warehousing und Data Mining

Advanced Oracle SQL Tuning

1/1 7/4 2/2 12/7 10/30 12/25

Physical DB design and tuning: outline

Cpt S 223. School of EECS, WSU

Chapter 2 Data Storage

PRODUCT OVERVIEW SUITE DEALS. Combine our award-winning products for complete performance monitoring and optimization, and cost effective solutions.

Physical Database Design Process. Physical Database Design Process. Major Inputs to Physical Database. Components of Physical Database Design

Datenbanksysteme II: Implementation of Database Systems Implementing Joins

CS711008Z Algorithm Design and Analysis

Data Compression in Blackbaud CRM Databases

CS104: Data Structures and Object-Oriented Design (Fall 2013) October 24, 2013: Priority Queues Scribes: CS 104 Teaching Team

R-trees. R-Trees: A Dynamic Index Structure For Spatial Searching. R-Tree. Invariants

SQL Server 2012 Optimization, Performance Tuning and Troubleshooting

What is RAID? data reliability with performance

CSE 326: Data Structures B-Trees and B+ Trees

Disks and RAID. Profs. Bracy and Van Renesse. based on slides by Prof. Sirer

Lecture 4 Online and streaming algorithms for clustering

Sorting Hierarchical Data in External Memory for Archiving

Distributed Computing over Communication Networks: Maximal Independent Set

A binary search tree or BST is a binary tree that is either empty or in which the data element of each node has a key, and:

Zabin Visram Room CS115 CS126 Searching. Binary Search

Name: 1. CS372H: Spring 2009 Final Exam

VII. Database System Architecture

The Database is Slow

I/O Management. General Computer Architecture. Goals for I/O. Levels of I/O. Naming. I/O Management. COMP755 Advanced Operating Systems 1

Analysis of Algorithms I: Optimal Binary Search Trees

DB2 LUW Performance Tuning and Monitoring for Single and Multiple Partition DBs

Transcription:

External Sorting Chapter 13 Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1

Why Sort? A classic problem in computer science! Data requested in sorted order e.g., find students in increasing gpa order Sorting is first step in bulk loading B+ tree index. Sorting useful for eliminating duplicate copies in a collection of records. Sort-merge join algorithm involves sorting. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 2

2-Way Sort: Requires 3 Buffers Pass 1: Read a page, sort it, write it. only one buffer page is used Pass 2, 3,, etc.: three buffer pages used. INPUT 1 INPUT 2 OUTPUT Disk Main memory buffers Disk Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 3

Two-Way External Merge Sort Each pass we read + write each page in file. N pages in the file => the number of passes = log + 2 N 1 So toal cost is: Idea: Divide and conquer: sort subfiles and merge ( log N + ) 2N 1 2 3,4 6,2 9,4 8,7 5,6 3,1 2 3,4 2,6 4,9 7,8 5,6 1,3 2 2,3 4,6 Input file PASS 0 1-page runs PASS 1 2-page runs PASS 2 4-page runs PASS 3 8-page runs 9 Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 4 2,3 4,4 6,7 8,9 4,7 8,9 1,2 2,3 3,4 4,5 6,6 7,8 1,3 5,6 2 1,2 3,5 6

General External Merge Sort * More than 3 buffer pages. How can we utilize them? To sort a file with N pages using B buffer pages: Pass 0: use B buffer pages. Produce N / B sorted runs of B pages each. Pass 2,, etc.: merge B-1 runs. INPUT 1... INPUT 2...... OUTPUT Disk INPUT B-1 B Main memory buffers Disk Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 5

Cost of External Merge Sort Number of passes: Cost = 2N * (# of passes) 1 + 1 log B N / B E.g., with 5 buffer pages, to sort 108 page file: Pass 0: 108 / 5 = 22 sorted runs of 5 pages each (last run is only 3 pages) Pass 1: 22 / 4 = 6 sorted runs of 20 pages each (last run is only 8 pages) Pass 2: 2 sorted runs, 80 pages and 28 pages Pass 3: Sorted file of 108 pages Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 6

Number of Passes of External Sort N B=3 B=5 B=9 B=17 B=129 B=257 100 7 4 3 2 1 1 1,000 10 5 4 3 2 2 10,000 13 7 5 4 2 2 100,000 17 9 6 5 3 3 1,000,000 20 10 7 5 3 3 10,000,000 23 12 8 6 4 3 100,000,000 26 14 9 7 4 4 1,000,000,000 30 15 10 8 5 4 Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 7

Internal Sort Algorithm Quicksort is a fast way to sort in memory. An alternative is tournament sort (a.k.a. heapsort ) Top: Read in B blocks Output: move smallest record to output buffer Read in a new record r insert r into heap if r not smallest, then GOTO Output else remove r from heap output heap in order; GOTO Top Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 8

More on Heapsort What is the average length of a run in heapsort? The snowplow analogy Imagine a snowplow moving around a circular track on which snow falls at a steady rate. At any instant, there is a certain amount of snow S on the track. What is the amount of snow removed in each revolution? Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 9

More on Heapsort The snowplow analogy: During the next revolution of the plow, all of this is removed, plus 1/2 of what falls during that revolution. Thus, the plow removes 2S amount of snow. B Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 10

More on Heapsort Fact: average length of a run in heapsort is 2B The snowplow analogy Worst-Case: What is min length of a run? How does this arise? Best-Case: What is max length of a run? How does this arise? Quicksort is faster, but... Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 11

I/O for External Merge Sort longer runs often means fewer passes! Actually, do I/O a page at a time In fact, read a block of pages sequentially! Suggests we should make each buffer (input/output) be a block of pages. But this will reduce fan-out during merge passes! In practice, most files still sorted in 2-3 passes. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 12

Number of Passes of Optimized Sort N B=1,000 B=5,000 B=10,000 100 1 1 1 1,000 1 1 1 10,000 2 2 1 100,000 3 2 2 1,000,000 3 2 2 10,000,000 4 3 3 100,000,000 5 3 3 1,000,000,000 5 4 3 * block size = 32, initial pass produces runs of size 2B. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 13

Double Buffering To reduce wait time for I/O request to complete, can prefetch into `shadow block. Potentially, more passes; in practice, most files still sorted in 2-3 passes. INPUT 1 INPUT 1' INPUT 2 OUTPUT INPUT 2' OUTPUT' Disk INPUT k INPUT k' b block size Disk B main memory buffers, k-way merge Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 14

Lower bound for external sorting Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 15

Using B+ Trees for Sorting Scenario: Table to be sorted has B+ tree index on sorting column(s). Idea: Can retrieve records in order by traversing leaf pages. Is this a good idea? Cases to consider: B+ tree is clustered Good idea! B+ tree is not clustered Could be a very bad idea! Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 16

Clustered B+ Tree Used for Sorting Cost: root to the leftmost leaf, then retrieve all leaf pages (Alternative 1) If Alternative 2 is used? Additional cost of retrieving data records: each page fetched just once. Data Records Index (Directs search) Data Entries ("Sequence set") * Always better than external sorting! Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 17

Unclustered B+ Tree Used for Sorting Alternative (2) for data entries; each data entry contains rid of a data record. In general, one I/O per data record! Index (Directs search) Data Entries ("Sequence set") Data Records Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 18

External Sorting vs. Unclustered Index N Sorting p=1 p=10 p=100 100 200 100 1,000 10,000 1,000 2,000 1,000 10,000 100,000 10,000 40,000 10,000 100,000 1,000,000 100,000 600,000 100,000 1,000,000 10,000,000 1,000,000 8,000,000 1,000,000 10,000,000 100,000,000 10,000,000 80,000,000 10,000,000 100,000,000 1,000,000,000 * p: # of records per page * B=1,000 and block size=32 for sorting * p=100 is the more realistic value. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 19

Summary External sorting is important; DBMS may dedicate part of buffer pool for sorting! External merge sort minimizes disk I/O cost: Pass 0: Produces sorted runs of size B (# buffer pages). Later passes: merge runs. # of runs merged at a time depends on B and block size. Larger block size means less I/O cost per page. Larger block size means smaller # runs merged. In practice, # of runs rarely more than 2 or 3. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 20

Summary, cont. Choice of internal sort algorithm may matter: Quicksort: Quick! Heap/tournament sort: slower (2x), longer runs The best sorts are wildly fast: Despite 40+ years of research, we re still improving! Clustered B+ tree is good for sorting; unclustered tree is usually very bad. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 21