Binary Heaps and Heapsort

Similar documents
Binary Heaps * * * * * * * / / \ / \ / \ / \ / \ * * * * * * * * * * * / / \ / \ / / \ / \ * * * * * * * * * *

6 March Array Implementation of Binary Trees

Converting a Number from Decimal to Binary

Binary Heaps. CSE 373 Data Structures

CS104: Data Structures and Object-Oriented Design (Fall 2013) October 24, 2013: Priority Queues Scribes: CS 104 Teaching Team

Cpt S 223. School of EECS, WSU

Ordered Lists and Binary Trees

1/1 7/4 2/2 12/7 10/30 12/25

CSE 326, Data Structures. Sample Final Exam. Problem Max Points Score 1 14 (2x7) 2 18 (3x6) Total 92.

Analysis of Algorithms I: Binary Search Trees

Algorithms and Data Structures

Data Structures and Algorithms

Binary Heap Algorithms

A binary heap is a complete binary tree, where each node has a higher priority than its children. This is called heap-order property

A binary search tree or BST is a binary tree that is either empty or in which the data element of each node has a key, and:

From Last Time: Remove (Delete) Operation

CS711008Z Algorithm Design and Analysis

Sorting revisited. Build the binary search tree: O(n^2) Traverse the binary tree: O(n) Total: O(n^2) + O(n) = O(n^2)

Binary Search Trees. A Generic Tree. Binary Trees. Nodes in a binary search tree ( B-S-T) are of the form. P parent. Key. Satellite data L R

Binary Search Trees. Data in each node. Larger than the data in its left child Smaller than the data in its right child

Class Notes CS Creating and Using a Huffman Code. Ref: Weiss, page 433

Questions 1 through 25 are worth 2 points each. Choose one best answer for each.

Binary Search Trees CMPSC 122

Data Structure [Question Bank]

root node level: internal node edge leaf node Data Structures & Algorithms McQuain

Outline BST Operations Worst case Average case Balancing AVL Red-black B-trees. Binary Search Trees. Lecturer: Georgy Gimel farb

Data Structures. Jaehyun Park. CS 97SI Stanford University. June 29, 2015

Heaps & Priority Queues in the C++ STL 2-3 Trees

Chapter 14 The Binary Search Tree

Quiz 4 Solutions EECS 211: FUNDAMENTALS OF COMPUTER PROGRAMMING II. 1 Q u i z 4 S o l u t i o n s

1) The postfix expression for the infix expression A+B*(C+D)/F+D*E is ABCD+*F/DE*++

PES Institute of Technology-BSC QUESTION BANK

Exam study sheet for CS2711. List of topics

A binary search tree is a binary tree with a special property called the BST-property, which is given as follows:

MAX = 5 Current = 0 'This will declare an array with 5 elements. Inserting a Value onto the Stack (Push)

The following themes form the major topics of this chapter: The terms and concepts related to trees (Section 5.2).

Algorithms Chapter 12 Binary Search Trees

CSE 326: Data Structures B-Trees and B+ Trees

Big Data and Scripting. Part 4: Memory Hierarchies

1. The memory address of the first element of an array is called A. floor address B. foundation addressc. first address D.

Algorithms and Data Structures

CSE373: Data Structures and Algorithms Lecture 1: Introduction; ADTs; Stacks/Queues. Linda Shapiro Spring 2016

How To Create A Tree From A Tree In Runtime (For A Tree)

Binary Search Trees (BST)

Lecture Notes on Binary Search Trees

Lecture Notes on Binary Search Trees

Data Structures and Algorithms Written Examination

Introduction to Data Structures and Algorithms

Krishna Institute of Engineering & Technology, Ghaziabad Department of Computer Application MCA-213 : DATA STRUCTURES USING C

Binary Trees and Huffman Encoding Binary Search Trees

Algorithms and Data Structures Written Exam Proposed SOLUTION

The ADT Binary Search Tree

Data Structures and Algorithm Analysis (CSC317) Intro/Review of Data Structures Focus on dynamic sets

Full and Complete Binary Trees

Analysis of a Search Algorithm

5. A full binary tree with n leaves contains [A] n nodes. [B] log n 2 nodes. [C] 2n 1 nodes. [D] n 2 nodes.

Data Structure and Algorithm I Midterm Examination 120 points Time: 9:10am-12:10pm (180 minutes), Friday, November 12, 2010

Data Structures and Data Manipulation

Operations: search;; min;; max;; predecessor;; successor. Time O(h) with h height of the tree (more on later).

Lecture 4: Balanced Binary Search Trees

Lecture 2 February 12, 2003

Why Use Binary Trees?

Data Structures, Practice Homework 3, with Solutions (not to be handed in)

Data Structures Fibonacci Heaps, Amortized Analysis

Lecture 6: Binary Search Trees CSCI Algorithms I. Andrew Rosenberg

TREE BASIC TERMINOLOGIES

Class Overview. CSE 326: Data Structures. Goals. Goals. Data Structures. Goals. Introduction

Big O and Limits Abstract Data Types Data Structure Grand Tour.

Tables so far. set() get() delete() BST Average O(lg n) O(lg n) O(lg n) Worst O(n) O(n) O(n) RB Tree Average O(lg n) O(lg n) O(lg n)

Sample Questions Csci 1112 A. Bellaachia

Data Structure with C

Data Structures. Level 6 C Module Descriptor

Course: Programming II - Abstract Data Types. The ADT Stack. A stack. The ADT Stack and Recursion Slide Number 1

B-Trees. Algorithms and data structures for external memory as opposed to the main memory B-Trees. B -trees

22c:31 Algorithms. Ch3: Data Structures. Hantao Zhang Computer Science Department

External Sorting. Chapter 13. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1

Lecture 12 Doubly Linked Lists (with Recursion)

- Easy to insert & delete in O(1) time - Don t need to estimate total memory needed. - Hard to search in less than O(n) time

DATA STRUCTURES USING C

ECE 250 Data Structures and Algorithms MIDTERM EXAMINATION /5:15-6:45 REC-200, EVI-350, RCH-106, HH-139

Previous Lectures. B-Trees. External storage. Two types of memory. B-trees. Main principles

Abstract Data Type. EECS 281: Data Structures and Algorithms. The Foundation: Data Structures and Abstract Data Types

Sequential Data Structures

Alex. Adam Agnes Allen Arthur

Queues Outline and Required Reading: Queues ( 4.2 except 4.2.4) COSC 2011, Fall 2003, Section A Instructor: N. Vlajic

Algorithms. Margaret M. Fleck. 18 October 2010

Rotation Operation for Binary Search Trees Idea:

Common Data Structures

recursion, O(n), linked lists 6/14

The Union-Find Problem Kruskal s algorithm for finding an MST presented us with a problem in data-structure design. As we looked at each edge,

Binary Search Trees. basic implementations randomized BSTs deletion in BSTs

Symbol Tables. Introduction

Learning Outcomes. COMP202 Complexity of Algorithms. Binary Search Trees and Other Search Trees

CompSci-61B, Data Structures Final Exam

B+ Tree Properties B+ Tree Searching B+ Tree Insertion B+ Tree Deletion Static Hashing Extendable Hashing Questions in pass papers

Binary search algorithm

Keys and records. Binary Search Trees. Data structures for storing data. Example. Motivation. Binary Search Trees

Linked Lists, Stacks, Queues, Deques. It s time for a chainge!

Persistent Binary Search Trees

Transcription:

CPSC 259: Data Structures and Algorithms for Electrical Engineers Binary Heaps and Heapsort Textbook Reference: Thareja first edition: Chapter 12, pages 501-506 Thareja second edition: Chapter 12, pages 361-365 Hassan Khosravi CPSC 259 Binary Heaps and Heapsort Page 1

Learning Goals Provide examples of appropriate applications for priority queues and heaps. Implement and manipulate a heap using an array as the underlying data structure. Describe and use the Heapify ( Build Heap ) and Heapsort algorithms. Analyze the complexity of operations on a heap. CPSC 259 Binary Heaps and Heapsort Page 2

CPSC 259 Administrative Notes MT2 on Friday See course website for details covers all of the course material up to and including what we cover today No Lecture on Wednesday No labs this week. Quiz 3 Marks are released PeerWise grades for second call are posted PeerWise: Third and final call ends Dec 4 th. CPSC 259 Binary Heaps and Heapsort Page 3

CPSC 259 Journey Abstract Data Types Priority Queue Tools Pointers Dynamic Memory Alloca2on Stack Queue Dictionary Asymptotic Analysis Recursion Linked list Circular Array Array Heapsort Binary Heap Data Structures Binary Search Tree Algorithms CPSC 259 Binary Trees Page 4

Priority Queues Let s say we have the following tasks. 2 - Water plants 5 - Order cleaning supplies 1 - Clean coffee maker 3 - Empty trash 9 - Fix overflowing sink 2 - Shampoo carpets 4 - Replace light bulb 1 - Remove pencil sharpener shavings We are interested in finding the task with the highest priority quickly. CPSC 259 Binary Heaps and Heapsort Page 5

Priority Queues A collection organized so as to permit fast access to and removal of the largest/smallest element Prioritization is a weaker condition than ordering. Order of insertion is irrelevant. Element with the highest priority (whatever that means) comes out next. Not really a queue: not a FIFO CPSC 259 Binary Heaps and Heapsort Page 6

Priority Queue ADT Priority Queue operations create destroy insert deletemin isempty G(9) insert F(7) E(5) D(100) A(4) B(6) deletemin C(3) Priority Queue property: for two elements in the queue, x and y, if x has a lower priority value than y, x will be deleted before y. CPSC 259 Binary Heaps and Heapsort Page 7

Heaps and Priority Queues A priority queue is an abstract data type (ADT) that maintains a bag of items. What is the difference between a set and a bag? A set does not contain duplicates. Two or more distinct items in a priority queue can have the same priority. If all items have the same priority, you might think a priority queue should behave like a queue, but it may not. We ll see why, shortly. A binary heap can implement a priority queue, efficiently. CPSC 259 Binary Heaps and Heapsort Page 8

Applications of the Priority Q Hold jobs for a printer in order of length Manage limited resources such as bandwidth on a transmission line from a network router Simulate events (simulation time used as the priority) Sort numbers Anything greedy: an algorithm that makes the locally best choice at each step CPSC 259 Binary Heaps and Heapsort Page 9

Naïve Priority Q Data Structures Let s use an unsorted list (could be implemented with either an Array or Linked List) Running time of insert? a. O(1) b. O(lg n) c. O(n) d. O(n lg n) e. Something else CPSC 259 Binary Heaps and Heapsort Page 10

Naïve Priority Q Data Structures Let s use an unsorted list (could be implemented with either an Array or Linked List) Running time of insert? a. O(1) b. O(lg n) c. O(n) d. O(n lg n) e. Something else CPSC 259 Binary Heaps and Heapsort Page 11

Naïve Priority Q Data Structures Let s use an unsorted list (could be implemented with either an Array or Linked List) Running time of deletemin? a. O(1) b. O(lg n) c. O(n) d. O(n lg n) e. Something else CPSC 259 Binary Heaps and Heapsort Page 12

Naïve Priority Q Data Structures Let s use an unsorted list (could be implemented with either an Array or Linked List) Running time of deletemin? a. O(1) b. O(lg n) c. O(n) d. O(n lg n) e. Something else CPSC 259 Binary Heaps and Heapsort Page 13

Naïve Priority Q Data Structures Let s use a sorted list (could be implemented with either an Array or Linked List) Running time of insert? a. O(1) b. O(lg n) c. O(n) d. O(n lg n) e. Something else CPSC 259 Binary Heaps and Heapsort Page 14

Naïve Priority Q Data Structures Let s use a sorted list (could be implemented with either an Array or Linked List) Running time of insert? a. O(1) b. O(lg n) c. O(n) d. O(n lg n) e. Something else CPSC 259 Binary Heaps and Heapsort Page 15

Naïve Priority Q Data Structures Let s use a sorted list (could be implemented with either an Array or Linked List) Running time of deletemin? a. O(1) b. O(lg n) c. O(n) d. O(n lg n) e. Something else CPSC 259 Binary Heaps and Heapsort Page 16

Naïve Priority Q Data Structures Let s use a sorted list (could be implemented with either an Array or Linked List) Running time of deletemin? a. O(1) b. O(lg n) c. O(n) d. O(n lg n) e. Something else CPSC 259 Binary Heaps and Heapsort Page 17

Naïve Priority Q Data Structures Let s use a Binary Search Tree (could be implemented with either an Array or Linked List) Worst case running time of insert? a. O(1) b. O(lg n) c. O(n) d. O(n lg n) e. Something else CPSC 259 Binary Heaps and Heapsort Page 18

Naïve Priority Q Data Structures Let s use a Binary Search Tree (could be implemented with either an Array or Linked List) Worst case running time of insert? a. O(1) b. O(lg n) c. O(n) d. O(n lg n) e. Something else CPSC 259 Binary Heaps and Heapsort Page 19

Naïve Priority Q Data Structures Let s use a Binary Search Tree (could be implemented with either an Array or Linked List) Worst case running time of deletemin? a. O(1) b. O(lg n) c. O(n) d. O(n lg n) e. Something else CPSC 259 Binary Heaps and Heapsort Page 20

Naïve Priority Q Data Structures Let s use a Binary Search Tree (could be implemented with either an Array or Linked List) Worst case running time of deletemin? a. O(1) b. O(lg n) c. O(n) d. O(n lg n) e. Something else CPSC 259 Binary Heaps and Heapsort Page 21

Naïve Priority Q Data Structures (Summary) Data Structure Insert (worst case) Unsorted lists O(1) O(n) Sorted lists O(n) O(1) Binary Search Trees O(n) O(n) Binary Heaps O(lg n) O(lg n) deletemin (worst case) CPSC 259 Binary Heaps and Heapsort Page 22

The heap property A node has the heap property if the priority of the node is as high as or higher than the priority of its children. 12 12 12 8 3 Red node has heap property 8 12 Red node has heap property 8 14 Red node does not have heap property All leaf nodes automatically have the heap property. A binary tree is a heap if all nodes in it have the heap property. CPSC 259 Binary Heaps and Heapsort Page 23

Percolate-up Given a node that does not have the heap property, you can give it the heap property by exchanging its value with the value of the child with the higher priority. 12 14 8 14 Red node does not have heap property 8 12 Red node has heap property This is sometimes called Percolate-up (sifting up). Notice that the child may have lost the heap property. CPSC 259 Binary Heaps and Heapsort Page 24

Binary Heap Priority Q Data Structure Heap-order property parent s key is less than or equal to children s keys result: minimum is always at the top Structure property nearly complete tree CPSC 259 Binary Heaps and Heapsort Page 25 11 7 9 4 12 6 14 2 20 10 depth is always O(log n); next open loca2on always known WARNING: this has NO SIMILARITY to the heap you hear about when people say objects you create with new go on the heap. 25 5 8

It is important to realize that two binary heaps can contain the same data but the data may appear in different positions in the heap: 2 5 7 5 7 8 2 5 5 7 7 8 Both of the minimum binary heaps above contain the same data: 2, 5, 5, 7, 7, and 8. Even though both heaps satisfy all the properties necessary of a minimum binary heap, the data is stored in different positions in the tree. CPSC 259 Binary Heaps and Heapsort Page 26

Min-heap and Max-heap (There s also a maximum binary heap, where parent each child simply changes to parent each child.) 2 5 7 5 7 8 Min-heap 7 5 7 4 3 4 Max-heap CPSC 259 Binary Heaps and Heapsort Page 27

Constructing a heap I (Naïve approach) A tree consisting of a single node is automatically a heap. We construct a heap by adding nodes one at a time: Add the node just to the right of the rightmost node in the deepest level. If the deepest level is full, start a new level with the leftmost position. Examples: Add a new node here Add a new node here CPSC 259 Binary Heaps and Heapsort Page 28

Constructing a heap II (Naïve approach) Each time we add a node, we may destroy the heap property of its parent node. To fix this, we percolate-up. But each time we percolate-up, the value of the topmost node in the sift may increase, and this may destroy the heap property of its parent node. We repeat the percolate-up process, moving up in the tree, until either: we reach nodes whose values don t need to be swapped (because the parent is still larger than both children), or we reach the root CPSC 259 Binary Heaps and Heapsort Page 29

Constructing a heap III (Naïve approach) 8 8 10 10 10 8 8 5 1 2 3 10 10 12 8 5 12 5 10 5 12 8 8 4 CPSC 259 Binary Heaps and Heapsort Page 30

Other children are not affected 12 12 14 10 5 14 5 12 5 8 14 8 10 8 10 The node containing 8 is not affected because its parent gets larger, not smaller. The node containing 5 is not affected because its parent gets larger, not smaller. The node containing 8 is still not affected because, although its parent got smaller, its parent is still greater than it was originally. CPSC 259 Binary Heaps and Heapsort Page 31

A sample heap Here s a sample binary tree after it has been heapified. 25 22 17 19 22 14 15 18 14 21 3 9 11 Notice that heapified does not mean sorted. Heapifying does not change the shape of the binary tree; this binary tree is still a nearly complete binary tree. CPSC 259 Binary Heaps and Heapsort Page 32

Clicker Question Is the following binary tree a maximum binary heap? 25 22 17 19 23 16 15 14 18 21 5 9 6 A: It is a maximum binary heap B: It is not a maximum binary heap C: I don t know CPSC 259 Binary Heaps and Heapsort Page 33

Clicker Question (answer) Is the following binary tree a maximum binary heap? 25 22 17 19 23 16 15 14 18 21 5 9 6 A: It is a maximum binary heap B: It is not a maximum binary heap C: I don t know CPSC 259 Binary Heaps and Heapsort Page 34

Clicker question Which of the following statement(s) are true of the tree shown below? a. It is a binary search tree. b. It is a complete tree. c. It is a Max-heap. d. It is a Min-heap. 2 7 e. More than one of the above statements is true. 15 CPSC 259 Binary Heaps and Heapsort Page 35

Clicker question Which of the following statement(s) are true of the tree shown below? a. It is a binary search tree. b. It is a complete tree. c. It is a Max-heap. d. It is a Min-heap. 2 7 e. More than one of the above statements is true. 15 CPSC 259 Binary Heaps and Heapsort Page 36

In-class exercise Build a binary Max-heap using the following numbers, assuming numbers arrive one at a time, so you don t have the whole list to start with). 2,7,26,25,19,17,1,90,3,36 90 36 17 25 26 7 1 2 3 19 See hpp://visualgo.net/heap.html CPSC 259 Binary Heaps and Heapsort Page 37

Mapping into an array 25 22 17 19 22 14 15 18 14 21 3 9 11 0 1 2 3 4 5 6 7 8 9 10 11 12 25 22 17 19 22 14 15 18 14 21 3 9 11 Because of the heap's shape, node values can be stored in an array. CPSC 259 Binary Heaps and Heapsort Page 38

Mapping into an array 25 22 17 19 22 14 15 18 14 21 3 9 11 0 1 2 3 4 5 6 7 8 9 10 11 12 25 22 17 19 22 14 15 18 14 21 3 9 11 LeV child = 2*node + 1 right child= 2*node + 2 parent = floor((node-1)/2) nex\ree = length root = 0 CPSC 259 Binary Heaps and Heapsort Page 39

Adding an item to a heap If a new item is added to the heap, use ReheapUp (percolate-up ) to maintain a heap. /* This function performs the Reheapup operation on an array, to establish heap properties (for a subtree).!! PARAM: data integer array containing the heap! top position of the root! bottom - position of the added element! */! void ReheapUp( int * data, int top, int bottom ){! if (bottom > top) {! int parent = getparent(bottom);! if (data[parent] < data[bottom]) {! swap( &data[parent], &data[bottom]);! ReheapUp(data, top, parent);! }! }! } CPSC 259 Binary Heaps and Heapsort Page 40

In-class exercise For the max-heap below draw the recursion tree of ReheapUp(data, 0, 12), where 25 is stored in data[0]! 25 22 17 19 22 14 15 18 14 21 3 9 26 ReheapUp(data, 0, 12) ReheapUp(data, 0, 5) ReheapUp(data, 0, 2) ReheapUp(data, 0, 0) CPSC 259 Binary Heaps and Heapsort Page 41

Example (min-heap) 2 2 4 5 4 5 7 6 10 8 7 6 3 8 11 9 12 14 20 3 11 9 12 14 20 10 2 2 4 3 4 3 7 6 5 8 7 6 5 8 11 9 12 14 20 10 11 9 12 14 20 10 CPSC 259 Binary Heaps and Heapsort Page 42

Removing the root (Min-heap example) So we know how to add new elements to our heap. We also know that root is the element with the highest priority. But what should we do once root is removed? Which element should replace root?? 4 5 7 6 10 8 11 9 12 14 20 CPSC 259 Binary Heaps and Heapsort Page 43

? One possibility 4 4 5? 5 7 6 10 8 7 6 10 8 11 9 12 14 20 11 9 12 14 20 4 4 6 5 6 5 7? 10 8 7 12 10 8 11 9 12 14 20 11 9 20 14 20 CPSC 259 Binary Heaps and Heapsort Page 44

Removing the root (max-heap example) So we know how to add new elements to our heap. We also know that root is the element with the highest priority. But what should we do once root is removed? Which element should replace root? 25 22 17 19 22 14 15 18 14 21 3 9 11 CPSC 259 Binary Heaps and Heapsort Page 45

Removing the root Suppose we remove the root: 11 22 17 19 22 14 15 18 14 21 3 How can we fix the binary tree so it is once again a nearly complete tree? Solution: remove the rightmost leaf at the deepest level and use it for the new root. 9 11 CPSC 259 Binary Heaps and Heapsort Page 46

The percolate-down method Our tree is now a nearly complete binary tree, but no longer a heap. However, only the root lacks the heap property. 11 22 17 19 22 14 15 18 14 21 We can percolate-down the root. After doing this, one and only one of its children may have lost the heap property. 3 9 CPSC 259 Binary Heaps and Heapsort Page 47

The percolate-down method Now the left child of the root (still the number 11) lacks the heap property. 22 11 17 19 22 14 15 18 14 21 3 9 We can percolate-down this node. After doing this, one and only one of its children may have lost the heap property. CPSC 259 Binary Heaps and Heapsort Page 48

The percolate-down method Now the right child of the left child of the root (still the number 11) lacks the heap property: 22 22 17 19 11 14 15 18 14 21 3 9 We can percolate-down this node. After doing this, one and only one of its children may have lost the heap property. CPSC 259 Binary Heaps and Heapsort Page 49

The percolate-down method Our tree is once again a heap, because every node in it has the heap property 22 22 17 19 21 14 15 18 14 11 3 9 Once again, the largest (or a largest) value is in the root CPSC 259 Binary Heaps and Heapsort Page 50

In-class exercise Build a binary Max-heap using the following numbers 2,7,26,25,19,17,1,90,3,36 90 36 17 25 26 7 1 2 3 19 Now remove max and reheap. 36 26 17 25 19 7 1 2 3 CPSC 259 Binary Heaps and Heapsort Page 51

! Removing an item from a heap Use ReheapDown (percolate-down) to remove an item from a max heap /* This function performs the ReheapDown operation on an array, to establish heap properties (for a subtree).!! PARAM: data integer array containing the heap! top position of the root! bottom position of the final elements in heap! */! void ReheapDown( int * data, int top, int bottom){! if (!isleaf(top, bottom)){ /* top is not a leaf */! int maxchild = getmaxchild(top) /* position of the! child having largest data value */! }! }! if ( data[top] < data[maxchild] ){! swap( &data[top], &data[maxchild])! ReheapDown( data, maxchild, bottom);! }! CPSC 259 Binary Heaps and Heapsort Page 52

In-class exercise For the max heap below draw the recursion tree of ReheapDown(data, 0, 10).! 9 22 17 19 22 14 15 18 14 21 3 ReheapDown(data, 0, 10) ReheapDown(data, 1, 10) ReheapDown(data, 4, 10) ReheapDown(data, 9, 10) CPSC 259 Binary Heaps and Heapsort Page 53

When performing either a ReheapUp or ReheapDown operation, the number of operations depends on the depth of the tree. Notice that we traverse only one path or branch of the tree. Recall that a nearly complete binary tree of height h has between 2 h and 2 h+1-1 nodes: F Time Complexity D J F D J H H K L M We can now determine the height of a heap in terms of the number of nodes n in the heap. The height is lg n. The time complexity of the ReheapUp and ReheapDown operations is therefore O(lg n). CPSC 259 Binary Heaps and Heapsort Page 54

Building a Heap (Naïve) Adding the elements one add a time with a reheapup See http://visualgo.net/heap.html /* This function builds a heap from an array.!! PARAM: data integer array (no order is assumed)! top - position of the root! bottom position of the final elements in heap! */! void Build_heap( int * data, int top, int bottom )! {! int index = 0;! while (index <= bottom){! ReheapUp(data, top, index);! index ++;! }! }! CPSC 259 Binary Heaps and Heapsort Page 55

8 8 10 10 10 8 8 5 10 10 12 8 5 12 5 10 5 12 8 8 Complexity analysis: we add each of n nodes and each node has to be sifted up, possibly as far as the root Since the binary tree is a nearly complete binary tree, sifting up a single node takes O(lg n) time Since we do this N times, Build_heap takes N*O(lg n) time, that is, O(n lg n) time CPSC 259 Binary Heaps and Heapsort Page 56

! Heapify method See http://visualgo.net/heap.html /* This function builds a heap from an array.!! PARAM: data integer array (no order is assumed)! top - position of the root! bottom position of the final elements in heap! */! void Heapify( int * data, int top, int bottom )! {! int index = position of last parent node in entire tree;! while (index => top){! /* go backwards from the last parent */! ReheapDown( data, index, bottom );! index --;! }! } CPSC 259 Binary Heaps and Heapsort Page 57

In-class exercise Example: Convert the following array to a min-heap: 8 9 7 3 2 5 0 1 To do so, picture the array as a nearly complete binary tree: 8 9 3 2 7 5 0 1 CPSC 259 Binary Heaps and Heapsort Page 58

8 8 9 7 9 0 1 2 5 0 1 2 5 7 3 3 8 0 1 0 1 5 3 2 5 7 3 2 8 7 9 9 CPSC 259 Binary Heaps and Heapsort Page 59

Time complexity of Heapify We can determine the time complexity of the Heapify function by looking at the total number of times the comparison and swap operations occur while building the heap. Let us consider the worst case, which is when the last level in the heap is full, and all of the nodes with high priorities are in the leafs. We will colour all the paths from each node, starting with the lowest parent and working up to the root, each going down to a leaf node. The number of edges on the path from each node to a leaf node represents an upper bound on the number of comparison and swap operations that will occur while applying the ReheapDown operation to that node. By summing the total length of these paths, we will determine the time complexity of the Heapify function. CPSC 259 Binary Heaps and Heapsort Page 60

Time complexity of Heapify In the worse possible case how many swaps are going to take place? Relate the number of swaps first to the number of edges and then nodes. CPSC 259 Binary Heaps and Heapsort Page 61

Time complexity of Heapify Note that no edge is coloured more than once. Hence the work done by the Heapify function to build the heap can be measured in terms of the number of coloured edges. CPSC 259 Binary Heaps and Heapsort Page 62

Time complexity of Heapify Note that no edge is coloured more than once. Hence the work done by the Heapify function to build the heap can be measured in terms of the number of coloured edges. CPSC 259 Binary Heaps and Heapsort Page 63

Time complexity of Heapify Note that no edge is coloured more than once. Hence the work done by the Heapify function to build the heap can be measured in terms of the number of coloured edges. CPSC 259 Binary Heaps and Heapsort Page 64

Time complexity of Heapify Note that no edge is coloured more than once. Hence the work done by the Heapify function to build the heap can be measured in terms of the number of coloured edges. CPSC 259 Binary Heaps and Heapsort Page 65

Time complexity of Heapify (sketch) Suppose H is the height of the tree, N is the number of elements in the tree, and E is the number of edges in the tree. How many edges are there in a nearly complete tree with N elements? N-1 Total number of coloured edges or swaps = E H = N - 1 - H = N 1 lg N T(n) O (n) Hence, in the worst case, the overall time complexity of the Heapify algorithm is: O(n) CPSC 259 Binary Heaps and Heapsort Page 66

The Heapsort Algorithm Motivation: In this section, we examine a sorting algorithm that guarantees worst case O(n lg n) time. We will use a binary heap to sort an array of data The Heapsort algorithm consists of 2 phases: 1. [Heapify] Build a heap using the elements to be sorted. 2. [Sort] Use the heap to sort the data. CPSC 259 Binary Heaps and Heapsort Page 67

Having built the heap, we now sort the array: 9 1 3 2 0 5 8 7 0 1 5 3 2 8 7 9 Note: In this section, we represent the data in both binary tree and array formats. It is important to understand that in practice the data is stored only as an array. More about this later when we cover sorting!!! CPSC 259 Binary Heaps and Heapsort Page 68

Time Complexity of Heapsort We need to determine the time complexity of the Heapify O(n) operation, and the time complexity of the subsequent sorting operation. The time complexity of the sorting operation once the heap has been built is fairly easy to determine. For each element in the heap, we perform a single swap and a ReheapDown. If there are N elements in the heap, the ReheapDown operation is O( lg n ), and hence the sorting operation is O( n lg n ). Hence, in the worst case, the overall time complexity of the Heapsort algorithm is: O(n) + O(n lg n) = O(n lg n) build heap from unsorted array essentially perform N RemoveMin s CPSC 259 Binary Heaps and Heapsort Page 69

Learning Goals revisited Provide examples of appropriate applications for priority queues and heaps. Implement and manipulate a heap using an array as the underlying data structure. Describe and use the Heapify ( Build Heap ) and Heapsort algorithms. Analyze the complexity of operations on a heap. CPSC 259 Binary Heaps and Heapsort Page 70