Learn how AVL trees are rebalanced through rotations as integers are inserted and removed. Understand how to reduce the extra bits needed for balancing, how to support range queries efficiently, and how AVL trees can be used for sorting and for order-of-insertion queries.
Foundations of Data Structures Practical Session #7 AVL Trees 2
AVL Tree example • Trees are written as key(left, right), with - for a missing child: 14(11(7(4, 8), 12(-, 13)), 17(-, 53))
Question 1 • Insert the following sequence of integers into an empty AVL tree: 14, 17, 11, 7, 53, 4, 13 • After inserting 4, the sub-tree of 11 is unbalanced: 14(11(7(4, -), -), 17(-, 53))
A single right rotation of '11' is executed to rebalance the tree. • Insert 13: 14(7(4, 11(-, 13)), 17(-, 53))
Now insert 12: 14(7(4, 11(-, 13(12, -))), 17(-, 53)) • The sub-tree of 11 is unbalanced. • Required double rotation: right and then left.
After right rotation of '13': 14(7(4, 11(-, 12(-, 13))), 17(-, 53)) • Now left rotate '11'
After left rotation of '11': 14(7(4, 12(11, 13)), 17(-, 53)) • Now balanced!
Now insert 8: 14(7(4, 12(11(8, -), 13)), 17(-, 53)) • The sub-tree of 7 is unbalanced. • Required double rotation: right and then left.
After right rotation of '12': 14(7(4, 11(8, 12(-, 13))), 17(-, 53)) • Now left rotate '7'
After left rotation of '7': 14(11(7(4, 8), 12(-, 13)), 17(-, 53)) • Now balanced!
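The insertions above can be replayed in code. The following is a minimal Python sketch, not the implementation shown in class: it assumes each node keeps a height field (an empty tree has height -1), and the names Node, rebalance, insert and level_order are illustrative.

```python
from collections import deque


class Node:
    def __init__(self, key):
        self.key = key
        self.left = None
        self.right = None
        self.height = 0  # a single node has height 0


def height(node):
    return node.height if node else -1


def update(node):
    node.height = 1 + max(height(node.left), height(node.right))


def balance(node):
    # height of right sub-tree minus height of left sub-tree
    return height(node.right) - height(node.left)


def rotate_right(y):
    x = y.left
    y.left, x.right = x.right, y
    update(y)
    update(x)
    return x


def rotate_left(x):
    y = x.right
    x.right, y.left = y.left, x
    update(x)
    update(y)
    return y


def rebalance(node):
    update(node)
    b = balance(node)
    if b < -1:                          # left-heavy
        if balance(node.left) > 0:      # double rotation: left and then right
            node.left = rotate_left(node.left)
        return rotate_right(node)
    if b > 1:                           # right-heavy
        if balance(node.right) < 0:     # double rotation: right and then left
            node.right = rotate_right(node.right)
        return rotate_left(node)
    return node


def insert(node, key):
    if node is None:
        return Node(key)
    if key < node.key:
        node.left = insert(node.left, key)
    else:
        node.right = insert(node.right, key)
    return rebalance(node)              # fix balance on the way back to the root


def level_order(root):
    out = []
    queue = deque([root] if root else [])
    while queue:
        node = queue.popleft()
        out.append(node.key)
        queue.extend(c for c in (node.left, node.right) if c)
    return out


root = None
for k in [14, 17, 11, 7, 53, 4, 13, 12, 8]:
    root = insert(root, k)
print(level_order(root))  # [14, 11, 17, 7, 12, 53, 4, 8, 13]
```

The level-order listing printed at the end matches the final tree of the walkthrough, read level by level.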
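Remove 53 • Tree before the removal: 14(11(7(4, 8), 12(-, 13)), 17(-, 53))
Unbalanced! 14(11(7(4, 8), 12(-, 13)), 17) • Right rotate '14'
Balanced! 11(7(4, 8), 14(12(-, 13), 17)) • Remove 11
Remove 11 • Replace it with the maximum in its left branch, which is 8: 11(7(4, 8), 14(12(-, 13), 17))
After removing 11: 8(7(4, -), 14(12(-, 13), 17)) • Now remove 8
Unbalanced! 7(4, 14(12(-, 13), 17)) • 8 was replaced with the maximum in its left branch, 7. • Required double rotation: right and then left.
After right rotation of '14': 7(4, 12(-, 14(13, 17)))
After left rotation of '7': 12(7(4, -), 14(13, 17))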
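The removals above can be checked the same way. This is a hedged Python sketch under a few assumptions: the starting tree is built directly from a nested description rather than by insertions, a node with two children is replaced by the maximum of its left branch (as in the slides), and the helper names build, delete and level_order are illustrative.

```python
def height(node):
    return node.height if node else -1


class Node:
    def __init__(self, key, left=None, right=None):
        self.key, self.left, self.right = key, left, right
        self.height = 1 + max(height(left), height(right))


def update(node):
    node.height = 1 + max(height(node.left), height(node.right))


def rotate_right(y):
    x = y.left
    y.left, x.right = x.right, y
    update(y)
    update(x)
    return x


def rotate_left(x):
    y = x.right
    x.right, y.left = y.left, x
    update(x)
    update(y)
    return y


def rebalance(node):
    update(node)
    b = height(node.right) - height(node.left)
    if b < -1:
        if height(node.left.right) > height(node.left.left):
            node.left = rotate_left(node.left)
        return rotate_right(node)
    if b > 1:
        if height(node.right.left) > height(node.right.right):
            node.right = rotate_right(node.right)
        return rotate_left(node)
    return node


def delete(node, key):
    if node is None:
        return None
    if key < node.key:
        node.left = delete(node.left, key)
    elif key > node.key:
        node.right = delete(node.right, key)
    elif node.left is None:
        return node.right
    elif node.right is None:
        return node.left
    else:
        pred = node.left                 # maximum in the left branch
        while pred.right:
            pred = pred.right
        node.key = pred.key
        node.left = delete(node.left, pred.key)
    return rebalance(node)


def build(spec):
    # spec is None, a bare key (leaf), or a tuple (key, left_spec, right_spec)
    if spec is None:
        return None
    if isinstance(spec, int):
        return Node(spec)
    key, left, right = spec
    return Node(key, build(left), build(right))


def level_order(root):
    out, queue = [], ([root] if root else [])
    for node in queue:                   # the list grows while we iterate over it
        out.append(node.key)
        queue.extend(c for c in (node.left, node.right) if c)
    return out


START = (14, (11, (7, 4, 8), (12, None, 13)), (17, None, 53))
root = build(START)
for k in [53, 11, 8]:
    root = delete(root, k)
    print("after removing", k, level_order(root))
```

Each printed level-order listing corresponds to one of the balanced trees in the walkthrough.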
Question 2 • In class we've seen an implementation of an AVL tree where each node v has an extra field h, the height of the sub-tree rooted at v. The height can be used to balance the tree. • How many bits are required to store the height in a node? • Answer: For an AVL tree with n nodes, h = O(log n), so storing it requires O(log log n) extra bits. • How can we reduce the number of extra bits necessary for balancing the AVL tree? • Suggest an algorithm for computing the height of a given AVL tree given in the representation you suggested in the previous item.
Question 2 solution • Instead of a height field, which is redundant, each node will store 2 balance bits, holding the difference between the heights of its right and left sub-trees. • Two bits suffice because the difference can take only one of three values: -1, 0, 1. (The leftmost bit represents the sign.) • The balance field must be updated on insert and delete operations, along the path to the root.
Question 2 solution • To compute the height of a tree, follow the path from the root to the deepest leaf by reading the balance field. If a sub-tree is heavier on one side, the deepest leaf resides on that side; if it is balanced, either side leads to a deepest leaf.
CalcHeight(T)
    if T == null
        return -1
    if T.balance == -1 or T.balance == 0
        return 1 + CalcHeight(T.left)
    else
        return 1 + CalcHeight(T.right)
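As an illustration, CalcHeight can be written in Python and compared against a full recursive height computation. This is a sketch under assumptions: the balance field is height(right) - height(left) as defined in the solution, and set_balances is a hypothetical helper used here only to fill in the balance fields of a hand-built tree.

```python
class Node:
    def __init__(self, key, left=None, right=None):
        self.key, self.left, self.right = key, left, right
        self.balance = 0  # height(right) - height(left); -1, 0 or 1 in an AVL tree


def set_balances(node):
    """Fill in the balance fields bottom-up and return the true height (O(n))."""
    if node is None:
        return -1
    hl, hr = set_balances(node.left), set_balances(node.right)
    node.balance = hr - hl
    return 1 + max(hl, hr)


def calc_height(node):
    """Height from balance fields only: follows one root-to-leaf path."""
    if node is None:
        return -1
    if node.balance <= 0:               # left side is at least as deep
        return 1 + calc_height(node.left)
    return 1 + calc_height(node.right)  # right side is deeper


# The example tree 14(11(7(4, 8), 12(-, 13)), 17(-, 53)).
tree = Node(14,
            Node(11, Node(7, Node(4), Node(8)), Node(12, None, Node(13))),
            Node(17, None, Node(53)))
true_height = set_balances(tree)
print(calc_height(tree), true_height)  # 3 3
```

Because calc_height visits a single path, it runs in O(log n) time on an AVL tree, while the full recursion visits every node.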
Question 3 • Suggest two ways for an AVL tree to support a query for retrieving all the keys in the range [a, b] in O(log n + k) time, where k is the number of keys in the range.
Question 3 solution • Store in each node pointers to its predecessor and successor. • Requires updating on insert and delete operations. • Finding the successor/predecessor requires O(h) time, equivalent to O(log n) in an AVL tree, thus the asymptotic time of the insert and delete operations is unchanged.
Question 3 solution • Use the following claim: • “Starting at any node in a height-h BST, k successive calls to TREE-SUCCESSOR take O(h + k) time.” • Doesn’t require extra pointers. • Doesn’t require modifications to the insert and delete operations.
Question 3 solution • Reminder:
TREE-SUCCESSOR(x)
    if x.right != NULL
        then return TREE-MINIMUM(x.right)
    y ← x.parent
    while y != NULL and x == y.right
        do x ← y
           y ← y.parent
    return y
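A range query built on TREE-SUCCESSOR might look like the following Python sketch. It makes some assumptions for brevity: nodes carry parent pointers, the tree is built with a plain (unbalanced) BST insert since balancing does not affect the query logic, and the names bst_insert and range_query are illustrative.

```python
class Node:
    def __init__(self, key, parent=None):
        self.key, self.parent = key, parent
        self.left = self.right = None


def bst_insert(root, key):
    """Plain BST insert that maintains parent pointers (no rebalancing)."""
    if root is None:
        return Node(key)
    node = root
    while True:
        side = "left" if key < node.key else "right"
        child = getattr(node, side)
        if child is None:
            setattr(node, side, Node(key, node))
            return root
        node = child


def tree_minimum(x):
    while x.left is not None:
        x = x.left
    return x


def tree_successor(x):
    if x.right is not None:
        return tree_minimum(x.right)
    y = x.parent
    while y is not None and x is y.right:
        x, y = y, y.parent
    return y


def range_query(root, a, b):
    # Step 1: descend to the node with the smallest key >= a, O(h).
    node, start = root, None
    while node is not None:
        if node.key >= a:
            start, node = node, node.left
        else:
            node = node.right
    # Step 2: successive successor calls until the key exceeds b, O(h + k).
    out = []
    while start is not None and start.key <= b:
        out.append(start.key)
        start = tree_successor(start)
    return out


root = None
for k in [14, 11, 17, 7, 12, 53, 4, 8, 13]:
    root = bst_insert(root, k)
print(range_query(root, 8, 14))  # [8, 11, 12, 13, 14]
```

Inserting the keys in level order reproduces the shape of the example AVL tree, so here h = 3.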
Question 3 solution • Claim: “Starting at any node in a height-h BST, k successive calls to TREE-SUCCESSOR take O(h + k) time.” • Proof outline • Let x be the starting node and y be the ending node after k successive calls to TREE-SUCCESSOR. • Let p be the simple path between x and y inclusive. • Let z be the common ancestor of x and y that p visits. • The length of p is at most 2h, which is O(h). • Let output be the elements with keys between x.key and y.key inclusive. • The size of the output is O(k). • In the execution of k successive calls to TREE-SUCCESSOR, each node in p is visited at most 3 times (on the way to its left, right and up). • Besides the nodes x and y, if a sub-tree of a node in p is visited then all its elements are in output. • Hence, the total running time is O(h + k). In an AVL tree h = O(log n), which gives O(log n + k).
Question 4 • Suggest an efficient algorithm for sorting an array of n numbers. Analyze its running time and required space.
Question 4 solution • Step 1: Define a new empty AVL tree, T. • Step 2: Traverse the input array and insert each item into T. • Step 3: Traverse the tree T in an in-order manner, copying the items back to the array. • Time: step 2 requires O(n log n), step 3 requires O(n). In total O(n log n). • Extra space: an AVL tree of size n requires O(n) extra space.
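The three steps can be sketched in Python as follows. This is an illustrative version, not the course's code: it assumes a height field per node, sends equal keys to the right sub-tree so duplicates are kept, and the name avl_sort is hypothetical.

```python
class Node:
    def __init__(self, key):
        self.key, self.left, self.right, self.height = key, None, None, 0


def height(node):
    return node.height if node else -1


def update(node):
    node.height = 1 + max(height(node.left), height(node.right))
    return node


def rotate_right(y):
    x = y.left
    y.left, x.right = x.right, y
    update(y)
    return update(x)


def rotate_left(x):
    y = x.right
    x.right, y.left = y.left, x
    update(x)
    return update(y)


def insert(node, key):
    if node is None:
        return Node(key)
    if key < node.key:
        node.left = insert(node.left, key)
    else:                                # equal keys go right, so duplicates survive
        node.right = insert(node.right, key)
    update(node)
    b = height(node.right) - height(node.left)
    if b < -1:
        if height(node.left.right) > height(node.left.left):
            node.left = rotate_left(node.left)
        return rotate_right(node)
    if b > 1:
        if height(node.right.left) > height(node.right.right):
            node.right = rotate_right(node.right)
        return rotate_left(node)
    return node


def in_order(node):
    if node is not None:
        yield from in_order(node.left)
        yield node.key
        yield from in_order(node.right)


def avl_sort(arr):
    root = None                          # step 1: empty tree
    for x in arr:                        # step 2: n inserts, O(n log n)
        root = insert(root, x)
    for i, key in enumerate(in_order(root)):
        arr[i] = key                     # step 3: in-order copy back, O(n)
    return arr


print(avl_sort([14, 17, 11, 7, 53, 4, 13, 12, 8]))
```

Rotations preserve the in-order sequence of the tree, which is why the copy-back step yields a sorted array even when duplicates are present.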
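Question 5 • Suggest a data structure for storing integers that supports the following operations: Init(), Insert(x), Delete(x), DeletePlace(i), which deletes the element in the i-th place by order of insertion, and GetPlace(x), which returns the place of x by order of insertion.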
Question 5 solution • For example, for the following sequence of actions: • Insert(3), Insert(5), Insert(11), Insert(4), Insert(7), Delete(5) • GetPlace(7) returns 4, and DeletePlace(2) will delete 11. • The solution • We will use two AVL trees: • T1 stores the elements by their key. • T2 stores the elements by the order of insertion (using a running counter). • There are pointers between the two trees connecting the two nodes of the same element.
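Question 5 solution • Init() – initialize two empty trees. • Insert(x) – insert the element by its key into T1, insert it by its order of insertion into T2 and update the pointers. • Delete(x) – find the element in T1 and delete it from both trees, following the pointer. • DeletePlace(i) – find the node in the i-th place in T2 and delete it from both trees, following the pointer. • GetPlace(x) – find the node with key x in T1, follow the pointer to its copy in T2 and return its place in T2. • Note: after deletions the counter values have gaps, so the place of a node is its rank among the keys of T2 (in the example, 7 was inserted fifth but GetPlace(7) returns 4). The rank can be found in O(log n) time if each node of T2 also stores the size of its sub-tree.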
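The intended behaviour of the operations can be illustrated with a small Python sketch. To stay short it deliberately swaps the two AVL trees for simpler stand-ins: a dict plays the role of T1 and a sorted list of insertion counters plays the role of T2, so deletions here cost O(n) rather than O(log n). The class name PlaceStore and its method names are hypothetical, and keys are assumed to be distinct.

```python
import bisect


class PlaceStore:
    def __init__(self):                    # Init()
        self.counter = 0                   # running insertion counter
        self.stamp_of = {}                 # stand-in for T1: key -> counter value
        self.key_of = {}                   # the "pointer" from T2 back to the key
        self.stamps = []                   # stand-in for T2: sorted counter values

    def insert(self, x):                   # Insert(x)
        self.counter += 1
        self.stamp_of[x] = self.counter
        self.key_of[self.counter] = x
        self.stamps.append(self.counter)   # counters only grow, so the list stays sorted

    def delete(self, x):                   # Delete(x)
        stamp = self.stamp_of.pop(x)
        del self.key_of[stamp]
        self.stamps.pop(bisect.bisect_left(self.stamps, stamp))

    def get_place(self, x):                # GetPlace(x): rank of x's counter in T2
        return bisect.bisect_left(self.stamps, self.stamp_of[x]) + 1

    def delete_place(self, i):             # DeletePlace(i): i-th smallest counter in T2
        x = self.key_of[self.stamps[i - 1]]
        self.delete(x)
        return x


s = PlaceStore()
for x in [3, 5, 11, 4, 7]:
    s.insert(x)
s.delete(5)
print(s.get_place(7))     # 4
print(s.delete_place(2))  # 11
```

Replacing the sorted list with an AVL tree whose nodes store sub-tree sizes would keep the same interface while bringing every operation down to O(log n).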