A tree is a connected graph which contains no cycles, meaning there is only one path from one node to another if you aren't traversing the same edge twice.
Every tree can be presented as a hierarchy of nodes, with some arbitrary 'root' node. Say we took '0' as the root node, here's what the hierarchical presentation would look like:
This is called a 'rooted tree'. I've made the tree directed to express its hierarchical nature. '0' is the root, and all adjacent nodes are its children. This means the parent of nodes '1', '2', and '3' is '0'. In the same way, '1' is the parent of '4' and '4' is the child of '1'. Every node except the root node has a parent. By the way, you'll end up with this kind of structure regardless of the node you choose as a root, for example if we use '2' as the root we get this:
Now that we've established that these trees are directed I'll stop using the arrows for simplicity (and because drawing arrowheads on paint is harder than it looks) as soon as my copy-pasting is done.
So let's knock out some more terminology. A 'leaf' is a node with no children i.e. all the nodes at the bottom. This makes sense considering in real trees, leaves don't have anything else growing out of them. Also similarly to real trees, we consider all non-leaf nodes to be 'interior' nodes. Similar to genealogy trees, all nodes on the path from the root (including the root) to a particular node are considered to be that node's 'ancestors'. Technically this includes the node itself, but if you want to exclude the node itself you can use the term 'proper ancestors'. In the above tree, the ancestors of '4' are '4', '1', '0', and '2', and the 'proper' ancestors are '1', '0', and '2'. Same deal with descendants: if node u is on the path from the root to node v, then node v is a descendant of node u, and since node u could be node v itself, a node is its own descendant, hence why the term 'proper descendant' exists, where a node is not its own proper descendant. In practice, most people say ancestor and mean proper ancestor, including Wikipedia, but I'm sticking to the original definition because using 'ancestor'/'proper ancestor' (original definition) is better than using 'ancestor or the node itself'/'ancestor' (more common definition). Also a grandfather node is the parent of a node's parent, and a grandchild node is a child of a node's child. So '0' and '4' make a grandfather/grandchild pair, as do '2' and '3'. Siblings are nodes with the same parent.
A 'subtree' of node v is a tree which is 'rooted' at a child of v. For example in the above tree we can say that if we cut the edge between '0' and '1' then '1' and '4' would make their own tree with '1' being the root i.e. the tree is rooted at '1'. Thus we can say that the part of the tree containing '1' and '4' is a subtree of '0'. A subtree can consist of only 1 node so '5' is a subtree of '2'.
The 'size' of a tree is simply the number of nodes. The 'level' (or 'depth') of a node is the number of edges you need to traverse to get from the root to the node i.e. the length of the path from the root to the node. The height of a node is the length of the longest path from that node to a leaf (without going up any levels).
The 'height' of a tree is the height of that tree's root so the height of the tree above is 3. If a tree with 2 nodes has a height of 1 and a tree with one node has a height of 0 then it follows that a tree with no nodes i.e. an empty tree has a height of -1. Although this breaks the definition, it's convenient considering we usually return '-1' when something goes wrong so if you're asking for the height of a tree assuming it's got a few nodes and it returns '-1' then you think ahhh it's empty.
N-ary trees are trees which put a limit on how many children a node can have. If you want to impose a one-child policy on a tree, then you'd have a 1-ary or unary tree which would really just be a linked list. Allowing a max of two children gives you a binary tree (the subject of this post which we are yet to reach) and three children gives you a ternary tree. Then there's quaternary, quinary, senary, septenary, octonary. The tree above is a ternary tree because no node has more than 3 children.
A 'full' N-ary tree is one where for each level r, the number of nodes is N^r. So for a binary tree of height 3, a full binary tree will have 2^0 = 1 node in level 0, 2^1 = 2 nodes in level 1, 2^2 = 4 nodes in level 2 and 2^3 = 8 nodes in level 3 which totals to 1 + 2 + 4 + 8 = 15 nodes. That number is 1 less than 2^4 and if you add another level i.e. 16 nodes you'll get 31 nodes which is one less than 2^5. The pattern here is that for a full tree of height h you'll have 2^(h+1) - 1 nodes. The binary tree below has a height of 3 and thus has 2^(3+1) - 1 = 15 nodes.
A 'complete' tree is one where each level except the last one contains the max number of nodes. As for the last level, it doesn't need to contain the max number of nodes but it does need to have no gaps going from left to right i.e. if you scanned from left to right, you'll see node, node, node, node, empty slot, empty slot, empty slot etc. If you started from a tree with 1 node and added child nodes left-to-right, level by level in the same way the words I'm typing go left to right, line by line, you'll always have a complete tree.
Alright so we've nailed the terminology and we can now dive into the two types of binary trees that people will usually think of when they hear the term 'binary trees': Binary Search Trees and Heaps.
Binary Search Trees:
With these trees, each node has a value (which I'll call a 'key' from now on) and if it has a left subtree then its key is greater than the key of all the nodes within that subtree, and if it has a right subtree then its key is less than the key of all the nodes within that subtree. This is called 'symmetric' order. We're going to assume each key is unique or we would be in a bit of trouble with ordering nodes whose keys are identical.
If you can recall, last post we talked about how trees are represented as linked lists with each node being the root of a list of edges connected to that node. With binary trees it's a bit different, as we know each node can have at most two children so it's easier to give it 4 elements: identifier (some form of identifier like 'Jake'), key (a value used to compare the node to others), left (a pointer to the left child) and right (a pointer to the right child). Because we only care about the keys of each node we can ignore the 'identifier' element and just use the key itself to identify the node. Most languages will have you write mynode.left to refer to the 'left' element of the node i.e. the pointer to the left child node, and the same with mynode.right, mynode.key etc. Also, a tree can be referred to via a pointer to its root.
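If it helps to see that in an actual language, here's a minimal sketch of what such a node might look like in Python (the class and field names are just my own choices, not anything standard):

class Node:
    def __init__(self, key):
        self.key = key        # the value we compare nodes by
        self.left = None      # pointer to the left child (None means no child)
        self.right = None     # pointer to the right child
        self.parent = None    # not needed yet, but handy later when we look for successors

# a tiny three-node tree: 2 is the root, 1 its left child, 3 its right child
root = Node(2)
root.left, root.right = Node(1), Node(3)
root.left.parent = root.right.parent = root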
The first operation we're going to learn is the inorder-walk where you go through the nodes and do something to all the nodes going from lowest to greatest key. We might be trying to add our keys to an ordered list. Let's use the below graph:
Here's our algorithm:
0 InorderWalk(Tree):
1 if (Tree != NULL) { // if the node actually exists
2 InorderWalk(Tree.left)
3 List.add(Tree.key) //add our node's key to the list
4 InorderWalk(Tree.right)
5 }
And that's it! Let's walk through the algorithm as it applies to the above tree. You first input the pointer to the root node '3' and as this pointer is really taking you to a node (it isn't null) you move on to line 2. Line two just tells you to repeat the process for the root node's left subtree (which has, as its root, '3's left child '2'). So here we do the same thing, check whether the pointer is null and because it isn't we do line 2 again, meaning now we're doing the algorithm for '2's left child which is '1'. Repeat the process for '1' but its pointer to its left child is null because it doesn't have a left child, meaning that this call of the algorithm terminates and we go back up one level and find ourselves back at node '1', and we're now at line 3 meaning we just add '1' to the list. Then we call the algorithm for its right child but once we get to line 1 of that recursion we'll work out '1' doesn't have a right child so we go back up again and now we've passed line 5 of the algorithm for the '1' node so that call terminates and we go back up a level to the call which had node '2' as the input. At this point we're at line 3 and we can add '2' to the list, then we get to line 4 and do what we just did with node '1': work out node '2' has no right child and so go back up the recursion chain until we're back to the root. Here we're at line 3 so we can add the root and then do the algorithm for its right subtree. This continues until we have our list [1,2,3,4,5,7,8].
Here is a poorly drawn description of the process. The blue arrows representing recursive calls (also known as 'winding') and the red arrows represent call terminations (also known as, wait for it, 'unwinding'). The unwindings that occur at the null nodes (i.e. the non-existent nodes) are due to line 1 and all the other unwindings are just the result of getting to the end of the recursive call's code. I might need to do a post on recursion later to explain this a bit better.
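If you'd rather see it in runnable form, here's roughly the same walk in Python, assuming the Node sketch from before and using a plain Python list in place of 'List':

def inorder_walk(node, out):
    # appends the keys to 'out' in ascending order
    if node is not None:               # line 1: does the node actually exist?
        inorder_walk(node.left, out)   # finish the whole left subtree first
        out.append(node.key)           # then visit this node
        inorder_walk(node.right, out)  # then the whole right subtree

keys = []
inorder_walk(root, keys)   # for the tiny tree from the earlier sketch: [1, 2, 3]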
Anyway if anything this should show you that binary search trees are good for storing keys which are easy to put into a sorted list. If we didn't know that lower keys end up in left subtrees and greater keys end up in right subtrees we would need to search through the whole tree and add the minimum to the list with each pass.
Speaking of searching let's look at that. The idea is similar to above except that we'll never need to go back up the tree meaning we won't need to use recursion (as right now we don't have a 'parent' element for the nodes meaning we have no way to go up the tree unless we're backtracking recursive calls or we stored some stuff externally). All we need to do is look at the root: if our target key is less we look at the left child node, if our target is greater we look at the right child node, if the target key is the current node's key then we return the pointer to that node, and if we reach a null pointer it means the target isn't in the tree.
0 Search(target,node):
1 while (node != NULL and target != node.key) {
2 if (target < node.key) {
3 node <-- node.left
4 } else {
5 node <-- node.right
6 }
7 }
8 return node
So here we just keep looking until either we've fallen off the tree (reached a null pointer) or our node's key matches the target key and in both cases we return the pointer to the node. This means if the algorithm returns a null pointer, the target isn't in the tree. Using the above tree as an example, if I was looking for '6', I'd start at the root, realise 6 > 3, look at 7 and realise 6 < 7, look at 5 and realise 6 > 5 and then look at 5's right child and realise it doesn't exist i.e. the pointer is null so I return null. The reason I used 'node' instead of 'tree' in this one, despite them both referring to nodes/ the roots of trees, is that here you're not fully traversing each subtree and you only really care about the nodes and their children. It's really just something to help me conceptualise it.
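In Python the same search might look like this (again leaning on the Node sketch from earlier):

def search(node, target):
    # returns the node holding 'target', or None if it isn't in the tree
    while node is not None and node.key != target:
        if target < node.key:
            node = node.left    # anything smaller lives in the left subtree
        else:
            node = node.right   # anything larger lives in the right subtree
    return node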
Next operation we'll learn is finding the minimum key of a tree. Fairly simple, we just keep going to the next left child until we find a node with no left child, and then return this node:
0 Minimum(node):
1 while (node.left != NULL) {
2 node <-- node.left
3 }
4 return node
With the above tree we would input the pointer to the root, '3', and then it would change the pointer to '3's left child '2' and then it would change the pointer to '2's left child '1' and then considering '1' has no left child i.e. node.left = null, it returns the pointer to '1'. If we only cared about the subtree rooted at '7' we would input a pointer to '7' and the algorithm would return '4's pointer.
Finding the minimum of a subtree is useful in finding the successor of a key. In the above tree with the keys 1,2,3,4,5,7,8 the successor of 5 is 7 i.e. the next largest key. If you want to know the successor of a particular node's key, and it has a right subtree, all you need to do is find the minimum of that tree because everything in the right subtree is larger than the current node's key and you want the minimum of those keys. But what if your node has no right children? This means that in order to find the successor you need to travel up the tree until you have to go to the right to get to the parent i.e. the current node is a left child, meaning its parent is greater and you don't need to keep looking because anything further to the right will be even greater than that. Let's say we have a node '6' in the above tree and it's the right child of '5'. Finding its successor involves going up the tree until you can go right and so we find that '7' is '6's successor.
If we want to implement this algorithm then we'll need to store a pointer in each node which takes us to that node's parent which we'll call node.parent. If node.parent is null then the node is the root of the tree. Also if in our algorithm we find out that node.parent is null then we should return null because it means there is no successor to the node we inputted e.g. if we inputted a pointer to '8' then we'd simply go up the tree until we reach the root and then return its parent i.e. null. We'll store the pointer to the current node's parent as 'above'.
0 Successor(node):
1 if (node.right != NULL) {
2 return Minimum(node.right)
3 } else {
4 above <-- node.parent
5 }
6 while (above != NULL and node == above.right) {
7 node <-- above
8 above <-- above.parent
9 }
10 return above
To find a node's predecessor all we have to do is exchange Minimum with Maximum and right with left. The algorithm which finds the Maximum is the same as the Minimum one except it exchanges left for right.
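Here's a rough Python version of Minimum and Successor, assuming each node carries the .parent pointer described above:

def minimum(node):
    while node.left is not None:
        node = node.left
    return node

def successor(node):
    if node.right is not None:
        return minimum(node.right)   # smallest key in the right subtree
    above = node.parent
    while above is not None and node is above.right:
        node = above                 # keep climbing while we're a right child
        above = above.parent
    return above                     # None means the node had the largest key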
Inserting a node is similar to the search algorithm but when you find a null pointer you want to plug the node in there. We'll assume the new node is already in RAM somewhere and we will input a pointer to the node called 'new'. We'll also input the root of the tree as 'node'.
0 Insert(new, node):
1 if (node == NULL) {
2 node <-- new
3 return
4 }
5 while (TRUE) {
6 if (new.key > node.key) {
7 if (node.right == NULL) {
8 node.right <-- new
9 new.parent <-- node
10 return
11 }
12 node <-- node.right
13 } else {
14 if (node.left == NULL) {
15 node.left <-- new
16 new.parent <-- node
17 return
18 }
19 node <-- node.left
20 }
21 }
This isn't the only way to write the code; you could have done it like this:
0 Insert(new, node):
1 if (node == NULL) {
2 node <-- new
3 return
4 }
5 while (node != NULL) {
6 above <-- node
7 if (new.key > node.key) {
8 node <-- node.right
9 direction <-- right
10 } else {
11 node <-- node.left
12 direction <-- left
13 }
14 }
15 above.direction <-- new
16 new.parent <-- above
This code assumes your programming language will let you make a variable (direction) that refers to an element of a node.
Here's a recursive approach from a youtube channel called mycodeschool:
0 Insert(root,key):
1 if(root == NULL) root <-- new
2 else if(key < root.key) root.left <-- Insert(root.left,key)
3 else root.right <-- Insert(root.right,key)
4 return root
To understand how this last one works, just read on to deletion where an almost identical block of code gets some analysis.
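For reference, here's a Python sketch of that recursive insertion. It takes a key rather than a ready-made node and skips the .parent bookkeeping to keep things short:

def insert(root, key):
    # inserts 'key' into the subtree rooted at 'root' and returns that subtree's root
    if root is None:
        return Node(key)             # found an empty spot, plug the new node in
    if key < root.key:
        root.left = insert(root.left, key)
    else:
        root.right = insert(root.right, key)
    return root

# usage: the caller re-assigns the root each time
tree = None
for k in [3, 2, 7, 1, 5, 8, 4]:      # should rebuild the example tree from earlier
    tree = insert(tree, k)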
Now onto our last operation for binary search trees: deletion. This is by far the most complex but some sneaky recursion will get us through it. We'll use the tree below (also we'll ignore the .parent element for this):
Let's say we wanted to delete node '1'. This is easy, just deallocate the memory in RAM for '1' and set '3's left child pointer to null. So much for nodes with no children.
What about nodes with one child? Let's look at trying to delete '7'. We can just deallocate the memory in RAM for '7' and then set '5's right child to '9':
So the brown part there is us deallocating the memory for '7' which includes its pointer to '9'. The red part is us cutting the tie between '5' and '7' by replacing '5's right child pointer with one that leads to '9' instead of '7' (the blue arrow). Let's look at how our tree looks now:
An important question to ask is: is this still a binary search tree? We know that 9, 8 and 11 are greater than 5 because they all sat in the subtree rooted at '7', and '7' was '5's right child. This means our deletion was completely harmless to the structure because we've retained the property that the nodes in the right subtree of a node have greater keys. If we wanted to delete a node whose one child was a left child, the structure would also be retained. So we've done nodes with no children, nodes with one child, all that's left are nodes with two children. The problem here is deciding which node will take its place. Looking at '9', if we wanted to remove this we could replace its key with 8 or 11 and then delete the node whose key took '9's place, and the structure would stay i.e. if '8' became the new '9', 11 > 8 so the structure is kept and if '11' became the new '9', 8 < 11 so the structure is kept.
Considering the symmetry, let's just say we're going to grab replacements from the right subtree each time we want to delete an interior node (in this case that means using the top alternative).
What if we wanted to delete '15' from the above tree? To make a point, I'll throw a '16' into the tree as '17's left child:
In this situation we can't just take '17' and put that in '15's place because then '17's right subtree will contain 16, 18 and 20 and the 16 being there is not good considering all the values should be greater than 17. The better alternative would be to get the minimum of '15's right subtree and use that as a replacement. The first benefit of this is that we know all other values in the subtree are going to be larger meaning when '16' takes '15's place the structure will hold because the remaining right subtree will still have values greater than 16. The second reason is that minimum values often end up being leaves which themselves are easy to delete. Good thing we've already made a Minimum algorithm. So the process here would be to find the minimum key in '15's right subtree, replace '15's key with that and then delete the now-redundant minimum node from the subtree.
So that's all the possibilities: with 0 children the node simply gets deallocated, with 1 child the node gets deallocated and the gap is bridged between its parent and child, and with 2 children the minimum child in the right subtree takes the place of the node and itself gets deleted, which may involve one of the first two possibilities (but not the third because the minimum node will never have a left child so it will only ever have a max of one child).
So here is our code, you input the root of the tree and the key you want deleted. The algorithm has a recursive approach to searching for the key in the first place which helps it go back up a level to change one of the parent node's child pointers when the target node gets deleted/replaced.
0 Delete(root,key):
1 if(root == NULL) return root // the key isn't in the tree
2 // this is the searching part
3 else if (key < root.key) root.left <-- Delete(root.left,key)
4 else if (key > root.key) root.right <-- Delete(root.right,key)
5 // here's the actual deletion part
6 else {
7 // 0 children
8 if(root.left == NULL and root.right == NULL) {
9 deallocate root
10 root <-- NULL
11 }
12 // 1 child
13 else if(root.left == NULL) {
14 temp <-- root
15 root <-- root.right
16 deallocate temp
17 }
18 else if(root.right == NULL) {
19 temp <-- root
20 root <-- root.left
21 deallocate temp
22 }
23 // 2 children
24 else {
25 temp <-- Minimum(root.right)
26 root.key <-- temp.key
27 root.right <-- Delete(root.right,temp.key)
28 }
29 }
30 return root
Let's go through this code via trying to delete the root itself, '12'. So we input a pointer to the root and the key 12. We can skip lines 1-5 because the root itself has the key we're looking for, meaning we then come to line 24 where we've worked out root.right != null and root.left != null because '12' has two children. We set a temporary pointer value to the minimum of '12's right subtree, which is '13'. We then set 13's key as '12's new key so that was the substitution step, and all that's left is to delete '13' and give the root a new pointer to its right child in case it gets changed, so we kill two birds with one stone in line 27 and just consider the right subtree by inputting '15' as the root and 13 as the key.
So we're back to line 1 in our new recursion, and at line 3 we work out 13 < 15 so we set 15's left child to whatever the result of the next recursion is, inputting '13' as the root and 13 as the key:
So now we're at line 1 again and as our root has the key we want i.e. we want to delete it, we get to line 13 where the root's left child is null and it only has a right child ('14'). So we set our temp pointer to the pointer to our root '13', then set root to '14' and then deallocate '13' which is pointed to by temp. At line 30 we then return the pointer to '14'. So now that we're back to the previous recursion we can tell '15' that its new left child is '14' because the pointer to '14' was returned and assigned to root.left in line 3.
and the cleaner-looking result:
So now that that's done we don't need to worry about any more if conditions and we end up returning our root, which is just the pointer to '15', to the previous recursion (the one we started with). We find ourselves at line 4 and we've just assigned '15' as the right child of our new '13' which it already was so no harm done there. And then we return the original root that we inputted in the first place but that's not really important because we're at the surface now and we don't need to return any more roots. Anyway the final tree looks like this:
That wasn't too hard. Notice that no matter how complex the tree is, only one deallocation of memory will take place, and before that there'll be at most one key substitution (the minimum of a right subtree never has a left child, so deleting it never triggers another substitution).
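And here's a Python sketch of the deletion, matching the pseudocode above (Python garbage-collects for us, so 'deallocating' a node is just a matter of dropping the last reference to it):

def delete(root, key):
    # deletes 'key' from the subtree rooted at 'root' and returns the new root
    if root is None:
        return root                   # the key isn't in the tree
    if key < root.key:
        root.left = delete(root.left, key)
    elif key > root.key:
        root.right = delete(root.right, key)
    else:
        if root.left is None:         # covers both the 0-child and right-child-only cases
            return root.right
        if root.right is None:        # left-child-only case
            return root.left
        temp = minimum(root.right)    # 2 children: the right subtree's minimum steps in
        root.key = temp.key
        root.right = delete(root.right, temp.key)  # then delete the now-redundant node
    return root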
Alright that's all there really is to the structure and operations of binary search trees!
Moving on, let's focus on heaps.
Where the one thing determining the structure of binary search trees was the condition that for a certain node's key, the keys of the left subtree must be smaller and the keys of the right subtree must be larger, with heaps we have something slightly simpler. For a max-heap, a parent node's key must be greater than the keys of its children (and for a min-heap, swap greater with lesser). Unlike in binary search trees we can say 'children' rather than 'subtrees' because if the heap property (what this condition is called) is satisfied for each node, then logically a node's key will be greater than the keys of both its subtrees, whereas with binary search trees it was possible for each node to have a key greater than its left child and less than its right child but for the structure to still be invalid (imagine the root is '3', its two children are '1' and '9', and '9' has a left child of '2'; this isn't a valid structure). So how convenient for us now that we're dealing with heaps!
The second thing about heaps is that they are complete trees i.e. the only level of nodes that may not be filled will be the bottom one, and all the nodes on the bottom level are on the left hand side. This is called the shape property. The shape property is what gives us an incentive to use an array to store the values rather than the linked data structure. Consider a binary search tree which is built from a list of elements [10, 9, 4, 14, 16, 18]. This would not end up being complete. What if they were inputted like [14, 10, 4, 9, 16, 18]? Still not complete. What about [4, 9, 10, 14, 16, 18]? This is just atrocious because it's basically a linked list.
So some binary search trees simply cannot be represented as complete trees, and on top of that, the order in which keys are fed into the tree can leave you with degenerate trees i.e. the last one there. This often leads to a fair bit of wastage with null pointers, but it's nothing compared to the wastage that would occur if you tried representing the tree with an array, as you would need an element slot for every possible node. That degenerate tree has 6 levels, so a full tree with that many levels has 2^6 - 1 = 63 nodes, meaning you'd be wasting 63 - 6 = 57 element slots in the array. On the other hand, no matter what elements you put into a heap, it can be represented by a complete tree (in fact it's a necessity given the shape property). This means arrays are far better suited towards heaps, which can have a node for each element in an array.
So let's look at how you could represent a heap in an array. It's kind-of similar to the arrays storing triangular matrices.
Our array is called A and each node has an element in A. The children of A[i] are A[2i + 1] and A[2i + 2] and the parent of A[i] is A[floor((i-1)/2)]. If we had an array with indexes starting at 1 this would be slightly cleaner but 0-based arrays are the life-force of computer science so we'll go with that. Anyway it is absolutely awesome that you can so efficiently store heaps, although I guess that's in part why they were invented in the first place.
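In Python the index arithmetic is short enough to show outright (the helper names are my own):

def left_child(i):
    return 2 * i + 1

def right_child(i):
    return 2 * i + 2

def parent(i):
    return (i - 1) // 2    # Python's floor division gives us the floor((i-1)/2)

A = [1, 3, 6, 5, 9, 8]     # a small min-heap stored in an array
# children of A[1] = 3 sit at A[3] = 5 and A[4] = 9; the parent of A[4] is back at A[1]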
Let's look at some operations. We'll only consider min-heaps because the difference between min and max heaps is trivial. First up is insert which is going to require a dynamic (resizable) array. You append the new key to the end of the array and then compare it to its parent and if it is larger then you've got no problem but if it's smaller you need to swap the two. Say we want to insert '2' into the heap above. It starts at the bottom-right and because 2 < 6 it gets swapped with '6'. At this point the heap-property from '2' down is conserved because 8 > 6 and 6 > 2 so 8 > 2 meaning we don't ever need to worry about how the left child gets affected. But we're not done because now we need to compare '2' to its parent again and this time 1 < 2 so the heap is fine and the insertion is done. Here's some pseudocode for insertion where you input the array A and the new key.
0 Insert(A,key):
1 add key to A
2 i <-- length(A) - 1
3 while (i > 0 and A[floor((i-1)/2)] > A[i]) {
4 swap(A,floor((i-1)/2),i)
5 i <-- floor((i-1)/2)
6 }
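Or, as a runnable Python sketch (using a plain list as the dynamic array):

def heap_insert(A, key):
    A.append(key)                     # the new key starts at the bottom of the heap
    i = len(A) - 1
    while i > 0 and A[(i - 1) // 2] > A[i]:
        A[(i - 1) // 2], A[i] = A[i], A[(i - 1) // 2]   # swap the key with its parent
        i = (i - 1) // 2              # and keep sifting up from the parent's slot

heap_insert(A, 2)   # with the array from the previous sketch: 2 swaps with the 6, then stops below the 1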
Toooo easy. Let's try deletion. With heaps you can only delete the root from the tree (similar to how you can only delete the element at the front of a queue) but this means you need to replace it with something. You could replace it with the next best thing but then that needs to be replaced by something, and it's strenuous to follow that chain down to the last element in the array (and you would need to do this because otherwise you won't have a complete tree by the end), let alone have a heap left over by the time you've done all the swaps. A better way is to just swap the last element in the array with the root, then delete the old root (which is now sitting at the end of the array). Now you've got a root which probably won't satisfy the heap property because it will tend to be larger than its children, so we can just swap it with the smaller child (meaning the new root will be smaller than both its children) and repeat the process until the key which started at the bottom has trickled back down there, leaving behind a heap which satisfies both the heap and shape properties. By swapping the root with the last element we ensure the shape property is preserved because once you've deleted the key you wanted to delete, the tree is still complete, and after that all you're doing is swapping, meaning it will stay complete.
The deletion part (we'll call it Extract-Min) involves swapping the root with the last element and deleting it. The rearranging part is called min-heapify-ing. The min-heapify function takes a parent and its two children and satisfies the heap property for them, and if the parent needed to be swapped for that to happen then it repeats the process for that key and its new children until it is smaller than both of its children. This assumes the two original children are already smaller than their own children when the process starts, so we only need to worry about the parent, and luckily this is the exact situation we have when we swap the root of a heap with the last element and try to satisfy the heap property again through a number of swaps.
0 ExtractMin(A):
1 min <-- A[0]
2 swap(A,0,length(A) - 1)
3 remove A[length(A) - 1]
4 Min-Heapify(A,0)
5 // the return is for if you wanted to 'pop' the root off the heap
6 return min
0 Min-Heapify(A,i):
1 left <-- 2i + 1
2 right <-- 2i + 2
3 smallest <-- i
4 if (left <= length(A) - 1 and A[left] < A[smallest]) {
5 smallest <-- left
6 }
7 if (right <= length(A) - 1 and A[right] < A[smallest]) {
8 smallest <-- right
9 }
10 if (smallest != i) {
11 swap(A,i,smallest)
12 Min-Heapify(A, smallest)
13 }
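Here are both of those in Python, on the same array-as-heap representation (the function names are mine; in practice Python's own heapq module does all of this for you):

def min_heapify(A, i):
    # assumes the subtrees below i are already heaps; sinks A[i] to where it belongs
    left, right = 2 * i + 1, 2 * i + 2
    smallest = i
    if left < len(A) and A[left] < A[smallest]:
        smallest = left
    if right < len(A) and A[right] < A[smallest]:
        smallest = right
    if smallest != i:
        A[i], A[smallest] = A[smallest], A[i]
        min_heapify(A, smallest)      # keep sinking from the child's slot

def extract_min(A):
    root_key = A[0]
    A[0], A[-1] = A[-1], A[0]         # move the last element up to the root
    A.pop()                           # drop the old root off the end of the array
    if A:
        min_heapify(A, 0)             # let the new root trickle back down
    return root_key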
The final operation we're going to look at for heaps is building a heap i.e. taking an array of keys and rearranging it so that the heap property is satisfied. There are two ways of doing this: the sift-down approach and the sift-up approach. When we dealt with insertions you would put your new key at the bottom of the heap and then it would be compared to its parent and if smaller it would be swapped with the parent. This process is called sifting-up. With deletions you're getting the last element and then making it the root and allowing it to go back towards the bottom until it finds a place where it's no longer violating the heap property. This is called sifting down, which uses the min-heapify function.
The sift-up approach is to just get your array and start inserting elements into your heap from scratch. So you may have 15 elements but at the beginning you only look at the first and you make that the root, then you look at the 2nd and that goes one level down and swaps with the root if need be, then this process continues with adding more to the heap and having them sift up towards the top if they need to. Every single element will need to be inserted so we know our complexity is at least O(n), but we also need to consider how many moves each element may take once it's in the heap. When you insert the root (level 0) it can't move because it's the only node in the heap. Then in the next level (level 1) you'll have two nodes which each may need to move to the top level i.e. one move. Then on level 2 you've got 4 nodes which each may move 2 times. The pattern here is that for any level k you'll have 2^k nodes moving k times in the worst-case. So the total amount of moves is going to be:
2^0 * 0 + 2^1 * 1 + 2^2 * 2 + 2^3 * 3 ... + 2^h * h where h is the height of the completed heap.
We worked out above (far far above) that the number of nodes in a full binary tree is given by 2^(h+1) - 1. Working backwards, the height h is log2(n+1) - 1. However, different numbers of nodes can give the same height e.g. 4, 5, 6 and 7 nodes all correspond to a height of 2. The solution is to ceil the log2(n+1) part so that an incomplete row is counted as a full row, because either way it causes the same height, hence h = ceil(log2(n+1)) - 1. That whole ceiling thing is just a technicality which doesn't matter for this next part. So if we substitute that equation into the 2^h * h above we get 2^(ceil(log2(n+1))-1) * (ceil(log2(n+1))-1), which is O(nlogn) because the first bit is O(n) and the second bit is O(logn), and all the hairy parts just fall out as constants or smaller terms. So knowing that complexity, the complexity of 2^0 * 0 + 2^1 * 1 + 2^2 * 2 + 2^3 * 3 ... + 2^h * h is also O(nlogn) because all the smaller terms don't matter compared to 2^h * h and the processing cost per move will just be an arbitrary constant. Not too bad as far as complexity goes.
Here's the pseudocode: (you'll need two lists, the list that you get your keys from (B) and the list for the heap itself (A))
0 Build-Heap-One(A,B):
1 for i <-- 0 to length(B) - 1
2 Insert(A,B[i])
The sift-down approach works from the bottom-up. You look at all the bottom interior nodes (you can ignore the leaves because they don't have any children and thus already satisfy the heap property) and compare them to their children and if swapping needs to happen, do so. Then go one level up and do the same, looking at the roots and swapping them down if you need to; potentially down to the leaf level. The beauty of this approach is that you get to ignore all the leaves which make up half the elements. So there will be 2^h leaves and you don't need to move any (they may be moved on behalf of higher nodes later but that will be factored in to the total number of moves), and there will be 2^(h-1) nodes above that level and they will do a max of 1 move, and the nodes above them will do a max of two moves, all the way to the root which will do a max of h moves. So the total number of moves is:
2^h * 0 + 2^(h-1) * 1 + 2^(h-2) * 2 ... + 2^0 * h
This is exactly the same as what we had above except that the 2^x terms have been reversed. Here's what our two #moves sums look like:
We already worked out the sift-up (insertion) approach was O(nlogn) so let's see what we can get out of our bottom-up (sift-down) approach:
So if our sum is O(logn) then we'll end up with the same complexity as the sift-up approach, but if we can show that it's O(1) (i.e. it's just a constant) then it means our total complexity is O(n) * O(1) = O(n). Let's take the absolute worst case scenario and say that the height of our heap is infinity i.e. h = infinity, and if the sum proves to be a constant then obviously for any h our sum will be a constant. Well I'm not going to prove it here but the infinite series k/2^k converges to 2 i.e. it is bounded by a constant, meaning the sum for any h is O(1) and so our total complexity for the bottom-up approach is O(n). An intuitive explanation for why it has a lower complexity is that with the sift-up approach we had the majority of nodes (the leaves) having to move the most in the worst case scenario and the root having to move the least, whereas here we have the leaves not moving at all and the root is the node which moves the most in the worst-case. So we have a whole lot less moving; so much so that for enormous values of n we never have more than 2n moves when building the heap.
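If you don't feel like proving the convergence, a quick numerical check in Python is convincing enough; the partial sums of k/2^k creep up towards 2 and never pass it:

print(sum(k / 2**k for k in range(1, 10)))    # roughly 1.98
print(sum(k / 2**k for k in range(1, 60)))    # as good as 2.0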
Here's the pseudo-code: (We don't need two lists to do this because it works in-place)
0 Build-Heap(A):
1 for i <-- floor((length(A) - 1)/2) downto 0
2 Min-Heapify(A,i)
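In Python (reusing the min_heapify sketch from before):

def build_heap(A):
    # start at the last interior node (the parent of the last element) and work back to the root
    for i in range((len(A) - 2) // 2, -1, -1):
        min_heapify(A, i)

B = [10, 9, 4, 14, 16, 18]
build_heap(B)    # B is rearranged in place into a valid min-heap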
Now that we know how to build a heap and how to extract the root of the heap we can combine some functions to make a heap-sort. The idea is to build a heap from an array of unsorted items and then keep extracting the root (which will always be the minimum) and putting that into another list. A is the unsorted list which we use to store the heap and B is the sorted list:
0 Heapsort-One(A,B):
1 Build-Heap(A)
2 while (length(A) > 0)
3 add ExtractMin(A) to B
This is pretty good but it would be better to do an in-place sort. If we use a max-heap where the root has the greatest key, we can continually extract it from the top of the heap and put it at the end of the array rather than putting it in a new array. The extract function we have already swaps the root with the last element and then removes it, but we want to change that a bit: rather than deleting the old root once it's been moved to the end of the array, we just move a marker one slot to the left to indicate the new 'last' node in the heap. That way we can continually remove the max from the heap, put it into our sorted list which grows from right-to-left, and end up with an empty heap and a sorted array. Here's the code, with modified functions to help us out (the extract-max has just been melded into my heapsort algorithm in the for loop):
0 Heapsort(A):
1 Build-Max-Heap(A) // same as Build-Heap but calling Max-Heapify
2 for end <-- length(A) - 1 downto 1
3 swap(A, 0, end)
4 Max-Heapify(A, 0, end - 1)
0 Max-Heapify(A, i, end):
1 left <-- 2i + 1
2 right <-- 2i + 2
3 largest <-- i
4 if (left <= end and A[left] > A[largest]) {
5 largest <-- left
6 }
7 if (right <= end and A[right] > A[largest]) {
8 largest <-- right
9 }
10 if (largest != i) {
11 swap(A, i, largest)
12 Max-Heapify(A, largest, end)
13 }
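And here's a runnable Python version of the whole in-place sort, with the same 'end' marker idea: build the max-heap with the sift-down approach, then repeatedly swap the max to the back and shrink the heap by one.

def max_heapify(A, i, end):
    # 'end' is the last index that still belongs to the heap
    left, right = 2 * i + 1, 2 * i + 2
    largest = i
    if left <= end and A[left] > A[largest]:
        largest = left
    if right <= end and A[right] > A[largest]:
        largest = right
    if largest != i:
        A[i], A[largest] = A[largest], A[i]
        max_heapify(A, largest, end)

def heapsort(A):
    for i in range((len(A) - 2) // 2, -1, -1):   # build the max-heap, bottom-up
        max_heapify(A, i, len(A) - 1)
    for end in range(len(A) - 1, 0, -1):
        A[0], A[end] = A[end], A[0]              # the current maximum goes to its final slot
        max_heapify(A, 0, end - 1)               # restore the heap over A[0..end-1]

data = [10, 9, 4, 14, 16, 18]
heapsort(data)    # data is now [4, 9, 10, 14, 16, 18]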
In terms of complexity we first build the heap which is O(n) and then we get each node (O(n)) and potentially move them all the way to the leaves (this is the worst case for half the nodes at least) which is O(logn) meaning we end up with O(n) + O(nlogn) which is O(nlogn), so it's a pretty good complexity for a sorting algorithm.
There we go, we have now done binary search trees and heaps, next up is spanning tree algorithms.