CM10228 / Programming II:   Lecture 4


Sorting Algorithms & Complexity


I. Big O Notation.

  1. Normally, certain aspects of the algorithm (for example, its complexity, or the duration of its longest operations) have the most impact on how long it takes.  These aspects are said to dominate.
  2. To make it clear that we are only talking about the dominant factor when we analyze an algorithm, we talk about the order of the algorithm. 
    1. The order has its own notation, called "big O" and written like this:  O(n) (read "Order n").
    2. Order notation throws out constants.  For example, if an algorithm takes 2n time (for example, reads through a whole list twice) it is still O(n), the same order as if you only went through the list once.
    3. Sometimes constants matter – for example, if it's going to take a year to go through the list – but usually they don't.
  3. Since "N is the length of the original list, the Number of items", a function of linear complexity is said to be O(n).
  4. Just as constants don't matter, neither do lower-order components of an algorithm.  So if an algorithm actually takes n^5 + n^3 + 7, we say its complexity is O(n^5).
  5. This is only for addition!  Obviously if the algorithm takes n^5 * n^3, its complexity is O(n^8)!
  6. Since ultimately the most important thing about algorithms isn't their exact rate of growth, but rather what the curves look like (remember the graph) we often don't even bother to specify the base for a logarithm or the exponent for a polynomial-time algorithm.  We just say "logarithmic" or "polynomial".
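As a small illustration of throwing out constants (a sketch of my own, not from the lecture): the method below reads the whole list twice --- 2n steps --- but it is still O(n).

    // Hedged example: two passes over the array is 2n work, but still O(n).
    static int countOfMax(int[] a) {
        int max = a[0];
        for (int x : a) if (x > max) max = x;    // first pass: find the largest value
        int count = 0;
        for (int x : a) if (x == max) count++;   // second pass: count how often it appears
        return count;                            // about 2n steps in total: O(n)
    }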

II. Types of Complexity & Their Dominance Relations

This table is shamelessly cribbed from Gerald Kruse's page on Algorithm Efficiency (which also tells you about big-O arithmetic!  Something I've never used, but then I do AI & psychology, not theory.)

Let  a, b, c, and d denote constants

In order of dominance (top is most dominant).  Realize that algorithms of higher dominance are bad!

Function    condition                   common name

N^n
N!                                      N factorial
a^n         dominates b^n if a > b      exponential
N^c         dominates N^d if c > d      polynomial (cubic, quadratic, etc.)
n log n                                 n log n
n                                       linear
log n                                   logarithmic (regardless of base)
1                                       constant



III. Criteria for Evaluating an Algorithm, Re-revisited 

  1. What do you need to know about algorithms and complexity?
    1. Worst case,
      1. If everything is organized in the worst possible way, how long will it take?
    2. Best case,
      1. If you are really lucky, what's the fastest it could run?
    3. Average or expected case.
      1. What is the most probable situation?
  2. For example, with our tree-searching algorithm, the best case is log2 n, and the worst case (if everything was already sorted when we started, so the tree is just one long list) is n.  But if we know that the original ordering is random, then we can be fairly certain that the real cost is pretty close to log2 n.
  3. In the old days, people were mostly obsessed with the worst case, but nowadays we often care more about the expected case.  This is because if we approach the problem as a system issue rather than strictly an algorithmic one, we can often recognize & terminate a worst-case scenario.  But this depends on how bad and how frequent worst cases are.
  4. Technically, Big O notation should be used to give a boundary for the worst case.  Follow that link (to Wikipedia) to see the formal definition of other sorts of notation for defining complexity bounds.  But computer scientists tend to be sloppy & use the same notation & just say in words "average case is..."
  5. Old notes from previous lecturer said: "Average case analysis is mathematically much harder to perform, and for many algorithms is not known."
  6. One practical way to determine the average case is with a stopwatch (well, anyway, to run statistics!) --- see the sketch after this list.
  7. Here are some folks who used a stopwatch on the sorting algorithms (their page has since gone away, so I had to get it from the Wayback Machine).
    1. Notice they get more precise results than we'll get below.
    2. But notice also that the difference between O(n^2) and O(n log n) matters a lot more!
  8. Things to remember about big O values:
    1. Since it is an upper bound on running time, performance could be much better.
    2. The input that causes worst case performance could be very rare --- and it might be recognizable or avoidable.
    3. You don't know what the constants are ---
      1. May be very large.
      2. Even a constant of 2 matters a lot if your operation is going to take 3 years (e.g. database conversions.)
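Here's roughly what that stopwatch looks like in code (a sketch of my own; the run count and array size are arbitrary, and I've used the library sort where you could plug in one of the sorts from Lecture 2):

    import java.util.Arrays;
    import java.util.Random;

    // Hedged sketch: estimate the average (expected) case by timing many random inputs.
    public class Stopwatch {
        public static void main(String[] args) {
            Random rng = new Random();
            int runs = 100, size = 10000;          // arbitrary choices for illustration
            long total = 0;

            for (int r = 0; r < runs; r++) {
                int[] data = new int[size];
                for (int i = 0; i < size; i++) data[i] = rng.nextInt();

                long start = System.nanoTime();    // start the stopwatch
                Arrays.sort(data);                 // swap in your own sort to compare
                total += System.nanoTime() - start;
            }
            System.out.println("average time per sort: " + (total / runs) + " ns");
        }
    }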

IV. How Bad is Bad?

Another table shamelessly cribbed from Gerald Kruse's page on Algorithm Efficiency.

Table of growth rates

linear   log      n log n   quadratic  cubic        exponential        exponential  factorial
N        log2 N   N*log2 N  N^2        N^3          2^N                3^N          N!

1        0        0         1          1            2                  3            1
2        1        2         4          8            4                  9            2
4        2        8         16         64           16                 81           24
8        3        24        64         512          256                6,561        40,320
16       4        64        256        4,096        65,536             43,046,721   2.09E+13
32       5        160       1,024      32,768       4,294,967,296      1.85E+15     2.63E+35
64       6        384       4,096      262,144      1.84E+19 (Note 1)  3.43E+30     1.27E+89
128      7        896       16,384     2,097,152    3.40E+38 (Note 2)  1.18E+61     3.86E+215
256      8        2,048     65,536     16,777,216   1.16E+77           1.39E+122    (find other calculator)

Note 1: The value here is approximately the number of machine instructions executed by a 1 gigaflop computer in 5000 years, or 5 years on a current supercomputer (teraflop computer)

Note 2: The value here is about 500 billion times the age of the universe in nanoseconds, assuming a universe age of 20 billion years.
  1. Some people think that it doesn't really matter how complex an algorithm is because computers are getting so much faster.
  2. They are wrong.  It matters.
  3. See the table above --- a computer isn't going to do each operation in much less than a nanosecond, so those big numbers really are waiting times!
  4. Let's say that for each complexity class there's a largest problem size N we can handle in the time we're willing to wait around for our program to finish.
    1. The time we are waiting is always the same, but the size is different because the complexity is different.
    2. Call these sizes N1-N5, one for each column of the chart below.
  5. How much will faster computers help us?  This is a weird chart I got off the previous lecturer for this course 8 years ago, but it gives the general idea...  Suppose you can handle a problem of size N in a certain amount of time on a current computer; if your computer gets faster, how much bigger a problem can you handle in the same time?  It depends on the algorithm:


                               log2 n        n             n log2 n      n^2           2^n
    Current Computers          N1            N2            N3            N4            N5
    Ten times faster           N1 x 30       N2 x 10       N3 x 3        N4 x 3        N5 + 3
    Thousand times faster      N1 x 9,000    N2 x 1,000    N3 x 111      N4 x 31       N5 + 10
    Million times faster       N1 x 19E+6    N2 x 1E+6     N3 x 5,360    N4 x 1,000    N5 + 20

  6. You can see that there are no big wins once you have an exponential algorithm!
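If you want to check the arithmetic behind the chart yourself, here's a tiny sketch (my own, not from the lecture; the starting size and speedup factor are made up) for the columns that have a neat closed form:

    // Hedged sketch: how much bigger a problem fits in the same time on a machine
    // that is k times faster?  (n log n has no neat closed form, so it's skipped.)
    public class Speedup {
        public static void main(String[] args) {
            double n = 1000;   // hypothetical current problem size
            double k = 1000;   // hypothetical speedup factor

            System.out.println("linear,      n   : " + (n * k));                         // k times bigger
            System.out.println("quadratic,   n^2 : " + (n * Math.sqrt(k)));              // sqrt(k) times bigger
            System.out.println("exponential, 2^n : " + (n + Math.log(k) / Math.log(2))); // only log2(k) more items
        }
    }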

V. The Complexity of Our Sort Algorithms

For the algorithms you've already seen, I've linked to the pseudo code from Lecture 2.  You may want to open the links in another browser window (use the right mouse button) so you can look at the analysis & the algorithm at the same time.

i) `Real' Selection Sort

  1. Most expensive / basic operation is the comparison, if (sorta[searchIx] < min).
  2. How often do we do it?
    1. Amounts to: 
      for n do
          for 1, 2, 3, 4...n-1 do  (average value (n-1)/2)
    2. This is equivalent to the arithmetic series (1+2+3+4+...+(n-1)) == n*(n-1)/2 
    3. n*(n-1)/2 = 1/2*n^2 - 1/2*n  which means it's O(n^2)  -- n^2 dominates
  3. Caveat: the equation of that arithmetic series might be wrong --- I'm no mathematician, and the notes & web pages I've found don't agree with each other!  But it's something quite like that...  Anyway, that's what O notation is for, so we know what's really important!
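For reference, an in-place selection sort along these lines might look something like the sketch below (my own reconstruction --- only sorta, searchIx and min echo the Lecture 2 pseudo code):

    // Hedged sketch of the `real' (in-place) selection sort.
    static void selectionSort(int[] sorta) {
        for (int sortedIx = 0; sortedIx < sorta.length - 1; sortedIx++) {
            int minIx = sortedIx;
            int min = sorta[sortedIx];
            // scan the unsorted tail for the smallest remaining item:
            // n-1, then n-2, ... then 1 comparisons
            for (int searchIx = sortedIx + 1; searchIx < sorta.length; searchIx++) {
                if (sorta[searchIx] < min) {       // the dominant operation
                    min = sorta[searchIx];
                    minIx = searchIx;
                }
            }
            // swap the smallest item to the front of the unsorted part
            sorta[minIx] = sorta[sortedIx];
            sorta[sortedIx] = min;
        }
    }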

ii) Easy Selection Sort

  1. Remember the one I did with two arrays?
  2. How often do we do if (messy[searchIx] < min)?
    1. Amounts to: 
      for n do
          for n do
    2. n^2 comparisons, which means it's O(n^2)
  3. So no difference at least in O notation!  (though see the charts on this page I mentioned earlier --- these things do matter a little.)
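For completeness, a sketch of the two-array version (my own reconstruction, not the Lecture 2 pseudo code; I cross items off by overwriting them with Integer.MAX_VALUE, which assumes that value isn't in the data):

    // Hedged sketch of the `easy' selection sort with two arrays.
    static int[] easySelectionSort(int[] messy) {
        int[] sorted = new int[messy.length];
        for (int sortedIx = 0; sortedIx < sorted.length; sortedIx++) {
            int minIx = 0;
            // always scan the whole messy array: n comparisons on every pass
            for (int searchIx = 0; searchIx < messy.length; searchIx++) {
                if (messy[searchIx] < messy[minIx]) {
                    minIx = searchIx;
                }
            }
            sorted[sortedIx] = messy[minIx];
            messy[minIx] = Integer.MAX_VALUE;  // cross it off so it isn't picked again
        }
        return sorted;
    }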

iii) Insertion Sort

  1. How often do we do the comparison? (sorta[searchingIx] < currentItem)
  2. In the worst case (the list started in reverse order), every item has to be moved past all of the already-sorted items (see the sketch after this list).
    1. for n do
          for 1, 2, 3, 4...n-1 do...
    2. Look familiar?  Also O(n^2)
  3. So again, no difference at least in O notation!  
  4. Insertion & Selection sort are said to be algorithms in the same class, because effectively they take the same time.
  5. Here is my favorite sort algorithm in O(n^2) (yes, I have favorite sort algorithms. Yes, I'm a nerd! Nerd Pride)
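(Before we get to that favorite, here's a sketch of insertion sort itself --- my own reconstruction, since the Lecture 2 pseudo code is only linked above; only sorta, searchingIx and currentItem echo the names used in the analysis.)

    // Hedged sketch of insertion sort.
    static void insertionSort(int[] sorta) {
        for (int nextIx = 1; nextIx < sorta.length; nextIx++) {
            int currentItem = sorta[nextIx];
            int searchingIx = nextIx - 1;
            // shift sorted items right until we find where currentItem belongs
            while (searchingIx >= 0 && sorta[searchingIx] > currentItem) {
                sorta[searchingIx + 1] = sorta[searchingIx];
                searchingIx--;
            }
            sorta[searchingIx + 1] = currentItem;
        }
    }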

iv) Bubble Sort

  1. What you've hopefully noticed is that all of these sorts have an outer for loop as long as the array, and an inner loop that's almost as long as the array, so I'll just call the indices "out" and "in".
  2. In this one, you just swap / "bubble" any large numbers up out of the way!
    int[] sorta;      // this comes from somewhere in a messy state
    int temp;         // for the swap
    int outIx, inIx;  // these are indices

    // outer loop over the array -- when everything's sorted we're done
    for (outIx = sorta.length; outIx > 1; outIx--) {
        // from the beginning of the list to the end of the unsorted bit
        for (inIx = 1; inIx < outIx; inIx++) {
            // If adjacent items are out of order, swap them!
            if (sorta[inIx-1] > sorta[inIx]) {
                temp = sorta[inIx-1];
                sorta[inIx-1] = sorta[inIx];
                sorta[inIx] = temp;
            }
        }
    }
  3. How simple is that?  But we may as well do it this way since they're all O(n^2) anyway.
  4. Here's a drawing of what happens (I did one in class on the board too.)  Basically on each pass, the largest number will get carried up to the top of the unsorted area.
  5. Don't forget (this page is linked from the class page) that John Morris has sorting movies.

v) Quick Sort

  1. Quick sort is different (hence its name.)
  2. Actually, its worst case (array is already sorted backwards) is the same as the others.
  3. But, in the best case (random distribution), the pivot point will be at the middle, so the array keeps getting sliced in half (see the sketch after this list).
    1. If this is true, you will slice the array up log2 n times,
    2. For each time you do the slicing, you do n comparisons in total.
    3. So best case is exactly O(n log n).
    4. Apparently, the average case is 38% worse than this.
  4. My favorite O(n log n) sort is called Merge Sort.  (Yes, I have a favorite O(n log n) sort...)
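Before that, here's a minimal quick sort sketch (my own --- not the lecture's code; it uses the last element as the pivot, which the lecture doesn't specify).  Call it as quickSort(data, 0, data.length - 1):

    // Hedged sketch of quick sort, using the last element as the pivot.
    static void quickSort(int[] a, int lo, int hi) {
        if (lo >= hi) return;                    // 0 or 1 items: already sorted
        int pivot = a[hi];
        int split = lo;                          // everything below split is < pivot
        for (int i = lo; i < hi; i++) {
            if (a[i] < pivot) {                  // n comparisons per level of slicing
                int temp = a[i]; a[i] = a[split]; a[split] = temp;
                split++;
            }
        }
        int temp = a[split]; a[split] = a[hi]; a[hi] = temp;   // pivot into place
        quickSort(a, lo, split - 1);             // recursively sort the two slices
        quickSort(a, split + 1, hi);
    }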

vi) Merge Sort

  1. Split the array in halves recursively until each half has only one element.
  2. You can now assume the sub lists are sorted (1 number can't be unsorted.)
  3. Merge the two halves together by interleaving them:
    1. Run two indices, one from the beginning of each array.
    2. Whichever index points at the smaller number, take that number as the next in your merged array.
  4. This is guaranteed to be O(n log n), since you are certain to split things evenly.
  5. But empirically it's usually slower than quick sort --- probably because it uses twice as much memory.
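A sketch of merge sort along these lines (my own reconstruction; it allocates new arrays at every split, which is where the extra memory mentioned above goes):

    import java.util.Arrays;

    // Hedged sketch of merge sort.
    static int[] mergeSort(int[] a) {
        if (a.length <= 1) return a;              // one item can't be unsorted
        int[] left  = mergeSort(Arrays.copyOfRange(a, 0, a.length / 2));
        int[] right = mergeSort(Arrays.copyOfRange(a, a.length / 2, a.length));

        // merge: repeatedly take the smaller of the two items the indices point at
        int[] merged = new int[a.length];
        int l = 0, r = 0;
        for (int m = 0; m < merged.length; m++) {
            if (r >= right.length || (l < left.length && left[l] <= right[r])) {
                merged[m] = left[l++];
            } else {
                merged[m] = right[r++];
            }
        }
        return merged;
    }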

VI. What have we learned? 

  1. Complexity matters (again.)
  2. The sorts you learned come in two different classes, O(n^2) and O(n log n).
  3. A couple of easy sorts in each of these classes.
  4. Hopefully getting an intuition about how to recognize a log n algorithm (or algorithm component.)
  5. This strategy of chopping problems in half recursively to get a log n algorithm is often called divide and conquer.
  6. It's a good strategy!

page author: Joanna Bryson
16 February 2012