

Asymptotic Analysis

Thomas A. Anastasio

9 February 2000

Introduction

As a programmer, you often have a choice of data structures and algorithms. Choosing the best one for a particular job involves, among other factors, two important measures: the time complexity of the algorithm (how its running time grows with the size of the problem) and its space complexity (how its storage requirement grows).

You will sometimes seek a tradeoff between space and time complexity. For example, you might choose a data structure that requires a lot of storage in order to reduce the computation time. There is an element of art in making such tradeoffs, but you must make the choice from an informed point of view. You must have some verifiable basis on which to make the selection of a data structure or algorithm. Complexity analysis provides one such basis.

Complexity

Complexity refers to the rate at which storage or time grows as a function of the problem size (i.e., the size of the data on which an algorithm operates). The absolute growth depends on the machine used to execute the program, the compiler used to construct the program, and many other factors. You would expect most programs to run much faster on a supercomputer than on an old desktop PC (and that often turns out to be the case).

We would like to have a way of describing the inherent complexity of a program (or piece of a program), independent of machine/compiler considerations. This means that we must not try to describe the absolute time or storage needed. We must instead concentrate on a ``proportionality'' approach, expressing the complexity in terms of its relationship to some known function. This type of analysis is known as asymptotic analysis.

Asymptotic Analysis

Asymptotic analysis is based on the idea that as the problem size grows, the complexity will eventually settle down to a simple proportionality to some known function. This idea is incorporated in the ``Big Oh,'' ``Big Omega,'' and ``Big Theta'' notation for asymptotic performance. These notations are useful for expressing the complexity of an algorithm without getting lost in unnecessary detail.

Speaking very roughly, ``Big Oh'' relates to an upper bound on complexity, ``Big Omega'' relates to a lower bound, and ``Big Theta'' relates to a tight bound. While the ``bound'' idea may help us develop an intuitive understanding of complexity, we really need formal and mathematically rigorous definitions. You will see that having such definitions will make working with complexity much easier. Besides, the ``bounds'' terminology is really not quite correct; it's just a handy mnemonic device for those who know what asymptotic analysis is really about.

Definitions of Big-Oh, Big-Omega, and Big-Theta

$\Diamond$ Definition: $T(n) = O(f(n))$ if and only if there are constants $c_0$ and $n_0$ such that $T(n) \le c_0 f(n)$ for all $n \ge n_0$.

The expression $T(n) = O(f(n))$ is read as ``T of n is in BigOh of f of n,'' or ``T of n is in the set BigOh of f of n,'' or ``T of n is BigOh of f of n.''

The definition may seem a bit daunting at first, but it's really just a mathematical statement of the idea that eventually $T(n)$ becomes proportional to $f(n)$. The ``eventually'' part is captured by ``$n \ge n_0$;'' $n$ must be ``big enough'' for $T(n)$ to have settled into its asymptotic growth rate. The ``proportionality'' part is captured by $c_0$, the constant of proportionality.[1]

The definition is ``if and only if.'' If you can find the constants $c_0$ and $n_0$ such that $T(n) \le c_0 f(n)$ for all $n \ge n_0$, then $T(n) = O(f(n))$. Also, if $T(n) = O(f(n))$, then there are such constants. It works both ways. It may be helpful to recognize that $O(f(n))$ is a set. It's the set of all functions for which the constants exist and the inequality holds.
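As a small worked example (an illustration added here, not part of the original notes): take $T(n) = 3n^2 + 2n$ and $f(n) = n^2$. Choosing $c_0 = 5$ and $n_0 = 1$ satisfies the definition, since for all $n \ge 1$

\begin{displaymath}3n^2 + 2n \le 3n^2 + 2n^2 = 5n^2 \end{displaymath}

and therefore $3n^2 + 2n = O(n^2)$.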

If a function $T(n) = O(f(n))$, then eventually the value $c_0 f(n)$ will exceed the value of $T(n)$. ``Eventually'' means ``after $n$ exceeds some value.'' Does this really mean anything useful? We might say (correctly) that $x^2 + 2x = O(x^{25})$, but we don't get a lot of information from that; $x^{25}$ is simply too big. When we use BigOh analysis, we implicitly agree that the function $f(n)$ we choose is the smallest (i.e., slowest-growing) one that still satisfies the definition. It is correct and meaningful to say $x^2 + 2x = O(x^2)$; this tells us something about the growth pattern of the function $x^2 + 2x$, namely that the $x^2$ term will dominate the growth as $x$ increases.

While BigOh relates to upper bounds on growth, BigOmega relates to lower bounds. The definitions are similar, differing only in the direction of the inequality.

$\Diamond$ Definition: $T(n) = \Omega(f(n))$ if and only if there are constants $c_0$ and $n_0$ such that $T(n) \ge c_0 f(n)$ for all $n \ge n_0$.

In this case, the function $T(n)$ is proportional to $f(n)$ when $n$ is big enough, but the bound is from below, not above. As with BigOh, the expression $T(n) = \Omega(f(n))$ is read as ``T of n is BigOmega of f of n,'' or ``T of n is in BigOmega of f of n,'' or ``T of n is in the set BigOmega of f of n.'' $\Omega(f(n))$, like $O(f(n))$, is the set of functions for which the constants exist and the inequality holds.

Finally, we define BigTheta. It is defined in terms of BigOh and BigOmega. A function is in BigTheta if it's both in BigOh and in BigOmega.

$\Diamond$ Definition: $T(n) = \Theta(f(n))$ if and only if $T(n) = O(f(n))$ and also $T(n) = \Omega(f(n))$.

Intuitively this means that $f(n)$ forms both an upper and a lower bound on the function $T(n)$; it is a ``tight'' bound.
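Continuing the earlier worked example (again an illustration added here, not from the notes): for $T(n) = 3n^2 + 2n$ we also have $T(n) \ge 3n^2$ for all $n \ge 1$, so $T(n) = \Omega(n^2)$ with $c_0 = 3$ and $n_0 = 1$. Together with $T(n) = O(n^2)$ from before,

\begin{displaymath}3n^2 \le 3n^2 + 2n \le 5n^2 \;\;\mbox{for all}\;\; n \ge 1, \;\;\mbox{and therefore}\;\; 3n^2 + 2n = \Theta(n^2) \end{displaymath}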

Some Commonly Encountered Functions

The following functions are often encountered in computer science complexity analysis: constant ($O(1)$), logarithmic ($O(\lg n)$), linear ($O(n)$), ``n log n'' ($O(n\lg n)$), quadratic ($O(n^2)$), cubic ($O(n^3)$), and exponential ($O(2^n)$). We'll express them in terms of BigOh just to be specific. They apply equally to BigOmega and BigTheta.

The growth patterns above have been listed in order of increasing ``size.'' That is,

\begin{displaymath}O(1) = O(\lg(n)) = O(n) = O(n\lg(n)) = O(n^2) = O(n^3) = O(2^n) \end{displaymath}

It may appear strange to use the equals sign between two BigOh expressions. Since BigOh defines a set of functions, the notation $O(f) = O(g)$ means that the set of functions $O(f)$ is contained in the set of functions $O(g)$. Note that it is not true that if $g(n) = O(f(n))$ then $f(n) = O(g(n))$. The ``$=$'' sign does not mean equality in the usual algebraic sense.
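To get a feel for this ordering, here is a minimal Python sketch (added for illustration; the sample sizes and table format are arbitrary choices) that tabulates each of the common growth functions at a few problem sizes:

\begin{verbatim}
import math

# Common growth functions, in the order listed above.
growth = [
    ("1",       lambda n: 1),
    ("lg n",    lambda n: math.log2(n)),
    ("n",       lambda n: n),
    ("n lg n",  lambda n: n * math.log2(n)),
    ("n^2",     lambda n: n ** 2),
    ("n^3",     lambda n: n ** 3),
    ("2^n",     lambda n: 2 ** n),
]

for n in (10, 20, 50, 100):
    print("n =", n)
    for name, f in growth:
        print("  %-7s %.3g" % (name, f(n)))
\end{verbatim}

For $n = 100$ the last three entries are $10^4$, $10^6$, and about $1.3 \times 10^{30}$, which is the kind of separation the ordering above is describing.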

Best, Worst, and Average Cases

It's usually not enough to describe an algorithm's complexity by simply giving an expression for $O$, $\Omega$, or $\Theta$. Many computer science problems have different complexities depending on the data on which they work. Best, worst, and average cases are statements about the data, not about the algorithm.

The best case for an algorithm is that property of the data that results in the algorithm performing as well as it can. The worst case is that property of the data that results in the algorithm performing as poorly as possible. The average case is determined by averaging algorithm performance over all possible data sets. It is often very difficult to define the average data set.

Note that the worst case and best case do not correspond to upper bound and lower bound. A complete description of the complexity of an algorithm might be ``the time to execute this algorithm is in $O(n)$ in the average case.'' Both the complexity and the property of the data must be stated. A famous example of this is given by the quicksort algorithm for sorting an array of data. Quicksort is in $O(n^2)$ worst case, but is in $O(n \lg n)$ best and average cases. The worst case for quicksort occurs when the array is already sorted (i.e., the data have the property of being sorted).
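To make the quicksort example concrete, here is a minimal sketch (added for illustration, not taken from the notes) of a quicksort that always uses the first element as its pivot; with that naive pivot choice, an already-sorted array produces the most unbalanced partitions and hence the $O(n^2)$ behavior described above:

\begin{verbatim}
def quicksort(a):
    """Naive quicksort using the first element as the pivot."""
    if len(a) <= 1:
        return a
    pivot, rest = a[0], a[1:]
    smaller = [x for x in rest if x < pivot]
    larger  = [x for x in rest if x >= pivot]
    # On sorted input, 'smaller' is always empty, so each call peels off
    # one element and the comparisons total (n-1) + (n-2) + ... + 1 = O(n^2).
    return quicksort(smaller) + [pivot] + quicksort(larger)
\end{verbatim}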

One of the all-time great theoretical results of computer science is that any sorting algorithm that uses comparisons must take at least $n \lg n$ steps. That means that any such sorting algorithm, including quicksort, must be in $\Omega(n \lg n)$. Since quicksort is in $O(n \lg n)$ in best and average cases, and also in $\Omega(n \lg n)$, it must be in $\Theta(n \lg n)$ in best and average cases.

Example: List Implementation of Queue

Consider a simple list implementation of queue. In this implementation elements are put on the queue at the end of the list and taken off the queue at the head. Since a list requires sequential access to its elements, each enqueue operation requires traversing the entire list to find the end, then ``attaching'' the new item at the end. What is the asymptotic time complexity of this operation as a function of $n$, the number of items in the queue? First of all, to find the end of the list, we must traverse the entire list. This takes $n$ operations (one ``next'' operation for each item on the list). Therefore, a single enqueue operation has asymptotic complexity $T(n) = O(n)$. But, suppose we enqueue $n$ items, one after another? The asymptotic complexity will be $T(n) = O(n^2)$.
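Here is a minimal sketch of such a list-based queue (an illustration added here; the notes do not prescribe a particular implementation, and the class and method names are arbitrary). The enqueue method must walk the whole list to find the end, which is where the $O(n)$ cost per operation comes from:

\begin{verbatim}
class Node:
    def __init__(self, value):
        self.value = value
        self.next = None

class ListQueue:
    """Queue backed by a singly linked list with no tail pointer."""
    def __init__(self):
        self.head = None

    def enqueue(self, value):
        node = Node(value)
        if self.head is None:
            self.head = node
            return
        current = self.head
        while current.next is not None:   # walk to the end: O(n)
            current = current.next
        current.next = node

    def dequeue(self):
        node = self.head                  # remove from the head: O(1)
        self.head = node.next             # (assumes the queue is not empty)
        return node.value
\end{verbatim}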

You may object that the list is not $n$ long until all $n$ items have been enqueued. Look at it this way: the first item takes one operation, the second takes two operations, the third takes three operations, and so on. Therefore, the total number of operations to enqueue $n$ items is

\begin{displaymath}1 + 2 + 3 + \ldots + n \end{displaymath}

We can express this as

\begin{displaymath}\sum_{i=1}^n{i} = \frac{n(n+1)}{2} = \frac{n^2 + n}{2} = O(n^2) \end{displaymath}

A better implementation would provide $O(1)$ performance for each enqueue operation, thereby allowing $n$ items to be enqueued in $O(n)$ time. This is significantly better than $O(n^2)$; for $n = 100$, for instance, that is on the order of 100 operations rather than 10,000.
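One common way to get that $O(1)$ enqueue (a sketch under the same assumptions as the previous one, not taken from the notes) is to keep an extra reference to the last node, so enqueue no longer traverses the list:

\begin{verbatim}
class FastListQueue:
    """Queue with both head and tail references: O(1) enqueue."""
    def __init__(self):
        self.head = None
        self.tail = None

    def enqueue(self, value):
        node = Node(value)                # Node class from the earlier sketch
        if self.tail is None:
            self.head = self.tail = node
        else:
            self.tail.next = node         # attach at the end: O(1)
            self.tail = node

    def dequeue(self):
        node = self.head                  # (assumes the queue is not empty)
        self.head = node.next
        if self.head is None:
            self.tail = None
        return node.value
\end{verbatim}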

Theorems Involving BigOh


$\Diamond$ Theorem 1: $O(cf(x)) = O(f(x))$ (constants don't matter).
$\rhd$ Proof: $T(x) = O(cf(x))$ implies that there are constants $c_0$ and $n_0$ such that $T(x) \leq c_0(cf(x))$ when $x \geq n_0$.
Therefore, $T(x) \leq c_1f(x)$ when $x \geq n_0$, where $c_1 = c_0 c$.
Therefore, $T(x) = O(f(x))$.
$\lhd$


$\Diamond$ Theorem 2: Let $T_1(n) = O(f(n))$ and $T_2(n) = O(g(n))$. Then $T_1(n) + T_2(n) = O(\max(f(n), g(n)))$ (the sum rule).
$\rhd$ Proof: From the definition of BigOh, $T_1(n) \leq c_1f(n)$ for $n \geq n_1$ and $T_2(n) \leq c_2g(n)$ for $n \geq n_2$.
Let $n_0 = \max(n_1, n_2)$.
Then, for $n \geq n_0$, $T_1(n) + T_2(n) \leq c_1f(n) + c_2g(n)$.
Let $c_3 = \max(c_1,c_2)$.
Then

\begin{eqnarray*}
T_1(n) + T_2(n) & \leq & c_3f(n) + c_3g(n)\\
& \leq & 2c_3 \max(f(n), g(n))\\
& = & c \max(f(n), g(n)), \;\;\mbox{where}\;\; c = 2c_3\\
\end{eqnarray*}

Therefore, $T_1(n) + T_2(n) = O(\max(f(n), g(n)))$.
$\lhd$
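For example (an illustration added here): if one phase of an algorithm takes $T_1(n) = O(n^2)$ time and a second phase takes $T_2(n) = O(n\lg n)$ time, the sum rule says the two phases together take

\begin{displaymath}T_1(n) + T_2(n) = O(\max(n^2, n\lg n)) = O(n^2) \end{displaymath}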


$\Diamond$ Theorem 3: If $T(n)$ is a polynomial of degree $x$, then $T(n) = O(n^x)$.
$\rhd$ Proof: $T(n) = a_xn^x + a_{x-1}n^{x-1} + \cdots + a_0$ is a polynomial of degree $x$.
By the sum rule (Theorem 2), the largest term dominates, and by Theorem 1 its constant coefficient doesn't matter.
Therefore, $T(n) = O(n^x)$.
$\lhd$

As an example, it can be shown that in BubbleSort, SelectionSort, and InsertionSort, the worst case number of comparisons is $\frac{n(n-1)}{2}$. What is $O(\frac{n(n-1)}{2})$? Well, $\frac{n(n-1)}{2} = \frac{n^2 - n}{2}$, which is a polynomial. Since constants don't matter (Theorem 1), $O(\frac{n^2 - n}{2}) = O(n^2 - n)$. By the sum rule (Theorem 2), $O(n^2 - n) = O(n^2)$.
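As a concrete check (a sketch added for illustration; the code and instrumentation are not from the original notes), here is a SelectionSort instrumented to count comparisons. For SelectionSort the count is exactly $\frac{n(n-1)}{2}$ on every input, which matches the worst-case figure quoted above:

\begin{verbatim}
def selection_sort(a):
    """Sort a in place and return the number of comparisons made."""
    comparisons = 0
    n = len(a)
    for i in range(n):
        smallest = i
        for j in range(i + 1, n):
            comparisons += 1              # one comparison per pair (i, j)
            if a[j] < a[smallest]:
                smallest = j
        a[i], a[smallest] = a[smallest], a[i]
    return comparisons

for n in (10, 100, 1000):
    data = list(range(n, 0, -1))          # any input gives the same count
    assert selection_sort(data) == n * (n - 1) // 2
\end{verbatim}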


$\Diamond$ Theorem 4: If $T_1(n) = O(f(n))$ and $T_2(n) = O(g(n))$, then $T_1(n)T_2(n) = O(f(n)g(n))$ (the product rule).
$\rhd$ Proof: From the definition of BigOh, $T_1(n) \leq c_1f(n)$ for $n \geq n_1$ and $T_2(n) \leq c_2g(n)$ for $n \geq n_2$.
Then, for $n \geq n_0 = \max(n_1, n_2)$, $T_1(n)T_2(n) \leq c_1c_2f(n)g(n) = cf(n)g(n)$, where $c = c_1c_2$.
Therefore, $T_1(n)T_2(n) = O(f(n)g(n))$.
$\lhd$
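For example (again an added illustration): if an algorithm performs an $O(\lg n)$ binary search once for each of $n$ items, then taking $T_1(n) = O(n)$ for the number of searches and $T_2(n) = O(\lg n)$ for the cost of each, the product rule gives a total cost of

\begin{displaymath}T_1(n)T_2(n) = O(n\lg n) \end{displaymath}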

The proofs of Theorems 5 and 6, below, use L'Hospital's rule. Recall that this rule pertains to finding the limit of a ratio of two functions as the independent variable approaches some value. When both functions grow without bound (the indeterminate form $\infty/\infty$), the rule states that

\begin{displaymath}\lim_{x \rightarrow \infty}\frac{f(x)}{g(x)} =
\lim_{x \rightarrow \infty}\frac{f'(x)}{g'(x)} \end{displaymath}

where $f'(x)$ and $g'(x)$ denote the first derivatives with respect to $x$.

We use L'Hospital's rule to determine $O$ or $\Omega$ ordering of two functions.

\begin{displaymath}f(x) = O(g(x))\;\;\; {\rm if}\;\;\; \lim_{x \rightarrow \infty}\frac{f(x)}{g(x)} = 0 \end{displaymath}


\begin{displaymath}f(x) = \Omega(g(x))\;\;\; {\rm if}\;\;\; \lim_{x \rightarrow \infty}\frac{g(x)}{f(x)} = 0 \end{displaymath}
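These criteria can also be checked numerically. The following sketch (added for illustration; the particular functions and sample points are arbitrary choices, and a decreasing ratio is evidence, not a proof) evaluates $f(x)/g(x)$ at increasing $x$ for the two comparisons treated in Theorems 5 and 6 below:

\begin{verbatim}
import math

def ratio_table(label, f, g, xs):
    """Print f(x)/g(x) for increasing x; a ratio heading toward zero
    is numerical evidence (not a proof) that f(x) = O(g(x))."""
    print(label)
    for x in xs:
        print("  x = %-10g ratio = %.3e" % (x, f(x) / g(x)))

# (lg x)^3 versus x   -- cf. Theorem 5 below (with k = 3)
ratio_table("(lg x)^3 / x",
            lambda x: math.log2(x) ** 3, lambda x: x,
            (10, 10**3, 10**6, 10**9))

# x^10 versus 1.1^x   -- cf. Theorem 6 below (with k = 10, a = 1.1)
ratio_table("x^10 / 1.1^x",
            lambda x: x ** 10, lambda x: 1.1 ** x,
            (100, 300, 500, 1000))
\end{verbatim}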


$\Diamond$ Theorem 5: $\lg^kn = O(n)$ for any positive constant $k$.
$\rhd$ Proof: Note that $\lg^kn$ means $(\lg n)^k$
We must show $\lg^kn \leq cn$ for $n \geq n_0$. Equivalently, we can show $\lg n \leq cn^{\frac{1}{k}}$. Letting $a = \frac{1}{k}$, we will show that $\lg n = O(n^a)$, for any positive constant $a$.  Use L'Hospital's rule:

\begin{displaymath}
\lim_{n \rightarrow \infty}\frac{\lg n}{cn^a} =
\lim_{n \rightarrow \infty}\frac{1/(n\ln 2)}{acn^{a-1}} =
\lim_{n \rightarrow \infty}\frac{c_2}{n^a} = 0
\end{displaymath}

$\lhd$

As a really outrageous (but true) example, $\lg^{1000000}(n) = O(n)$. $\lg^k n$ grows very slowly!


$\Diamond$ Theorem 6: $n^k = O(a^n)$ for $a > 1$ (no polynomial is bigger than an exponential).
$\rhd$ Proof: Use L'Hospital's rule

\begin{eqnarray*}
\lim_{n \rightarrow \infty}\frac{n^k}{a^n} & = &
\lim_{n \rightarrow \infty}\frac{kn^{k-1}}{a^n\ln a}\\
& = & \lim_{n \rightarrow \infty}\frac{k(k-1)n^{k-2}}{a^n\ln^2 a}\\
& \vdots & \\
& = & \lim_{n \rightarrow \infty}\frac{k(k-1)\cdots 1}{a^n\ln^k a}\\
& = & 0\\
\end{eqnarray*}




$\lhd$

As a really outrageous (but true) example, $n^{1000000} = O(1.00000001^n)$. Exponentials grow very rapidly!


Footnotes

[1] To be mathematically precise about the notation, we should point out that it's the absolute value of the functions that is being compared. Since the time or storage of computer science problems is always positive, we can neglect this nicety.
