4.4 The Principle of Mathematical Induction

Mathematical induction is a key method of proof which follows from a fundamental property of the natural numbers called the Principle of Mathematical Induction. You may have seen proofs by induction before, especially to prove statements about summation or divisibility22 2 You can see some of these kinds of examples in the Companion notes.. Here, we will study the method more formally and apply it to a much wider set of problems.

In the Companion, we saw an analogy for induction as a series of dominoes: if we can prove that one domino will fall (a statement P(k)P(k) is true) and that the falling domino will cause the next domino to fall (the statement P(k)P(k) implies P(k+1)P(k+1)) then we have shown that the statement P(n)P(n) must be true for all nkn\geq k.

Definition 4.29 (The Principle of Mathematical Induction).

Given a list of statements P(k),P(k+1),P(k),P(k+1),..., we may conclude that P(n)P(n) is true for every integer nkn\geqslant k provided that

  • we know that P(k)P(k) is true (the base case)

  • and we can prove that P(n)P(n+1)P(n)\Rightarrow P(n+1) for any integer nkn\geqslant k (the induction step).

A full written proof by induction has the following four parts clearly separated.

  1. 1.

    A clear and explicit statement of the induction hypothesis P(n)P(n) and a stated intention to prove that P(n)P(n) is true for all nkn\geqslant k by induction.

  2. 2.

    The base case. Typically we prove a single statement P(k)P(k).

  3. 3.

    The induction step. This normally involves assuming P(n)P(n) is true and then reasoning that P(n+1)P(n+1) must also be true.

  4. 4.

    A conclusion that steps (2) and (3) mean that P(n)P(n) is true for all natural numbers nkn\geqslant k by the principle of mathematical induction.

Mathematics written by and for experts might skip some of the detail, for example saying “the base case is trivial” and not proving it in depth. For now, in this course and during the beginning of your degree, your proofs must be written systematically using all four parts of the format described above.

As an example, we now prove the following generalisation of Theorem 4.26.

Theorem 4.30 (Generalised De Morgan’s Laws).

Let A1,,AnA_{1},\cdots,A_{n} be nn sets where nn\in\mathbb{N}. Then

  1. (a)

    (i=1nAi)c=i=1nAic\displaystyle\left(\bigcup_{i=1}^{n}A_{i}\right)^{c}=\bigcap_{i=1}^{n}A_{i}^{c},

  2. (b)

    (i=1nAi)c=i=1nAic\displaystyle\left(\bigcap_{i=1}^{n}A_{i}\right)^{c}=\bigcup_{i=1}^{n}A_{i}^{c}.

Proof (for (a)).

We proceed via induction on nn\in\mathbb{N}. Let P(n)P(n) be the statement (i=1nAi)c=i=1nAic\displaystyle\left(\bigcup_{i=1}^{n}A_{i}\right)^{c}=\bigcap_{i=1}^{n}A_{i}^{c}.

For the base case, first consider P(1)P(1):(i=11Ai)c=i=11Aic:\left(\bigcup_{i=1}^{1}A_{i}\right)^{c}=\bigcap_{i=1}^{1}A_{i}^{c}. The union or intersection of one set is the set itself so the left hand side of this equation is (A1)c(A_{1})^{c} while the right hand side is A1cA_{1}^{c}. The brackets make no difference here so we have proven the base case P(1)P(1) to be true.

For the induction step, to prove P(k)P(k) implies P(k+1)P(k+1), we suppose that P(k):(i=1kAi)c=i=1kAicP(k):\left(\bigcup_{i=1}^{k}A_{i}\right)^{c}=\bigcap_{i=1}^{k}A_{i}^{c} holds for some kk\in\mathbb{N}.

Let B=i=1kAiB=\bigcup_{i=1}^{k}A_{i}. Then,

(i=1k+1Ai)c\displaystyle\left(\bigcup_{i=1}^{k+1}A_{i}\right)^{c} =(BAk+1)c\displaystyle=\left(B\cup A_{k+1}\right)^{c}\quad by definition of union
=BcAk+1c\displaystyle=B^{c}\cap A_{k+1}^{c} by Theorem 4.26
=(i=1kAi)cAk+1c\displaystyle=\left(\bigcup_{i=1}^{k}A_{i}\right)^{c}\cap A_{k+1}^{c} by definition of BB
=(i=1kAic)Ak+1c\displaystyle=\left(\bigcap_{i=1}^{k}A_{i}^{c}\right)\cap A_{k+1}^{c} by inductive assumption P(k)P(k)
=i=1k+1Aic\displaystyle=\bigcap_{i=1}^{k+1}A_{i}^{c} by definition of intersection.

This shows that P(k+1)P(k+1) holds if P(k)P(k) holds so, by the principle of mathematical induction, we have shown P(n)P(n) holds for all nn\in\mathbb{N}.

Exercise 4.31.

Adapt this proof in order to prove part (b) of Theorem 4.30. Remember to make sure the four different parts of the induction proof are clear.

Induction proofs are not only used to prove results about sets. They are used in many different areas of maths.

Example 4.32.

Claim: For every nn\in\mathbb{N}, the sum of the first nn positive odd integers equals n2n^{2}.

Proof: We prove by induction on nn. Let P(n)P(n) be the statement “k=1n(2k1)=n2\sum_{k=1}^{n}(2k-1)=n^{2}”. The sum of the first odd number, 11, equals 121^{2} so that k=11(2k1)=12\sum_{k=1}^{1}(2k-1)=1^{2} and so P(1)P(1) is true. Assume that P(n)P(n) is true for some nn\in\mathbb{N}. Then

k=1n+1(2k1)\displaystyle\sum_{k=1}^{n+1}(2k-1) =2(n+1)1+k=1n(2k1)\displaystyle=2(n+1)-1+\sum_{k=1}^{n}(2k-1)
=n2+2n+1=(n+1)2.\displaystyle=n^{2}+2n+1=(n+1)^{2}.

Hence P(n+1)P(n+1) is also true. Since P(1)P(1) holds and P(n)P(n+1)P(n)\Rightarrow P(n+1), it follows that P(n)P(n) is true for all nn\in\mathbb{N} by the principal of mathematical induction. \square

Example 4.33.

For every nn\in\mathbb{N}, we define n!=n(n1)(n2)21n!=n(n-1)(n-2)\cdots 2\cdot 1 and we read n!n! as “nn factorial”.

For which nn\in\mathbb{N} do we have 2n<n!2^{n}<n!? If we plug in n=1,2,3n=1,2,3 this inequality fails, but for n4n\geq 4 it starts to hold. This gives rise to the following conjecture:

Claim: 2n<n!2^{n}<n! for every integer n4n\geq 4.

Proof: We prove by induction on nn. Let P(n)P(n) be the statement “2n<n!2^{n}<n!”.

  • Base case: For the base case, we consider when n=4n=4. Since 24=16<4!=242^{4}=16<4!=24, we conclude that P(4)P(4) is true.

  • Induction Step: Assume P(n)P(n) is true, i.e. 2n<n!2^{n}<n! for some integer n4n\geq 4. Then

    2n+1=2n2<n!2<n!(n+1)=(n+1)!.2^{n+1}={2^{n}}\cdot 2<{n!}\cdot 2<n!\cdot(n+1)=(n+1)!.

Since we have shown P(4)P(4) is true and P(n)P(n+1)P(n)\implies P(n+1) for every positive integer nn, we conclude that P(n)P(n) holds for every integer n4n\geqslant 4. \square

Example 4.34.

For this example, we need to introduce some definitions.

A polygon is a two-dimensional geometrical object consisting of a number of points (vertices), and the same number of straight line segments connecting them (edges), so that the set of edges forms the boundary of precisely one bounded region (face). A polygon is convex if every straight line segment joining any pair of vertices lies wholly within (or on the boundary of) the polygon.

A straight line segment connecting two vertices that are not adjacent to each other in an n-vertex polygon is called a diagonal of that polygon.

Claim: Every nn-vertex convex polygon has n(n3)/2n(n-3)/2 diagonals, for all integers n3n\geqslant 3.

Proof: Let P(n)P(n) denote the statement “Every nn-vertex convex polygon has n(n3)/2n(n-3)/2 diagonals.” We will prove that P(n)P(n) is true for every integer n3n\geqslant 3 by induction on nn.

For the base case, observe that every 33-vertex convex polygon is a triangle. Since every pair of vertices is adjacent in a triangle, there are no diagonals. Hence, every 33-vertex convex polygon has 0=3(33)/20=3(3-3)/2 diagonals, which means that P(3)P(3) is true.

For the induction step, let’s assume that P(n)P(n) is true for some integer n3n\geqslant 3. Now consider a general (n+1)(n+1)-vertex convex polygon AA and some vertex vv belonging to AA. Then vv has some pair of vertices, say uu and ww, that vv is adjacent to in AA. Suppose we remove vertex vv and its adjoining line segments vuvu and vwvw connecting vv to uu and ww respectively from AA and connect uu and ww with a line segment uwuw. Let BB be the nn-vertex polygon that remains. Note that BB is also convex since any line segment connecting a pair of vertices in BB also connects a pair of vertices in AA and therefore must lie completely in AA by its convexity. So if there was a line segment connecting a pair of vertices in BB that did not lie wholly in BB, it would have to cross the section of AA that we cut off to form BB. This would only be possible if that line segment ended with the vertex vv, which is not in BB. Therefore, by P(n)P(n), BB has n(n3)/2n(n-3)/2 diagonals. If we add vv and the line segments vuvu and vwvw back to BB to form AA, we introduce n2n-2 new diagonals, one for each vertex of BB that is not adjacent to vv, and uwuw becomes an additional diagonal of AA. Also, every diagonal of BB is also a diagonal of AA. Hence, in total, AA has

n(n3)2+n2+1=(n+1)((n+1)3)2,\frac{n(n-3)}{2}+n-2+1=\frac{(n+1)((n+1)-3)}{2},

diagonals. Thus P(n+1)P(n+1) holds.

Therefore, since we have shown that P(3)P(3) is true and P(n)P(n+1)P(n)\implies P(n+1) for every positive integer nn, we conclude that P(n)P(n) holds for every integer n3n\geqslant 3. \square

Notice that, as the examples above show, the base case does not always have to consider when n=1n=1. The base case just involves the smallest value of nn for which P(n)P(n) is true.

4.4.1 Strong Mathematical Induction

Sometimes, in the induction step, it’s not enough to know that P(n)P(n) is true in order to prove that P(n+1)P(n+1) is true; we might additionally need to use that some of the "previous" statements P(k),P(k+1),,P(n1)P(k),P(k+1),\dots,P(n-1) are true.

Definition 4.35 (Principle of Strong Mathematical Induction).

Let kk\in\mathbb{N} and let P(k),P(k+1),P(k),P(k+1),... be statements. Suppose that

  • P(k)P(k) is true, (base case(s) - we may require more than one base case in a strong induction)

  • and for any integer nkn\geqslant k, all the statements P(k),P(k+1),P(k+2),,P(n)P(k),P(k+1),P(k+2),\ldots,P(n) taken together imply P(n+1)P(n+1) (induction step).

Then P(n)P(n) is true for every positive integer nkn\geqslant k.

We saw a proof of the following theorem as part of the Fundamental Theorem of Arithmetic in B1 Numbers, but there we used a contradiction argument. We can also prove this result using strong induction.

Theorem 4.36.

Every integer n2n\geqslant 2 can be written as a product of primes n=p1pkn=p_{1}\cdots p_{k}.

If nn is prime, this statement still makes sense: we just interpret nn as being the product of just one number, nn itself.

Proof.

We prove by strong induction on nn. Let P(n)P(n) denote the statement “There exist kk\in\mathbb{N} and primes p1,,pkp_{1},\ldots,p_{k} such that n=p1pkn=p_{1}\cdots p_{k}.

The base case n=2n=2 immediately holds since 2 is prime.

For the induction step, let N3N\geqslant 3 be an integer and suppose that P(n)P(n) holds for all integers nn satisfying 2n<N2\leqslant n<N (this is our strong inductive hypothesis). If NN is prime, there is nothing to prove; otherwise, if NN is not prime, then N=abN=a\cdot b for some positive integers aa and bb greater than 11 and smaller than NN. Both aa and bb can be decomposed into a product of primes by the strong induction hypothesis, thus so can N=abN=ab. Hence P(N)P(N) holds.

Therefore, since P(2)P(2) is true and we’ve shown that, for every integer N3N\geqslant 3, P(N)P(N) follows from the statements P(n)P(n) for all integers nn satisfying 2n<N2\leqslant n<N, we conclude by strong induction that P(n)P(n) holds for all integers N2N\geqslant 2.

Another situation which often requires strong induction is when studying recurrence relations. A recurrence relation is a sequence a1,a2,a_{1},a_{2},... of numbers where each ana_{n} is defined in terms of previous terms in the sequence. The most famous recurrence relation is the Fibonacci sequence, which is defined as f1=1f_{1}=1, f2=1f_{2}=1 and for n>2n>2, fn=fn1+fn2f_{n}=f_{n-1}+f_{n-2}.

Example 4.37.

Let x1=1,x2=3x_{1}=1,x_{2}=3 and for n2n\geq 2, let

xn+1=4xn3xn1.x_{n+1}=4x_{n}-3x_{n-1}.

Let’s try and find a general pattern for what xnx_{n} is. If we test the first few values, we get

x1=1,x2=3,x3=4331=9,x4=4933=27x_{1}=1,\;\;x_{2}=3,\;\;x_{3}=4\cdot 3-3\cdot 1=9,\;\;x_{4}=4\cdot 9-3\cdot 3=27

so it seems like xn=3n1x_{n}=3^{n-1}. We will prove this by strong induction.

Let P(n)P(n) be the statement "xn=3n1x_{n}=3^{n-1}".

Let’s first prove the first two base cases. Indeed, x1=1=311x_{1}=1=3^{1-1} and x2=3=321x_{2}=3=3^{2-1}, so P(1)P(1) and P(2)P(2) are true.

Now suppose P(k)P(k) is true for k=1,2,,nk=1,2,...,n for some n2n\geq 2. Then

xn+1=4xn3xn1=43n13xn1=43n133n2=3n=3(n+1)1.x_{n+1}=4x_{n}-3x_{n-1}=4\cdot 3^{n-1}-3x_{n-1}=4\cdot 3^{n-1}-3\cdot 3^{n-2}=% 3^{n}=3^{(n+1)-1}.

This proves that P(n+1)P(n+1) is true, and thus xn=3n1x_{n}=3^{n-1} for all nn by strong induction.

Exercise 4.38.

Why did we need to prove two base cases here? Why did we use strong induction instead of standard induction?

Solution (please try for yourself before looking)

Note that the formula xn+1=4xn3xn1x_{n+1}=4x_{n}-3x_{n-1} holds only for n2n\geq 2, so in particular, we can’t use it to prove P(2)P(2) (i.e. that x2=321x_{2}=3^{2-1}). For this reason, we need to prove a few more base cases that can’t be covered in the induction step.

Both P(n)P(n) and P(n1)P(n-1) need to be true to prove P(n+1)P(n+1), but with standard induction, we would need to prove P(n+1)P(n+1) follows from P(n)P(n) alone.

4.4.2 When induction fails

It can be very easy to write something that looks like an induction proof but is actually invalid. Sometimes these errors can be very subtle.

Exercise 4.39.

Spot the flaws in the following argument:

Let P(n)P(n) be the statement that “1.1!+2.2!+3.3!++n.n!=(n+1)!1.1!+2.2!+3.3!+\cdots+n.n!=(n+1)!”.

Assume P(k)P(k) holds for some kk\in\mathbb{N}, then

11!++kk!+(k+1)(k+1)!\displaystyle 1\cdot 1!+\cdots+k\cdot k!+(k+1)(k+1)! =(k+1)!+(k+1)(k+1)!\displaystyle=(k+1)!+(k+1)(k+1)!\quad by assumption
=(k+1)!(k+2)\displaystyle=(k+1)!(k+2) by rearranging
=(k+2)!\displaystyle=(k+2)! by definition of n!n!

Therefore, we can conclude P(n)P(n) holds for all nn\in\mathbb{N}.

Solution (please try for yourself before looking)

In this argument, the induction step is clear and valid. However, there is no base case. Checking P(1)P(1), we find 121\neq 2 which means this proof cannot hold.

Exercise 4.40.

Spot the flaws in the following argument:

Define S(n)S(n) to be the statement “For any collection of nn lines in the plane, if no two are parallel then all lines intersect at one point”.

The base case S(1)S(1) holds immediately.

Assume S(k)S(k) for some kk\in\mathbb{N} so that any collection of kk lines (no two of which are parallel) intersect at a single point. Then consider having k+1k+1 lines, where no two are parallel. The first kk lines must intersect at a single point PP by the inductive hypothesis. Moreover, the last kk lines also intersect at a single point QQ. Two non-parallel lines intersect at a single point so P=QP=Q.

Therefore S(n)S(n) holds for all nn\in\mathbb{N}.

Solution (please try for yourself before looking)

The error here is more subtle. The base case is valid for P(1)P(1) and the induction step shows P(n)P(n+1)P(n)\implies P(n+1), but only for n3n\geq 3 because you need the same two lines to be present in the first kk lines and the last kk lines to conclude that P=QP=Q. So we are missing two additional base cases: P(2)P(2) (which is true) and P(3)P(3) which is false.

Exercise 4.41.

Spot the flaws in the following argument:

We will prove by induction on nn that an=1a^{n}=1 whenever nn is a non-negative integer and aa is a non-zero real number.

Let P(n)P(n) be the statement “an=1a^{n}=1 for every non-zero aa\in\mathbb{R}”.

As a0=1a^{0}=1, P(0)P(0) is true.

Assume P(k)P(k) holds. Then

ak+1=akakak1=111=1.a^{k+1}=\frac{a^{k}\cdot a^{k}}{a^{k-1}}=\frac{1\cdot 1}{1}=1.

Hence P(k+1)P(k+1) is true.

Therefore, P(n)P(n) holds for all values nn\in\mathbb{N}.

Solution (please try for yourself before looking)

This is an example where the induction step fails from too few base cases. Here, the induction step uses strong induction to assume both ak=1a^{k}=1 and ak1=1a^{k-1}=1. But we have only shown that one base case P(0)P(0) is true. So in order to rely on P(k1)P(k-1) and P(k)P(k) to show P(k+1)P(k+1), we would need to show P(1)P(1) is true too. But clearly, a1=1a^{1}=1 for all non-zero aa\in\mathbb{R} is false. So the proof fails.