4.4 The Principle of Mathematical Induction
Mathematical induction is a key method of proof which follows from a fundamental property of the natural numbers called the Principle of Mathematical Induction. You may have seen proofs by induction before, especially to prove statements about summation or divisibility22 2 You can see some of these kinds of examples in the Companion notes.. Here, we will study the method more formally and apply it to a much wider set of problems.
In the Companion, we saw an analogy for induction as a series of dominoes: if we can prove that one domino will fall (a statement is true) and that the falling domino will cause the next domino to fall (the statement implies ) then we have shown that the statement must be true for all .
Definition 4.29 (The Principle of Mathematical Induction).
Given a list of statements , we may conclude that is true for every integer provided that
-
•
we know that is true (the base case)
-
•
and we can prove that for any integer (the induction step).
A full written proof by induction has the following four parts clearly separated.
-
1.
A clear and explicit statement of the induction hypothesis and a stated intention to prove that is true for all by induction.
-
2.
The base case. Typically we prove a single statement .
-
3.
The induction step. This normally involves assuming is true and then reasoning that must also be true.
-
4.
A conclusion that steps (2) and (3) mean that is true for all natural numbers by the principle of mathematical induction.
Mathematics written by and for experts might skip some of the detail, for example saying “the base case is trivial” and not proving it in depth. For now, in this course and during the beginning of your degree, your proofs must be written systematically using all four parts of the format described above.
As an example, we now prove the following generalisation of Theorem 4.26.
Theorem 4.30 (Generalised De Morgan’s Laws).
Let be sets where . Then
-
(a)
,
-
(b)
.
Proof (for (a)).
We proceed via induction on . Let be the statement .
For the base case, first consider . The union or intersection of one set is the set itself so the left hand side of this equation is while the right hand side is . The brackets make no difference here so we have proven the base case to be true.
For the induction step, to prove implies , we suppose that holds for some .
Let . Then,
| by definition of union | ||||
| by Theorem 4.26 | ||||
| by definition of | ||||
| by inductive assumption | ||||
| by definition of intersection. |
This shows that holds if holds so, by the principle of mathematical induction, we have shown holds for all .
Exercise 4.31.
Adapt this proof in order to prove part (b) of Theorem 4.30. Remember to make sure the four different parts of the induction proof are clear.
Induction proofs are not only used to prove results about sets. They are used in many different areas of maths.
Example 4.32.
Claim: For every , the sum of the first positive odd integers equals .
Proof: We prove by induction on . Let be the statement “”. The sum of the first odd number, , equals so that and so is true. Assume that is true for some . Then
Hence is also true. Since holds and , it follows that is true for all by the principal of mathematical induction.
Example 4.33.
For every , we define and we read as “ factorial”.
For which do we have ? If we plug in this inequality fails, but for it starts to hold. This gives rise to the following conjecture:
Claim: for every integer .
Proof: We prove by induction on . Let be the statement “”.
-
•
Base case: For the base case, we consider when . Since , we conclude that is true.
-
•
Induction Step: Assume is true, i.e. for some integer . Then
Since we have shown is true and for every positive integer , we conclude that holds for every integer .
Example 4.34.
For this example, we need to introduce some definitions.
A polygon is a two-dimensional geometrical object consisting of a number of points (vertices), and the same number of straight line segments connecting them (edges), so that the set of edges forms the boundary of precisely one bounded region (face). A polygon is convex if every straight line segment joining any pair of vertices lies wholly within (or on the boundary of) the polygon.
A straight line segment connecting two vertices that are not adjacent to each other in an n-vertex polygon is called a diagonal of that polygon.
Claim: Every -vertex convex polygon has diagonals, for all integers .
Proof: Let denote the statement “Every -vertex convex polygon has diagonals.” We will prove that is true for every integer by induction on .
For the base case, observe that every -vertex convex polygon is a triangle. Since every pair of vertices is adjacent in a triangle, there are no diagonals. Hence, every -vertex convex polygon has diagonals, which means that is true.
For the induction step, let’s assume that is true for some integer . Now consider a general -vertex convex polygon and some vertex belonging to . Then has some pair of vertices, say and , that is adjacent to in . Suppose we remove vertex and its adjoining line segments and connecting to and respectively from and connect and with a line segment . Let be the -vertex polygon that remains. Note that is also convex since any line segment connecting a pair of vertices in also connects a pair of vertices in and therefore must lie completely in by its convexity. So if there was a line segment connecting a pair of vertices in that did not lie wholly in , it would have to cross the section of that we cut off to form . This would only be possible if that line segment ended with the vertex , which is not in . Therefore, by , has diagonals. If we add and the line segments and back to to form , we introduce new diagonals, one for each vertex of that is not adjacent to , and becomes an additional diagonal of . Also, every diagonal of is also a diagonal of . Hence, in total, has
diagonals. Thus holds.
Therefore, since we have shown that is true and for every positive integer , we conclude that holds for every integer .
Notice that, as the examples above show, the base case does not always have to consider when . The base case just involves the smallest value of for which is true.
4.4.1 Strong Mathematical Induction
Sometimes, in the induction step, it’s not enough to know that is true in order to prove that is true; we might additionally need to use that some of the "previous" statements are true.
Definition 4.35 (Principle of Strong Mathematical Induction).
Let and let be statements. Suppose that
-
•
is true, (base case(s) - we may require more than one base case in a strong induction)
-
•
and for any integer , all the statements taken together imply (induction step).
Then is true for every positive integer .
We saw a proof of the following theorem as part of the Fundamental Theorem of Arithmetic in B1 Numbers, but there we used a contradiction argument. We can also prove this result using strong induction.
Theorem 4.36.
Every integer can be written as a product of primes .
If is prime, this statement still makes sense: we just interpret as being the product of just one number, itself.
Proof.
We prove by strong induction on . Let denote the statement “There exist and primes such that .
The base case immediately holds since 2 is prime.
For the induction step, let be an integer and suppose that holds for all integers satisfying (this is our strong inductive hypothesis). If is prime, there is nothing to prove; otherwise, if is not prime, then for some positive integers and greater than and smaller than . Both and can be decomposed into a product of primes by the strong induction hypothesis, thus so can . Hence holds.
Therefore, since is true and we’ve shown that, for every integer , follows from the statements for all integers satisfying , we conclude by strong induction that holds for all integers .
Another situation which often requires strong induction is when studying recurrence relations. A recurrence relation is a sequence of numbers where each is defined in terms of previous terms in the sequence. The most famous recurrence relation is the Fibonacci sequence, which is defined as , and for , .
Example 4.37.
Let and for , let
Let’s try and find a general pattern for what is. If we test the first few values, we get
so it seems like . We will prove this by strong induction.
Let be the statement "".
Let’s first prove the first two base cases. Indeed, and , so and are true.
Now suppose is true for for some . Then
This proves that is true, and thus for all by strong induction.
Exercise 4.38.
Why did we need to prove two base cases here? Why did we use strong induction instead of standard induction?
Solution (please try for yourself before looking)
Note that the formula holds only for , so in particular, we can’t use it to prove (i.e. that ). For this reason, we need to prove a few more base cases that can’t be covered in the induction step.
Both and need to be true to prove , but with standard induction, we would need to prove follows from alone.
4.4.2 When induction fails
It can be very easy to write something that looks like an induction proof but is actually invalid. Sometimes these errors can be very subtle.
Exercise 4.39.
Spot the flaws in the following argument:
Let be the statement that “”.
Assume holds for some , then
| by assumption | ||||
| by rearranging | ||||
| by definition of |
Therefore, we can conclude holds for all .
Solution (please try for yourself before looking)
In this argument, the induction step is clear and valid. However, there is no base case. Checking , we find which means this proof cannot hold.
Exercise 4.40.
Spot the flaws in the following argument:
Define to be the statement “For any collection of lines in the plane, if no two are parallel then all lines intersect at one point”.
The base case holds immediately.
Assume for some so that any collection of lines (no two of which are parallel) intersect at a single point. Then consider having lines, where no two are parallel. The first lines must intersect at a single point by the inductive hypothesis. Moreover, the last lines also intersect at a single point . Two non-parallel lines intersect at a single point so .
Therefore holds for all .
Solution (please try for yourself before looking)
The error here is more subtle. The base case is valid for and the induction step shows , but only for because you need the same two lines to be present in the first lines and the last lines to conclude that . So we are missing two additional base cases: (which is true) and which is false.
Exercise 4.41.
Spot the flaws in the following argument:
We will prove by induction on that whenever is a non-negative integer and is a non-zero real number.
Let be the statement “ for every non-zero ”.
As , is true.
Assume holds. Then
Hence is true.
Therefore, holds for all values .
Solution (please try for yourself before looking)
This is an example where the induction step fails from too few base cases. Here, the induction step uses strong induction to assume both and . But we have only shown that one base case is true. So in order to rely on and to show , we would need to show is true too. But clearly, for all non-zero is false. So the proof fails.