10.4 Important inequalities
If we know particular inequalities are true, we can use them to prove other inequalities. The following general inequalities are very useful.
10.4.1 The Triangle Inequality
The Triangle Inequality relates the absolute value of a sum of two numbers to the sum of the absolute values (or modulus) of those two numbers. This makes it very useful when calculating and manipulating quantities involving the absolute value.
Theorem 10.18 (Triangle Inequalities).
For any real numbers
-
1.(10.2)
-
2.(10.3)
Proof.
(1) The proof relies only on careful algebra as follows. For , notice that
Indeed, if , we have and thus, . On the other hand, if , then and hence, . Thus
Factoring this gives
Taking the positive square root on each side
(2) The second part is proved as follows: for
Thus
Factoring this gives
Taking the positive square root on each side
Note that if , then
Similarly, if , then
Exercise 10.19.
Determine what conditions and need to satisfy for (10.3) to be an equality.
Solution (please try for yourself before looking)
We are trying to solve
Note that if
Similarly if
Equality holds if and only if and have the same algebraic sign.
This is one of many triangle inequalities that you will see in your degree, so called because they reflect the geometric result that the sum of the lengths of any two sides of a triangle is greater than or equal to the length of the third side.
Example 10.20.
Suppose we wish to prove that
for all .
By the triangle inequality,
for all . So since
we have
for all integers .
10.4.2 Bernoulli’s inequality
Bernoulli’s inequality has a lot of applications in real analysis22 2 You will see it used in IMA. It has a number of variants. Its most general statement is as follows.
Theorem 10.21 (Bernoulli’s Inequality).
Assume is a real number and . Then
The proof of this general statement is beyond the scope of this course. However, we are equipped to prove a more restricted version of Bernoulli’s Inequality.
Exercise 10.22.
Prove Bernoulli’s inequality for by induction.
Solution (please try for yourself before looking)
Let be the statement
For we have
Hence is true.
Assume is true and consider :
Hence is true.
Since holds and , we conclude holds for all by induction.
Notice that is the tangent line to at the point . Indeed if we take and differentiate with respect to , we obtain
Evaluating this at then gives , which is the gradient of the tangent line at .
Since the tangent line to at is given by
we get our tangent line
by substituting our value .
For a specific value of , we can find a larger range of for which Bernoulli’s inequality holds by calculating the intersection point between this tangent line and our graph of . For example, when , we can bring the lower bound for down to .
10.4.3 AM–GM inequality
Given , the AM-GM inequality relates the arithmetic mean (AM) with the geometric mean (GM) .
Theorem 10.23.
for all .
Proof.
Hence
Exercise 10.24.
Prove that in (10.5) equality holds if and only if .
Solution (please try for yourself before looking)
The goal is to prove
Assume , then and so .
Assume .
Example 10.25.
We can apply (10.5) twice to to prove
In more general form the AM–GM inequality states that the arithmetic mean of a list of non-negative real numbers is greater than or equal to the geometric mean.
Theorem 10.26 (Generalised AM-GM Inequality).
For any list of non-negative real numbers ,
Cauchy’s original proof of the AM-GM inequality used a special kind of forward-backward induction. Let be the statement
The idea of the proof is as follows.
-
1.
Prove each of .
-
2.
If then prove
To prove use (10.6) repeatedly.
If , to show let be defined as
Consider the following product with terms:
and then apply so
By clearing the powers of to the right-hand side, we have
Finally, raising both sides to the power gives
Exercise 10.27.
Let be positive real numbers. Prove that
Solution (please try for yourself before looking)
Applying the AM-GM inequality to
Applying the AM-GM inequality to
Multiplying these two inequalities together, and multiplying by gives
10.4.4 Cauchy–Schwarz inequality
Theorem 10.28 (Cauchy–Schwarz Inequality).
Let and be two sequences (lists) of real numbers then
(10.7) is one of the most important inequalities in mathematics. Using sigma-notation this is written as
We present two proofs of this inequality here.
Cauchy-Schwarz Inequality - Proof 1
Proof.
First note that for all , . In particular, let and then
Add , and to both sides
and factor both sides.
This gives (10.7) for , and is the crucial step in proving (10.7) in general.
Note also, that by the algebra which proved (10.8) we see that is also true.
Assume that is true and consider
Note, we proved independently, and so may use it within this proof in the form.
It therefore follows that also holds, and so follows by mathematical induction.
Cauchy-Schwarz Inequality - Proof 2
This proof of the Cauchy-Schwarz Inequality makes use of the following three lemmas.
Lemma 10.29.
The quadratic inequality holds for all if and only if and (or , and ).
Proof.
First suppose and . Then
Also, if and , then . Thus, by completing the square,
Hence, if and (or , and ), we obtain .
For the other direction, suppose that holds for all .
If then holds for all . This means that and .
If , then by completing the square, we have
for all . This means and its minimum value must be non-negative i.e.
Lemma 10.30.
Let and . Then,
for every .
Proof.
We prove this by induction on .
Let be the statement “”.
For we have
since for all . Hence is true.
Assume is true. Then
The first term is non-negative by , and we have .
Hence is true.
Since holds and , we conclude holds for all by induction.
Lemma 10.31.
Let and .Then
for all .
Proof.
We prove by induction on . Let be the statement
For we have
Hence is true.
Assume is true and consider:
Hence is true.
Since holds and , we conclude holds for all by induction.
Another proof of the Cauchy-Schwarz Inequality is as follows.
Proof.
By Lemma 10.30 and Lemma 10.31, we see that
By Lemma 10.29, we know for all if and only if and (or , and ). So take , and .
If and , then for all and the stated inequality holds since .
If and , we conclude that
and thus,
Exercise 10.32.
Prove that for all
Solution (please try for yourself before looking)
Start by taking (10.7) with .
Divide both sides by :
Exercise 10.33.
Let be positive real numbers. Prove that
Solution (please try for yourself before looking)
Let and . Apply the Cauchy-Schwarz inequality
Taking these values
Whenever we see a new inequality it is sensible to ask “how good is this?”. That is, does equality ever hold? To see when actually equals
we consider vectors in the plane.
10.4.5 The Cauchy–Schwarz Inequality and Vectors
Consider vectors and in to be any two vectors in the plane.
Then the length or norm of the vector is given by
This follows from the Pythagorean Theorem. Notice that the term appears as part of the right-hand side of (10.7). We can therefore consider how to interpret the left-hand side of (10.7) in terms of vectors and .
Let us assume that and . If we use our particular values to show when equality holds in (10.7), then we are trying to solve equation
Looking back at the special case of the Cauchy-Schwarz inequality, the algebra is identical:
This proves that equality holds in (10.7) if and only if .
The goal is to show if and only if and are parallel.
-
1.
Assume . Then since , we have . Then and if and only if and are parallel.
-
2.
Assume . Then since , we have . Then and if and only if and are parallel
-
3.
Assume and . Divide both sides by to conclude that
and hence if and only if and are parallel.
Hence, in all cases if and only if and are parallel.
Above we have worked directly in . Much of the above generalised to vectors, and will be covered fully in a linear algebra course.
The inner product or dot product of two vectors
is given by
Notice this is exactly the left-hand side of (10.7) for vectors in . In we can prove directly that
where is the angle between the vectors and .
As we shall soon see in linear algebra courses, we can generalize both the length/norm of a vector and the inner product two of dimensional vectors.
and the Cauchy-Schwarz inequality shows that
for any two vectors.
Indeed, mathematicians choose to generalise the idea of angle between two (non-zero) vectors in general by setting
If particular equality holds in the Cauchy-Schwarz inequality if and only if the vectors are parallel. This is one reason why the Cauchy-Schwarz inequality proves to be so important.