10.4 Important inequalities

If we know particular inequalities are true, we can use them to prove other inequalities. The following general inequalities are very useful.

10.4.1 The Triangle Inequality

The Triangle Inequality relates the absolute value of a sum of two numbers to the sum of the absolute values (or modulus) of those two numbers. This makes it very useful when calculating and manipulating quantities involving the absolute value.

Theorem 10.18 (Triangle Inequalities).

  
For any real numbers a,ba,b\in\mathbb{R}

  1. 1.
    |a+b||a|+|b|,|a+b|\leq|a|+|b|, (10.2)
  2. 2.
    |ab|||a||b||.|a-b|\geq||a|-|b||. (10.3)
Proof.

(1) The proof relies only on careful algebra as follows. For a,ba,b\in\mathbb{R}, notice that

ab|ab|, so that ab|a||b|.ab\leq|ab|,\mbox{ so that }ab\leq|a|\ |b|.

Indeed, if ab<0ab<0, we have |ab|=ab>0|ab|=-ab>0 and thus, ab<0<|ab|ab<0<|ab|. On the other hand, if ab0ab\geq 0, then |ab|=ab|ab|=ab and hence, ab|ab|ab\leq|ab|. Thus

a2+2ab+b2a2+2|a||b|+b2=|a|2+2|a||b|+|b|2.a^{2}+2ab+b^{2}\leq a^{2}+2|a|\ |b|+b^{2}=|a|^{2}+2|a|\ |b|+|b|^{2}.

Factoring this gives

|a+b|2(|a|+|b|)2|a+b|^{2}\leq(|a|+|b|)^{2}

Taking the positive square root on each side

|a+b||a|+|b|.|a+b|\leq|a|+|b|.

(2) The second part is proved as follows: for a,ba,b\in\mathbb{R}

ab|ab|, so that ab|a||b|.ab\leq|ab|,\mbox{ so that }ab\leq|a|\ |b|.

Thus

a22ab+b2a22|a||b|+b2=|a|22|a||b|+|b|2.a^{2}-2ab+b^{2}\geq a^{2}-2|a|\ |b|+b^{2}=|a|^{2}-2|a|\ |b|+|b|^{2}.

Factoring this gives

|ab|2(|a||b|)2|a-b|^{2}\geq(|a|-|b|)^{2}

Taking the positive square root on each side

|ab|||a||b||.|a-b|\geq||a|-|b||.

Note that if a,b0a,b\geq 0, then

|a+b|=a+b=|a|+|b|.|a+b|=a+b=|a|+|b|.

Similarly, if a,b<0a,b<0, then

|a+b|=(a+b)=ab=|a|+|b|.|a+b|=-(a+b)=-a-b=|a|+|b|.
Exercise 10.19.

Determine what conditions aa and bb need to satisfy for (10.3) to be an equality.

Solution (please try for yourself before looking)

We are trying to solve

|ab|=||a||b||.|a-b|=||a|-|b||.

Note that if a,b0a,b\geq 0

||a||b||=|ab|.||a|-|b||=|a-b|.

Similarly if a,b0a,b\leq 0

||a||b||=|a+b|=|ab|.||a|-|b||=|-a+b|=|a-b|.

Equality holds if and only if aa and bb have the same algebraic sign.

This is one of many triangle inequalities that you will see in your degree, so called because they reflect the geometric result that the sum of the lengths of any two sides of a triangle is greater than or equal to the length of the third side.

aabbca+bc\leq a+b
Figure 10.3: A triangle whose third side is shorter than the sum of the other two sides
Example 10.20.

Suppose we wish to prove that

|5n2+2nn2+45|22n\left|\frac{5n^{2}+2n}{n^{2}+4}-5\right|\leqslant\frac{22}{n}

for all nn\in\mathbb{N}.

By the triangle inequality,

|5n2+2nn2+45|=|2n20n2+4||2nn2+4|+|20n2+4|=2nn2+4+20n2+4\left|\frac{5n^{2}+2n}{n^{2}+4}-5\right|=\left|\frac{2n-20}{n^{2}+4}\right|% \leq\left|\frac{2n}{n^{2}+4}\right|+\left|\frac{-20}{n^{2}+4}\right|=\frac{2n}% {n^{2}+4}+\frac{20}{n^{2}+4}

for all n1n\geqslant 1. So since

2nn2+4+20n2+42nn2+20n=22n,\frac{2n}{n^{2}+4}+\frac{20}{n^{2}+4}\leq\frac{2n}{n^{2}}+\frac{20}{n}=\frac{2% 2}{n},

we have

|5n2+2nn2+45|22n\left|\frac{5n^{2}+2n}{n^{2}+4}-5\right|\leq\frac{22}{n}

for all integers n1n\geq 1.

10.4.2 Bernoulli’s inequality

Bernoulli’s inequality has a lot of applications in real analysis22 2 You will see it used in IMA. It has a number of variants. Its most general statement is as follows.

Theorem 10.21 (Bernoulli’s Inequality).

Assume rr is a real number and r1r\geq 1. Then

(1+x)r1+rxfor allx1.(1+x)^{r}\geq 1+rx\quad\text{for all}\quad x\geq-1. (10.4)

The proof of this general statement is beyond the scope of this course. However, we are equipped to prove a more restricted version of Bernoulli’s Inequality.

Exercise 10.22.

Prove Bernoulli’s inequality for rr\in\mathbb{N} by induction.

Solution (please try for yourself before looking)

Let P(n)P(n) be the statement

(1+x)n1+nxfor allx1.(1+x)^{n}\geq 1+nx\quad\text{for all}\quad x\geq-1.

For n=1n=1 we have

1+x1+xfor allx1.1+x\geq 1+x\quad\text{for all}\quad x\geq-1.

Hence P(1)P(1) is true.

Assume P(k)P(k) is true and consider (x+1)k+1(x+1)^{k+1}:

(1+x)k+1\displaystyle(1+x)^{k+1} =(1+x)k(1+x)\displaystyle=(1+x)^{k}(1+x)
(1+kx)(1+x), since 1+x0,\displaystyle\geq(1+kx)(1+x)\text{, since $1+x\geq 0$},
=1+kx+x+kx2\displaystyle=1+kx+x+kx^{2}
=1+x(k+1)+kx2\displaystyle=1+x(k+1)+kx^{2}
1+(k+1)x.\displaystyle\geq 1+(k+1)x.

Hence P(k+1)P(k+1) is true.

Since P(1)P(1) holds and P(k)P(k+1)P(k)\Rightarrow P(k+1), we conclude P(n)P(n) holds for all nn\in\mathbb{N} by induction.

Notice that y=1+rxy=1+rx is the tangent line to y=(1+x)ry=(1+x)^{r} at the point x=0x=0. Indeed if we take f(x)=(x+1)rf(x)=(x+1)^{r} and differentiate f(x)f(x) with respect to xx, we obtain

f(x)=r(1+x)r1.f^{\prime}(x)=r(1+x)^{r-1}.

Evaluating this at x=0x=0 then gives f(0)=rf^{\prime}(0)=r, which is the gradient of the tangent line at x=0x=0.

Since the tangent line to y=f(x)y=f(x) at x=x0x=x_{0} is given by

y=f(x0)+f(x0)(xx0),y=f(x_{0})+f^{\prime}(x_{0})(x-x_{0}),

we get our tangent line

g(x)=1+nx,g(x)=1+nx,

by substituting our value x0=0x_{0}=0.

For a specific value of rr, we can find a larger range of xx for which Bernoulli’s inequality holds by calculating the intersection point between this tangent line and our graph of y=(1+x)ry=(1+x)^{r}. For example, when r=3r=3, we can bring the lower bound for xx down to 3-3.

3-32-21-111222-21-1112233
Figure 10.4: Illustrating Bernoulli’s inequality for r=3r=3.

10.4.3 AM–GM inequality

Given a,b0a,b\geq 0, the AM-GM inequality relates the arithmetic mean (AM) a+b2\frac{a+b}{2} with the geometric mean (GM) ab\sqrt{ab}.

Theorem 10.23.
aba+b2\sqrt{ab}\leq\frac{a+b}{2} (10.5)

for all a,b0a,b\geq 0.

Proof.
0\displaystyle 0 (ab)2\displaystyle\leq(a-b)^{2}
=a22ab+b2\displaystyle=a^{2}-2ab+b^{2}
=a2+2ab+b24ab\displaystyle=a^{2}+2ab+b^{2}-4ab
=(a+b)24ab.\displaystyle=(a+b)^{2}-4ab.

Hence

aba+b2\sqrt{ab}\leq\frac{a+b}{2}
Exercise 10.24.

Prove that in (10.5) equality holds if and only if a=ba=b.

Solution (please try for yourself before looking)

The goal is to prove

ab=a+b2a=b.\sqrt{ab}=\frac{a+b}{2}\,\Leftrightarrow\,a=b.

Assume b=ab=a, then a+a2=a=a2\frac{a+a}{2}=a=\sqrt{a^{2}} and so ab=a+b2\sqrt{ab}=\frac{a+b}{2}.

Assume ab=a+b2\sqrt{ab}=\frac{a+b}{2}.

ab=a+b2\displaystyle\sqrt{ab}=\frac{a+b}{2}
\displaystyle\Leftrightarrow\, 4ab=(a+b)2\displaystyle 4ab=(a+b)^{2}
\displaystyle\Leftrightarrow\, 0=(ab)2\displaystyle 0=(a-b)^{2}
\displaystyle\Leftrightarrow\, a=b\displaystyle a=b
Example 10.25.

We can apply (10.5) twice to a1,a2,a3,a4a_{1},a_{2},a_{3},a_{4} to prove

(a1a2a3a4)14(a1a2)12+(a3a4)122a1+a2+a3+a44.\left(a_{1}a_{2}a_{3}a_{4}\right)^{\frac{1}{4}}\leq\frac{\left(a_{1}a_{2}% \right)^{\frac{1}{2}}+\left(a_{3}a_{4}\right)^{\frac{1}{2}}}{2}\leq\frac{a_{1}% +a_{2}+a_{3}+a_{4}}{4}. (10.6)

In more general form the AM–GM inequality states that the arithmetic mean of a list of non-negative real numbers is greater than or equal to the geometric mean.

Theorem 10.26 (Generalised AM-GM Inequality).

For any list of nn non-negative real numbers a1,a2,,ana_{1},\ a_{2},\ \ldots,\ a_{n},

a1a2anna1+a2++ann.\sqrt[n]{a_{1}\cdot a_{2}\cdots a_{n}}\leq\frac{a_{1}+a_{2}+\cdots+a_{n}}{n}.

Cauchy’s original proof of the AM-GM inequality used a special kind of forward-backward induction. Let P(n)P(n) be the statement

a1a2anna1+a2++ann.\sqrt[n]{a_{1}\cdot a_{2}\cdots a_{n}}\leq\frac{a_{1}+a_{2}+\cdots+a_{n}}{n}.

The idea of the proof is as follows.

  1. 1.

    Prove each of P(2n)P(2^{n}).

  2. 2.

    If n<2kn<2^{k} then prove P(2k)P(n)P(2^{k})\Rightarrow P(n)

To prove P(2n)P(2^{n}) use (10.6) repeatedly.

If n<2kn<2^{k}, to show P(2k)P(n)P(2^{k})\Rightarrow P(n) let AA be defined as

A:=a1+a2++ann.A:=\frac{a_{1}+a_{2}+\cdots+a_{n}}{n}.

Consider the following product with 2k2^{k} terms:

(a1a2anA2kn),\left(a_{1}a_{2}\cdots a_{n}A^{2^{k}-n}\right),

and then apply P(2k)P(2^{k}) so

(a1a2anA2kn)1/2ka1+a2++an+(2kn)A2k=2kA2K=A.\left(a_{1}a_{2}\cdots a_{n}A^{2^{k}-n}\right)^{1/2^{k}}\leq\frac{a_{1}+a_{2}+% \cdots+a_{n}+(2^{k}-n)A}{2^{k}}=\frac{2^{k}A}{2^{K}}=A.

By clearing the powers of AA to the right-hand side, we have

(a1a2an)1/2kAn/2k.\left(a_{1}a_{2}\cdots a_{n}\right)^{1/2^{k}}\leq A^{{n}/{2^{k}}}.

Finally, raising both sides to the power 2k/n2^{k}/n gives

(a1a2an)1/nA=a1+a2++ann.\left(a_{1}a_{2}\cdots a_{n}\right)^{1/n}\leq A=\frac{a_{1}+a_{2}+\cdots+a_{n}% }{n}.
Exercise 10.27.

Let c1,,cnc_{1},\ldots,c_{n} be positive real numbers. Prove that

(k=1nck)(k=1n1ck)n2.\left(\sum_{k=1}^{n}c_{k}\right)\left(\sum_{k=1}^{n}\frac{1}{c_{k}}\right)\geq n% ^{2}.
Solution (please try for yourself before looking)

Applying the AM-GM inequality to c1,,cnc_{1},\ldots,c_{n}

c1+c2++cnnc1c2cnn.\frac{c_{1}+c_{2}+\cdots+c_{n}}{n}\geq\sqrt[n]{c_{1}\cdot c_{2}\cdots c_{n}}.

Applying the AM-GM inequality to 1c1,,1cn\frac{1}{c_{1}},\ldots,\frac{1}{c_{n}}

1n(1c1+1c2++1cn)1c11c21cnn=1c1c2cnn.\frac{1}{n}\left(\frac{1}{c_{1}}+\frac{1}{c_{2}}+\cdots+\frac{1}{c_{n}}\right)% \geq\sqrt[n]{\frac{1}{c_{1}}\cdot\frac{1}{c_{2}}\cdots\frac{1}{c_{n}}}=\frac{1% }{\sqrt[n]{c_{1}\cdot c_{2}\cdots c_{n}}}.

Multiplying these two inequalities together, and multiplying by n2n^{2} gives

(k=1nck)(k=1n1ck)n2.\left(\sum_{k=1}^{n}c_{k}\right)\left(\sum_{k=1}^{n}\frac{1}{c_{k}}\right)\geq n% ^{2}.

10.4.4 Cauchy–Schwarz inequality

Theorem 10.28 (Cauchy–Schwarz Inequality).

Let a1,a2,,ana_{1},\,a_{2},\cdots,a_{n} and b1,b2,,bnb_{1},\,b_{2},\cdots,b_{n} be two sequences (lists) of real numbers then

a1b1+a2b2++anbna12+a22++an2b12+b22++bn2.a_{1}b_{1}+a_{2}b_{2}+\cdots+a_{n}b_{n}\leq\sqrt{a_{1}^{2}+a_{2}^{2}+\cdots+a_% {n}^{2}}\sqrt{b_{1}^{2}+b_{2}^{2}+\cdots+b_{n}^{2}}. (10.7)

(10.7) is one of the most important inequalities in mathematics. Using sigma-notation this is written as

k=1nakbkk=1nak2k=1nbk2.\sum_{k=1}^{n}a_{k}b_{k}\leq\sqrt{\sum_{k=1}^{n}a_{k}^{2}}\sqrt{\sum_{k=1}^{n}% b_{k}^{2}}.

We present two proofs of this inequality here.

Cauchy-Schwarz Inequality - Proof 1

Proof.

First note that (xy)2=x22xy+y20(x-y)^{2}=x^{2}-2xy+y^{2}\geq 0 for all xx, yy. In particular, let x=a1b2x=a_{1}b_{2} and y=a2b1y=a_{2}b_{1} then

0(a1b2)22(a1b2)(a2b1)+(a2b1)2.0\leq(a_{1}b_{2})^{2}-2(a_{1}b_{2})(a_{2}b_{1})+(a_{2}b_{1})^{2}.

Add 2a1b1a2b22a_{1}b_{1}a_{2}b_{2}, (a1b1)2(a_{1}b_{1})^{2} and (a2b2)2(a_{2}b_{2})^{2} to both sides

a12b12+2a1b1a2b2+a22b22a12b12+a12b22+a22b12+a22b22,a_{1}^{2}b_{1}^{2}+2a_{1}b_{1}a_{2}b_{2}+a_{2}^{2}b_{2}^{2}\leq a_{1}^{2}b_{1}% ^{2}+a_{1}^{2}b_{2}^{2}+a_{2}^{2}b_{1}^{2}+a_{2}^{2}b_{2}^{2},

and factor both sides.

(a1b1+a2b2)2(a12+a22)(b12+b22).(a_{1}b_{1}+a_{2}b_{2})^{2}\leq(a_{1}^{2}+a_{2}^{2})(b_{1}^{2}+b_{2}^{2}). (10.8)

This gives (10.7) for n=2n=2, and is the crucial step in proving (10.7) in general.

Let P(n)P(n) be the statement (10.7), that is

a1b1+a2b2++anbna12+a22++an2b12+b22++bn2.a_{1}b_{1}+a_{2}b_{2}+\cdots+a_{n}b_{n}\leq\sqrt{a_{1}^{2}+a_{2}^{2}+\cdots+a_% {n}^{2}}\sqrt{b_{1}^{2}+b_{2}^{2}+\cdots+b_{n}^{2}}.

Then, since a1b1a12b12a_{1}b_{1}\leq\sqrt{a_{1}^{2}}\sqrt{b_{1}^{2}} we see that P(1)P(1) is true for all a1,b1a_{1},b_{1}.

Note also, that by the algebra which proved (10.8) we see that P(2)P(2) is also true.

Assume that P(n)P(n) is true and consider

a1b1+a2b2++anbn+an+1bn+1\displaystyle a_{1}b_{1}+a_{2}b_{2}+\cdots+a_{n}b_{n}+a_{n+1}b_{n+1}
=\displaystyle= (a1b1+a2b2++anbn)+an+1bn+1\displaystyle(a_{1}b_{1}+a_{2}b_{2}+\cdots+a_{n}b_{n})+a_{n+1}b_{n+1}
\displaystyle\leq a12+a22++an2b12+b22++bn2+an+1bn+1\displaystyle\sqrt{a_{1}^{2}+a_{2}^{2}+\cdots+a_{n}^{2}}\sqrt{b_{1}^{2}+b_{2}^% {2}+\cdots+b_{n}^{2}}+a_{n+1}b_{n+1} Applying P(n)\displaystyle\mbox{Applying }P(n)
\displaystyle\leq a12+a22++an2+an+12b12+b22++an2+an+12\displaystyle\sqrt{a_{1}^{2}+a_{2}^{2}+\cdots+a_{n}^{2}+a_{n+1}^{2}}\sqrt{b_{1% }^{2}+b_{2}^{2}+\cdots+a_{n}^{2}+a_{n+1}^{2}} Applying P(2)\displaystyle\mbox{Applying }P(2)

Note, we proved P(2)P(2) independently, and so may use it within this proof in the form.

αβ+an+1bn+1α2+an+12β2+bn+12.\alpha\beta+a_{n+1}b_{n+1}\leq\sqrt{\alpha^{2}+a_{n+1}^{2}}\sqrt{\beta^{2}+b_{% n+1}^{2}}.

It therefore follows that P(n+1)P(n+1) also holds, and so P(n)P(n) follows by mathematical induction.

Cauchy-Schwarz Inequality - Proof 2

This proof of the Cauchy-Schwarz Inequality makes use of the following three lemmas.

Lemma 10.29.

The quadratic inequality Ax2+2Bx+C0Ax^{2}+2Bx+C\geq 0 holds for all xx\in\mathbb{R} if and only if A>0A>0 and ACB20AC-B^{2}\geq 0 (or A=B=0A=B=0, and C>0C>0).

Proof.

First suppose A=B=0A=B=0 and C0C\geq 0. Then

Ax2+2Bx+C=C0.Ax^{2}+2Bx+C=C\geq 0.

Also, if A>0A>0 and ACB20AC-B^{2}\geq 0, then CB2AC\geq\frac{B^{2}}{A}. Thus, by completing the square,

Ax2+2Bx+C=A(x+BA)2B2A+CB2A+C0.Ax^{2}+2Bx+C=A\left(x+\frac{B}{A}\right)^{2}-\frac{B^{2}}{A}+C\geq-\frac{B^{2}% }{A}+C\geq 0.

Hence, if A>0A>0 and ACB20AC-B^{2}\geq 0 (or A=B=0A=B=0, and C0C\geq 0), we obtain Ax2+2Bx+C0Ax^{2}+2Bx+C\geq 0.

For the other direction, suppose that Ax2+2Bx+C0Ax^{2}+2Bx+C\geq 0 holds for all xx\in\mathbb{R}.

If A=0A=0 then Ax2+2Bx+C=2Bx+C0Ax^{2}+2Bx+C=2Bx+C\geq 0 holds for all xx\in\mathbb{R}. This means that B=0B=0 and C0C\geq 0.

If A0A\neq 0, then by completing the square, we have

Ax2+2Bx+C=A(x+BA)2B2A+C0,Ax^{2}+2Bx+C=A\left(x+\frac{B}{A}\right)^{2}-\frac{B^{2}}{A}+C\geq 0,

for all xx\in\mathbb{R}. This means A>0A>0 and its minimum value B2A+C-\frac{B^{2}}{A}+C must be non-negative i.e.

B2A+C0ACB20.-\frac{B^{2}}{A}+C\geq 0\iff AC-B^{2}\geq 0.
Lemma 10.30.

Let nn\in\mathbb{N} and a1,,an,b1,,bna_{1},\ldots,a_{n},b_{1},\ldots,b_{n}\in\mathbb{R}. Then,

k=1n(akx+bk)20,\sum_{k=1}^{n}(a_{k}x+b_{k})^{2}\geq 0,

for every xx\in\mathbb{R}.

Proof.

We prove this by induction on nn.

Let P(n)P(n) be the statement “i=1n(aix+bi)20\sum_{i=1}^{n}(a_{i}x+b_{i})^{2}\geq 0”.

For n=1n=1 we have

(a1x+b1)20(a_{1}x+b_{1})^{2}\geq 0

since y20y^{2}\geq 0 for all yy\in\mathbb{R}. Hence P(1)P(1) is true.

Assume P(k)P(k) is true. Then

i=1k+1(aix+bi)2=(i=1k(aix+bi)2)+(ak+1x+bk+1)2.\sum_{i=1}^{k+1}(a_{i}x+b_{i})^{2}=\left(\sum_{i=1}^{k}(a_{i}x+b_{i})^{2}% \right)+(a_{k+1}x+b_{k+1})^{2}.

The first term is non-negative by P(k)P(k), and we have (ak+1x+bk+1)20(a_{k+1}x+b_{k+1})^{2}\geq 0.

Hence P(k+1)P(k+1) is true.

Since P(1)P(1) holds and P(k)P(k+1)P(k)\Rightarrow P(k+1), we conclude P(n)P(n) holds for all nn\in\mathbb{N} by induction.

Lemma 10.31.

Let nn\in\mathbb{N} and a1,,an,b1,,bna_{1},\ldots,a_{n},b_{1},\ldots,b_{n}\in\mathbb{R}.Then

k=1n(akx+bk)2=x2k=1nak2+2xk=1nakbk+k=1nbk2\sum_{k=1}^{n}(a_{k}x+b_{k})^{2}=x^{2}\sum_{k=1}^{n}a_{k}^{2}+2x\sum_{k=1}^{n}% a_{k}b_{k}+\sum_{k=1}^{n}b_{k}^{2}

for all xx\in\mathbb{R}.

Proof.

We prove by induction on nn. Let P(n)P(n) be the statement

i=1n(aix+bi)2=x2i=1nai2+2xi=1naibi+i=1nbi2.\sum_{i=1}^{n}(a_{i}x+b_{i})^{2}=x^{2}\sum_{i=1}^{n}a_{i}^{2}+2x\sum_{i=1}^{n}% a_{i}b_{i}+\sum_{i=1}^{n}b_{i}^{2}.

For n=1n=1 we have

(a1x+b1)2=x2a12+2xa1b1+b12.(a_{1}x+b_{1})^{2}=x^{2}a_{1}^{2}+2xa_{1}b_{1}+b_{1}^{2}.

Hence P(1)P(1) is true.

Assume P(k)P(k) is true and consider:

i=1k+1(aix+bi)2\displaystyle\sum_{i=1}^{k+1}(a_{i}x+b_{i})^{2} =(i=1k(aix+bi)2)+(ak+1x+bk+1)2\displaystyle=\left(\sum_{i=1}^{k}(a_{i}x+b_{i})^{2}\right)+(a_{k+1}x+b_{k+1})% ^{2}
=(x2i=1kai2+2xi=1kaibi+i=1kbi2)+(ak+1x+bk+1)2, by P(k),\displaystyle=\left(x^{2}\sum_{i=1}^{k}a_{i}^{2}+2x\sum_{i=1}^{k}a_{i}b_{i}+% \sum_{i=1}^{k}b_{i}^{2}\right)+(a_{k+1}x+b_{k+1})^{2},\text{ by $P(k)$},
=(x2i=1kai2+2xi=1kaibi+i=1kbi2)\displaystyle=\left(x^{2}\sum_{i=1}^{k}a_{i}^{2}+2x\sum_{i=1}^{k}a_{i}b_{i}+% \sum_{i=1}^{k}b_{i}^{2}\right)
+(x2ak+12+2xak+1bk+1+bk+12)\displaystyle\hphantom{=}+(x^{2}a_{k+1}^{2}+2xa_{k+1}b_{k+1}+b_{k+1}^{2})
=x2(i=1kai2+ak+12)\displaystyle=x^{2}\left(\sum_{i=1}^{k}a_{i}^{2}+a_{k+1}^{2}\right)
+2x(i=1kaibi+ak+1bk+1)\displaystyle\hphantom{=}+2x\left(\sum_{i=1}^{k}a_{i}b_{i}+a_{k+1}b_{k+1}\right)
+(i=1kbi2+bk+12)\displaystyle\hphantom{=}+\left(\sum_{i=1}^{k}b_{i}^{2}+b_{k+1}^{2}\right)
=x2i=1k+1ai2+2xi=1k+1aibi+i=1k+1bi2.\displaystyle=x^{2}\sum_{i=1}^{k+1}a_{i}^{2}+2x\sum_{i=1}^{k+1}a_{i}b_{i}+\sum% _{i=1}^{k+1}b_{i}^{2}.

Hence P(k+1)P(k+1) is true.

Since P(1)P(1) holds and P(k)P(k+1)P(k)\Rightarrow P(k+1), we conclude P(n)P(n) holds for all nn\in\mathbb{N} by induction.

Another proof of the Cauchy-Schwarz Inequality is as follows.

Proof.

By Lemma 10.30 and Lemma 10.31, we see that

k=1n(akx+bk)2=x2k=1nak2+2xk=1nakbk+k=1nbk20.\sum_{k=1}^{n}(a_{k}x+b_{k})^{2}=x^{2}\sum_{k=1}^{n}a_{k}^{2}+2x\sum_{k=1}^{n}% a_{k}b_{k}+\sum_{k=1}^{n}b_{k}^{2}\geq 0.

By Lemma 10.29, we know Ax2+2Bx+C0Ax^{2}+2Bx+C\geq 0 for all xx\in\mathbb{R} if and only if A>0A>0 and ACB20AC-B^{2}\geq 0 (or A=B=0A=B=0, and C>0C>0). So take A=k=1nak20A=\sum_{k=1}^{n}a_{k}^{2}\geq 0, B=k=1nakbkB=\sum_{k=1}^{n}a_{k}b_{k} and C=k=1nbk2C=\sum_{k=1}^{n}b_{k}^{2}.

If A=B=0A=B=0 and C>0C>0, then ak=0a_{k}=0 for all k=1,2,,nk=1,2,\ldots,n and the stated inequality holds since 000\leq 0.

If A>0A>0 and ACB20AC-B^{2}\geq 0, we conclude that

B2AC(k=1nakbk)2k=1nak2k=1nbk2B^{2}\leq AC\iff\left(\sum_{k=1}^{n}a_{k}b_{k}\right)^{2}\leq\sum_{k=1}^{n}a_{% k}^{2}\sum_{k=1}^{n}b_{k}^{2}

and thus,

k=1nakbkk=1nak2k=1nbk2.\sum_{k=1}^{n}a_{k}b_{k}\leq\sqrt{\sum_{k=1}^{n}a_{k}^{2}}\sqrt{\sum_{k=1}^{n}% b_{k}^{2}.}
Exercise 10.32.

Prove that for all a1,a2ana_{1},a_{2}\cdots a_{n}

a1+a2++anna12+a22++an2n.\frac{a_{1}+a_{2}+\cdots+a_{n}}{n}\leq\sqrt{\frac{a_{1}^{2}+a_{2}^{2}+\cdots+a% _{n}^{2}}{n}}.
Solution (please try for yourself before looking)

Start by taking (10.7) with b1=b2==bn=1b_{1}=b_{2}=\cdots=b_{n}=1.

a1+a2++anna12+a22++an2.a_{1}+a_{2}+\cdots+a_{n}\leq\sqrt{n}\sqrt{a_{1}^{2}+a_{2}^{2}+\cdots+a_{n}^{2}}.

Divide both sides by nn:

a1+a2++ann\displaystyle\frac{a_{1}+a_{2}+\cdots+a_{n}}{n} nna12+a22++an2\displaystyle\leq\frac{\sqrt{n}}{n}\sqrt{a_{1}^{2}+a_{2}^{2}+\cdots+a_{n}^{2}}
=1na12+a22++an2\displaystyle=\frac{1}{\sqrt{n}}\sqrt{a_{1}^{2}+a_{2}^{2}+\cdots+a_{n}^{2}}
=a12+a22++an2n\displaystyle=\sqrt{\frac{a_{1}^{2}+a_{2}^{2}+\cdots+a_{n}^{2}}{n}}
Exercise 10.33.

Let c1,,cnc_{1},\ldots,c_{n} be positive real numbers. Prove that

(k=1nck)(k=1n1ck)n2.\left(\sum_{k=1}^{n}c_{k}\right)\left(\sum_{k=1}^{n}\frac{1}{c_{k}}\right)\geq n% ^{2}.
Solution (please try for yourself before looking)

Let ak=cka_{k}=\sqrt{c_{k}} and bk=1ckb_{k}=\frac{1}{\sqrt{c_{k}}}. Apply the Cauchy-Schwarz inequality

k=1nak2k=1nbk2(k=1nakbk)2.\sum_{k=1}^{n}a_{k}^{2}\sum_{k=1}^{n}b_{k}^{2}\geq\left(\sum_{k=1}^{n}a_{k}b_{% k}\right)^{2}.

Taking these values

(k=1nck)(k=1n1ck)(k=1nck1ck)2=(k=1n1)2=n2.\left(\sum_{k=1}^{n}c_{k}\right)\left(\sum_{k=1}^{n}\frac{1}{c_{k}}\right)\geq% \left(\sum_{k=1}^{n}\sqrt{c_{k}}\frac{1}{\sqrt{c_{k}}}\right)^{2}=\left(\sum_{% k=1}^{n}1\right)^{2}=n^{2}.

Whenever we see a new inequality it is sensible to ask “how good is this?”. That is, does equality ever hold? To see when a1b1+a2b2++anbna_{1}b_{1}+a_{2}b_{2}+\cdots+a_{n}b_{n} actually equals

a12+a22++an2b12+b22++bn2\sqrt{a_{1}^{2}+a_{2}^{2}+\cdots+a_{n}^{2}}\sqrt{b_{1}^{2}+b_{2}^{2}+\cdots+b_% {n}^{2}}

we consider vectors in the plane.

10.4.5 The Cauchy–Schwarz Inequality and Vectors

Consider vectors a\vec{a} and b\vec{b} in 2\mathbb{R}^{2} to be any two vectors in the plane.

a=(a1a2),b=(b1b2).\vec{a}=\begin{pmatrix}a_{1}\\ a_{2}\end{pmatrix},\quad\vec{b}=\begin{pmatrix}b_{1}\\ b_{2}\end{pmatrix}.

Then the length or norm of the vector is given by

a=a12+a22.||\vec{a}||=\sqrt{a_{1}^{2}+a_{2}^{2}}.

This follows from the Pythagorean Theorem. Notice that the term a12+a22\sqrt{a_{1}^{2}+a_{2}^{2}} appears as part of the right-hand side of (10.7). We can therefore consider how to interpret the left-hand side of (10.7) in terms of vectors a\vec{a} and b\vec{b}.

Let us assume that a0\vec{a}\neq\vec{0} and b0\vec{b}\neq\vec{0}. If we use our particular values to show when equality holds in (10.7), then we are trying to solve equation

(a1b1+a2b2)2=(a12+a22)(b12+b22).(a_{1}b_{1}+a_{2}b_{2})^{2}=(a_{1}^{2}+a_{2}^{2})(b_{1}^{2}+b_{2}^{2}).

Looking back at the special case P(2)P(2) of the Cauchy-Schwarz inequality, the algebra is identical:

(a1b1+a2b2)2=(a12+a22)(b12+b22)\displaystyle(a_{1}b_{1}+a_{2}b_{2})^{2}=(a_{1}^{2}+a_{2}^{2})(b_{1}^{2}+b_{2}% ^{2})
\displaystyle\Leftrightarrow\quad 0=(a1b2)22(a1b2)(a2b1)+(a2b1)2.\displaystyle 0=(a_{1}b_{2})^{2}-2(a_{1}b_{2})(a_{2}b_{1})+(a_{2}b_{1})^{2}.
\displaystyle\Leftrightarrow\quad 0=(a1b2a2b1)2.\displaystyle 0=(a_{1}b_{2}-a_{2}b_{1})^{2}.

This proves that equality holds in (10.7) if and only if a1b2=a2b1a_{1}b_{2}=a_{2}b_{1}.

The goal is to show a1b2=a2b1a_{1}b_{2}=a_{2}b_{1} if and only if a\vec{a} and b\vec{b} are parallel.

  1. 1.

    Assume a2=0a_{2}=0. Then since a0\vec{a}\neq\vec{0}, we have a10a_{1}\neq 0. Then b2=0b_{2}=0 and a1b2=a2b1a_{1}b_{2}=a_{2}b_{1} if and only if a\vec{a} and b\vec{b} are parallel.

  2. 2.

    Assume b2=0b_{2}=0. Then since b0\vec{b}\neq\vec{0}, we have b10b_{1}\neq 0. Then a2=0a_{2}=0 and a1b2=a2b1a_{1}b_{2}=a_{2}b_{1} if and only if a\vec{a} and b\vec{b} are parallel

  3. 3.

    Assume a20a_{2}\neq 0 and b20b_{2}\neq 0. Divide both sides by a2b2a_{2}b_{2} to conclude that

    a1a2=b1b2\frac{a_{1}}{a_{2}}=\frac{b_{1}}{b_{2}}

    and hence a1b2=a2b1a_{1}b_{2}=a_{2}b_{1} if and only if a\vec{a} and b\vec{b} are parallel.

Hence, in all cases a1b2=a2b1a_{1}b_{2}=a_{2}b_{1} if and only if a\vec{a} and b\vec{b} are parallel.

Above we have worked directly in 2\mathbb{R}^{2}. Much of the above generalised to vectors, and will be covered fully in a linear algebra course.

The inner product or dot product of two vectors

a=(a1a2),b=(b1b2).\vec{a}=\begin{pmatrix}a_{1}\\ a_{2}\end{pmatrix},\quad\vec{b}=\begin{pmatrix}b_{1}\\ b_{2}\end{pmatrix}.

is given by

a,b=ab=a1b1+a2b2.\langle\vec{a},\vec{b}\rangle=\vec{a}\cdot\vec{b}=a_{1}b_{1}+a_{2}b_{2}.

Notice this is exactly the left-hand side of (10.7) for vectors in 2\mathbb{R}^{2}. In 2\mathbb{R}^{2} we can prove directly that

a,b=abcosθ\langle\vec{a},\vec{b}\rangle=||\vec{a}||\,||\vec{b}||\cos{\theta}

where θ\theta is the angle between the vectors a\vec{a} and b\vec{b}.

As we shall soon see in linear algebra courses, we can generalize both the length/norm of a vector and the inner product two of nn dimensional vectors.

a:=(k=1nak)12, and a,b:=k=1nakbk||\vec{a}||:=\left(\sum_{k=1}^{n}a_{k}\right)^{\frac{1}{2}},\mbox{ and }% \langle\vec{a},\vec{b}\rangle:=\sum_{k=1}^{n}a_{k}b_{k}

and the Cauchy-Schwarz inequality shows that

a,bab\langle\vec{a},\vec{b}\rangle\leq||\vec{a}||\,||\vec{b}||

for any two vectors.

Indeed, mathematicians choose to generalise the idea of angle θ\theta between two (non-zero) vectors in general by setting

cosθ:=a,bab.\cos{\theta}:=\frac{\langle\vec{a},\vec{b}\rangle}{||\vec{a}||\,||\vec{b}||}.

If particular equality holds in the Cauchy-Schwarz inequality if and only if the vectors are parallel. This is one reason why the Cauchy-Schwarz inequality proves to be so important.