10.1 Foundations and Motivation

In this chapter, we consider inequalities.

Definition 10.1 (Inequalities).

Given x,yx,y\in\mathbb{R}, if there exists a positive zz\in\mathbb{R} such that

x+z=yx+z=y

then we say “xx is less than yy” and we write x<yx<y.

Equivalently, we may say “yy is greater than xx” and write y>xy>x.

If zz\in\mathbb{R} may have the value 0, we say “xx is at most yy” or “xx is less than or equal to yy”, and we write xyx\leqslant y. Equivalently, we may say “yy is at least xx” and write yxy\geqslant x.

Informally, we say x<yx<y exactly when “xx is strictly to the left of yy on the number line” and y>xy>x when “yy is strictly to the right of xx”.

xxyy
Figure 10.1: A real number line with x<yx<y and y>xy>x.

When we write xyx\leq y we mean that either x<yx<y or x=yx=y. Hence it is true that 232\leq 3.

Inequalities are particularly useful to allow us to place a constraint on what values a variable can take.

Definition 10.2 (Bounds).

If xyx\leqslant y or x<yx<y, we say that yy is an upper bound for xx and xx is a lower bound for yy.

If the bound we use is the best possible bound (so x<yax<y-a is not valid for any a>0a>0), then we call the bound sharp.

10.1.1 Axioms of Inequalities

There are some foundational rules11 1 An axiom is a foundational statement that is accepted as true without any proof. You will see more on axioms next semester in IMA but can read more about them now. that the inequality obeys. Note that the rules below only refer to the strict inequality << but the corresponding rules for >>, \leqslant and \geqslant can be derived from these rules as a result of their definitions.

Definition 10.3 (Axioms of Inequalities on the Reals).

For every x,y,zx,y,z\in\mathbb{R}, the inequality << on \mathbb{R} obeys the following rules.

  • (R1)

    Exactly one of the following is true: 0<x0<x, x=0x=0, or x<0x<0.

  • (R2)

    If x<yx<y and cc\in\mathbb{R}, then x+c<y+cx+c<y+c.

  • (R3)

    If 0<x0<x and 0<y0<y, then 0<xy0<xy.

  • (R4)

    If x<yx<y and y<zy<z, then x<zx<z. [Transitivity of <<]

Again, you may be able to think of some more rules for inequalities that aren’t listed in Definition 10.3 but we can derive them from these rules for inequalities and our core rules for addition and multiplication over \mathbb{R}.

Proposition 10.4 (More rules for inequalities).

For every x,y,zx,y,z\in\mathbb{R},

  1. (a)

    if x>0x>0, then x<0-x<0.

  2. (b)

    if y<xy<x, then x<y-x<-y.

  3. (c)

    if x0x\neq 0, then x2>0x^{2}>0.

  4. (d)

    if x>0x>0 then 1x>0\frac{1}{x}>0.

  5. (e)

    if x>0x>0, then y>zy>z if and only if xy>xzxy>xz.

  6. (f)

    if 0<x0<x and 0<y0<y, then x2>y2x^{2}>y^{2} if and only if x>yx>y.

Proof.
  • (a)

    Suppose x>0x>0. Then taking c=xc=-x in (R2), we have x+(x)>0+(x)x+(-x)>0+(-x) and thus 0>x0>-x.

Exercise 10.5.

Prove the remaining claims in Proposition 10.4 using the axioms in Definition 10.3.

Solution (please try for yourself before looking)

For every x,y,zx,y,z\in\mathbb{R},

  1. (b)

    Suppose y<xy<x. Adding c=xyc=-x-y to both sides of the inequality using (R2), we have x<y-x<-y.

  2. (c)

    Suppose x0x\neq 0.

    If x>0x>0, then by (R3) we have x2>0x^{2}>0.

    If x<0x<0, then 0<x0<-x by part (b), and

    (x)(x)=x2>0(-x)(-x)=x^{2}>0

    by (R3).

  3. (d)

    Suppose x>0x>0 and define y=1x0y=\frac{1}{x}\neq 0. By (R1) either y>0y>0 or y<0y<0.

    If y<0y<0 then 0<y0<-y (by part (b)) and so

    0<x(y)=x1x=1.0<x\cdot(-y)=-x\frac{1}{x}=-1.

    This is a contradiction. So we cannot have y<0y<0.

    Hence y=1x>0y=\frac{1}{x}>0.

  4. (e)

    Suppose x>0x>0.

    If y>zy>z, then yz>0y-z>0 by (R2). Next xyxz>0xy-xz>0 by (R3), and so xy>xzxy>xz by (R2).

    Now suppose xy>xzxy>xz. Then x(yz)>0x(y-z)>0 by (R2). By part (d) we have 1x>0\frac{1}{x}>0, so multiplying both sides of our inequality by 1x\frac{1}{x} gives us yz>0y-z>0 by (R3). Finally y>zy>z by (R2).

  5. (f)

    Suppose 0<x0<x and 0<y0<y.

    If x>yx>y then (i) x2>xyx^{2}>xy by part (e), and (ii) xy>y2xy>y^{2} by part (e). Combining these with (R4) gives x2>y2x^{2}>y^{2}.

    If x2>y2x^{2}>y^{2} then (x+y)(xy)=x2y2>0(x+y)(x-y)=x^{2}-y^{2}>0 by (R2). We know x+y>0x+y>0 by (R2) so 1x+y>0\frac{1}{x+y}>0 by part (d). Hence xy=1x+y(x+y)(xy)>0x-y=\frac{1}{x+y}(x+y)(x-y)>0 by (R3). Therefore, x>yx>y by (R2).

Many inequalities are derived from the fact that if two factors have the same algebraic sign then their product is positive.

Exercise 10.6.

If a,A,b,Ba,A,b,B are all positive, and a>ba>b and A>BA>B then prove that Aa>BbAa>Bb.

Solution (please try for yourself before looking)

Since A>0A>0 and a>ba>b, then by Proposition 10.4 (e) Aa>AbAa>Ab.

Since b>0b>0 and A>BA>B, then by Proposition 10.4 (e) Ab>BbAb>Bb.

Since Aa>AbAa>Ab and Ab>BbAb>Bb it follows that Aa>BbAa>Bb by transitivity (Definition 10.3 (R4)).

Proposition 10.7.

For all a,ba,b\in\mathbb{R}, ab>0ab>0 if and only if a,b>0a,b>0, or a,b<0a,b<0.

Proof.

Suppose a,b>0a,b>0 or a,b<0a,b<0. If a,b>0a,b>0, then ab>0ab>0 follows from Definition 10.3 (R3). If a,b<0a,b<0, then Proposition 10.4 (a) tells us a,b>0-a,-b>0. Thus, by Definition 10.3 (R3), we get (a)(b)>0(-a)\cdot(-b)>0 and hence, ab>0ab>0.

For the other direction, suppose ab>0ab>0. We have two cases.

Case 1: a>0a>0. Then, since ab>0=a0ab>0=a\cdot 0, by Proposition 10.4 (e) we have b>0b>0.

Case 2: a<0a<0. Then, by Proposition 10.4 (b), a>0-a>0 and thus, 1a>0\frac{1}{-a}>0 by part Proposition 10.4 (d). Hence, by Definition 10.3 (R3), we have 1aab>0b>0\frac{1}{-a}\cdot ab>0\implies-b>0 and hence, b<0b<0 by Proposition 10.4 (b).

NOTE: For the basic rules of inequalities stated above, we usually use them in proofs without explicit reference to these results every time, just as we would typically take for granted that the order of multiplication and addition in the reals doesn’t matter (commutativity), or the product of two integers produces an integer.

We can continue building on our rules for inequalities to prove more complicated statements.

Exercise 10.8.

Prove that for any a,ba,b\in\mathbb{R},

aba2+b22ab\leq\frac{a^{2}+b^{2}}{2} (10.1)
Solution (please try for yourself before looking)

For any a,ba,b\in\mathbb{R} we have (ab)20(a-b)^{2}\geq 0. Expanding out the brackets gives

a22ab+b20.a^{2}-2ab+b^{2}\geq 0.

Adding 2ab2ab to both sides, and dividing both sides by 22 (which of course is positive, so we don’t change the inequality) gives

aba22+b22.ab\leq\frac{a^{2}}{2}+\frac{b^{2}}{2}.
Exercise 10.9.

Prove that

a2+b2+c2>bc+ca+ab,a^{2}+b^{2}+c^{2}>bc+ca+ab,

holds for all a,b,ca,b,c\in\mathbb{R}, except when a=b=ca=b=c.

Solution (please try for yourself before looking)

Solution 1: First note that

2(a2+b2+c2bccaab)=(bc)2+(ca)2+(ab)2.2(a^{2}+b^{2}+c^{2}-bc-ca-ab)=(b-c)^{2}+(c-a)^{2}+(a-b)^{2}.

Unless a,b,ca,b,c are equal, the right hand side is positive. This gives us our desired inequality.

Solution 2: Use the result (10.1) three times, namely,

b2+c22>bc\frac{b^{2}+c^{2}}{2}>bc
c2+a22>ca\frac{c^{2}+a^{2}}{2}>ca
a2+b22>ab\frac{a^{2}+b^{2}}{2}>ab

and add these together.