4.10 The continuous inverse function theorem

To conclude the chapter, we consider inverses of continuous functions. We shall prove the inverse function theorem, which allows us to deduce information about the inverse f1f^{-1} of a continuous function ff.

Continuity and injectivity

In order to define its inverse, we need a function ff to be injective. A simple example of an injective function is a function which is either increasing or decreasing (in other words, it is strictly monotone). However, there are examples of functions which are injective but are not strictly monotone.

Exercise 4.107.

Find an injective function f:(1,1)f\colon(-1,1)\to\mathbb{R} which is not strictly monotone.

The trick to Exercise 4.107 is to consider functions which have a jump or break in the graph. That is, your solution should be discontinuous. Indeed, if ff is both injective and continuous, then it is intuitively clear that this forces ff to be either increasing or decreasing. We make these ideas precise using the intermediate value theorem, leading to the following result.

Proposition 4.108.

Let II\subseteq\mathbb{R} be an interval and f:If\colon I\to\mathbb{R} be a continuous, injective function. Then ff is strictly monotone.

By the result of Exercise 4.107, we see that Proposition 4.108 may fail if we do not assume ff is continuous.

Proof (of Proposition 4.108).

We may assume that II contains at least two points, otherwise the claim is trivial. Let aa, bIb\in I with a<ba<b. Since ff is injective, this implies that either f(a)<f(b)f(a)<f(b) or f(a)>f(b)f(a)>f(b). We assume f(a)<f(b)f(a)<f(b); the remaining case can be treated similarly or by reflection (replacing ff with f-f).

We argue by contradiction, assuming that ff is not strictly monotone on [a,b][a,b]. This means there must exist some c(a,b)c\in(a,b) such that

eitherf(c)f(a)<f(b)orf(a)<f(b)f(c).\text{either}\quad f(c)\leq f(a)<f(b)\qquad\text{or}\qquad f(a)<f(b)\leq f(c).

In the first case, the intermediate value theorem implies that there is x1[c,b)x_{1}\in[c,b) such that f(x1)=f(a)f(x_{1})=f(a). In the second case, the same argument gives some x2(a,c]x_{2}\in(a,c] such that f(x2)=f(b)f(x_{2})=f(b). Since x1ax_{1}\neq a and x2bx_{2}\neq b, in either case this contradicts the injectivity of ff. Hence ff is strictly monotone. ∎

Recall from Corollary 4.95 that continuous functions map intervals to intervals. If we start with an open interval, however, then the image may not be open.

Exercise 4.109.

Consider the continuous function p2:(1,1)p_{2}\colon(-1,1)\to\mathbb{R} given by p2(x):=x2p_{2}(x):=x^{2}. Show that the image Im(p2):={p2(x):x(1,1)}\mathrm{Im}(p_{2}):=\{p_{2}(x):x\in(-1,1)\} is an interval, but not an open interval. Draw a figure to illustrate this.

However, if we work with injective continuous functions, then we can use Proposition 4.108 to show that the open property is preserved.

Corollary 4.110.

Let II\subseteq\mathbb{R} be an open interval and f:If\colon I\to\mathbb{R} be continuous and injective. Then the image Im(f):={f(x):xI}\mathrm{Im}(f):=\{f(x):x\in I\} is an open interval.

By the result of Exercise 4.109, we see that Corollary 4.110 may fail if we do not assume ff is injective.

To prove Corollary 4.110 we shall use a simple characterisation of open intervals.

Lemma 4.111.

Let II\subseteq\mathbb{R} be an interval. Then the following are equivalent:

  1. 1

    II is an open interval.

  2. 2

    For all xIx\in I, there exists some r>0r>0 such that (xr,x+r)I(x-r,x+r)\subseteq I.

  3. 3

    For all xIx\in I, there exists some α\alpha, β\beta\in\mathbb{R} with α<x<β\alpha<x<\beta such that α\alpha, βI\beta\in I.

Roughly speaking, property 2 tells us that for any point xIx\in I, we can move slightly to the left or slightly to the right of xx and still stay within II. Property 3 is closely related and says that II does not have any maximum or minimum.

Proof (of Lemma 4.111).

for Let aa, bb\in\mathbb{R} with a<ba<b.

  • 1. \Rightarrow 2.

    Consider the case I=(a,b)I=(a,b). Let xIx\in I so that a<x<ba<x<b. It follows that xa>0x-a>0 and bx>0b-x>0 and so r:=min{xa,bx}>0r:=\min\{x-a,b-x\}>0. Here rr corresponds to the shortest distance from xx to one of the endpoints of II. Observe that

    a=x(xa)xr<x<x+rx+(bx)=b.a=x-(x-a)\leq x-r<x<x+r\leq x+(b-x)=b.

    and so (xr,x+r)(a,b)(x-r,x+r)\subseteq(a,b). This shows II satisfies 2.

    If I=(a,)I=(a,\infty), I=(,b)I=(-\infty,b), I=I=\mathbb{R} or I=∅︀I=\emptyset, then similar (but easier) arguments show that II again satisfies 2. This shows that all open intervals satisfy 2.

  • 2. \Rightarrow 3.

    Let xIx\in I and assume 2 holds, so there exists some r>0r>0 such that (xr,x+r)I(x-r,x+r)\subseteq I. If we define α:=xr/2\alpha:=x-r/2 and β:=x+r/2\beta:=x+r/2, then it follows that α<x<β\alpha<x<\beta and α\alpha, β(xr,x+r)I\beta\in(x-r,x+r)\subseteq I, so α\alpha, βI\beta\in I as required.

  • 3. \Rightarrow 1.

    We prove the contrapositive. If I=[a,b]I=[a,b], I=(,b]I=(-\infty,b] or I=(a,b]I=(a,b], then taking x=bx=b it is clear that there does not exist any β>x\beta>x such that βI\beta\in I. If I=[a,)I=[a,\infty) or I=[a,b)I=[a,b), then taking x=ax=a it is clear that there does not exist any α<x\alpha<x such that αI\alpha\in I. This shows that whenever II is not open, property 3 fails.

Proof (of Corollary 4.110).

We know from Corollary 4.95 that Im(f)\mathrm{Im}(f) is an interval, so it remains to show that Im(f)\mathrm{Im}(f) is an open interval.

By Proposition 4.108 we know that ff is strictly monotone. We shall assume that ff is increasing; the case where ff is decreasing can be treated similarly or by reflection.

Let yIm(f)y\in\mathrm{Im}(f) so that y=f(x)y=f(x) for some xIx\in I. Since II is an open interval, by Lemma 4.111 there exist α\alpha, β\beta\in\mathbb{R} with α<x<β\alpha<x<\beta such that α\alpha, βI\beta\in I.

We now consider the points f(α)f(\alpha), f(β)Im(f)f(\beta)\in\mathrm{Im}(f). Since ff is increasing, f(α)<f(x)=y<f(β)f(\alpha)<f(x)=y<f(\beta). However, this means Im(f)\mathrm{Im}(f) satisfies condition 3 of Lemma 4.111 and so it must be an open interval, as required. ∎

The continuous inverse function theorem

Let II, JJ\subseteq\mathbb{R} be intervals and f:IJf\colon I\to J be a bijective function. The graph of the inverse function f1:JIf^{-1}\colon J\to I is obtained by reflecting the graph of ff across the diagonal y=xy=x: see Figure 4.28. Suppose ff is continuous. Intuitively, the graph of ff has no breaks, and reflecting the graph should not result in new breaks appearing. Thus, we expect that f1f^{-1} should also be continuous. The following theorem confirms that this is indeed the case.

Figure 4.28: The graph of f1:JIf^{-1}\colon J\to I is formed by reflecting the graph of f:IJf\colon I\to J.
Theorem 4.112 (Continuous inverse function theorem).

Let II, JJ\subseteq\mathbb{R} be intervals and f:IJf\colon I\to J be a continuous, bijective function. Then the inverse f1:JIf^{-1}\colon J\to I is continuous.

Proof.

We shall only show the result when II\subseteq\mathbb{R} is an open interval. The same arguments can be used to prove the general case, but there are additional technicalities when dealing with endpoints.

Fix bJb\in J and let ε>0\varepsilon>0 be given. Our goal is to find some δ>0\delta>0 such that

(4.21) (4.21) |f1(y)f1(b)|<εfor all yJ with |yb|<δ.|f^{-1}(y)-f^{-1}(b)|<\varepsilon\qquad\text{for all $y\in J$ with $|y-b|<% \delta$.}

By the surjectivity of ff there exists some aIa\in I such that b=f(a)b=f(a). Since II is an open interval, by Lemma 4.111 there exists some r>0r>0 such that (ar,a+r)I(a-r,a+r)\subseteq I. Thus, if we define ρ:=min{ε,r}\rho:=\min\{\varepsilon,r\}, then 0<ρε0<\rho\leq\varepsilon and (aρ,a+ρ)I(a-\rho,a+\rho)\subseteq I. Since ff is bijective, it is injective. Since ff is both continuous and injective, by Corollary 4.110, it maps the open interval (aε,a+ε)(a-\varepsilon,a+\varepsilon) to an open interval UU containing b=f(a)b=f(a).

Since UU\subseteq\mathbb{R} is an open interval and bUb\in U, by Lemma 4.111 there exists some δ>0\delta>0 such that (bδ,b+δ)U(b-\delta,b+\delta)\subseteq U. We claim that (4.21) holds for this choice of δ\delta.

Proving the claim is just a matter of unpacking the definitions. Suppose yJy\in J satisfies |yb|<δ|y-b|<\delta. Then yUy\in U and so f1(y)f1(U)=(aρ,a+ρ)f^{-1}(y)\in f^{-1}(U)=(a-\rho,a+\rho), using the fact that f1f^{-1} is the inverse to ff. Thus,

|f1(y)f1(b)|=|f1(y)a|<ρε,|f^{-1}(y)-f^{-1}(b)|=|f^{-1}(y)-a|<\rho\leq\varepsilon,

as required. ∎

We can use the continuous inverse function theorem to show that many familiar functions are continuous.

Example 4.113.

Given nn\in\mathbb{N}, let qn:[0,)[0,)q_{n}\colon[0,\infty)\to[0,\infty) be the function qn(x):=x1/nq_{n}(x):=x^{1/n}. Then qnq_{n} is continuous. Indeed, qnq_{n} is the inverse of the bijective polynomial map pn:[0,)[0,)p_{n}\colon[0,\infty)\to[0,\infty) given by pn(x):=xnp_{n}(x):=x^{n}. By Example 4.53, we know pnp_{n} is continuous, and therefore qnq_{n} is continuous by the continuous inverse function theorem.

Remark 4.114.

Whilst it is possible to argue directly from the definition to show the functions qnq_{n} from Example 4.113 are continuous, this is quite an involved and messy exercise. It is much easier to let the continuous inverse function theorem do the work for us!

Example 4.115.

Since we know from Lemma 4.33 that sin:[π/2,π/2][1,1]\sin\colon[-\pi/2,\pi/2]\to[-1,1] and cos:[0,π][1,1]\cos\colon[0,\pi]\to[-1,1] are both continuous, it follows from the continuous inverse function theorem that the inverse trigonometric functions arcsin:[1,1][π/2,π/2]\arcsin\colon[-1,1]\to[-\pi/2,\pi/2] and arccos:[1,1][0,π]\arccos\colon[-1,1]\to[0,\pi] are both continuous.

Example 4.116.

Suppose we know that exp:(0,)\exp\colon\mathbb{R}\to(0,\infty) is continuous. We can then apply the continuous inverse function theorem to deduce that log:(0,)\log\colon(0,\infty)\to\mathbb{R} is continuous. Thus, in order to prove the ‘borrowed’ Lemma 4.35, we only need to show exp:(0,)\exp\colon\mathbb{R}\to(0,\infty) is continuous. We shall establish this fact in the next chapter.