5.9 The proof of Taylor’s theorem

This section is nonexaminable.

We motivated Taylor’s theorem as a generalisation of the mean value theorem, replacing the constant polynomial P0f,a(x):=f(a)P_{0}^{f,a}(x):=f(a) with higher-degree approximating polynomials. Recall, the first step in the proof of the mean value theorem was to prove Rolle’s theorem. The first step to proving Taylor’s theorem is the following ‘higher-order’ Rolle’s theorem.

Lemma 5.85 (Higher-order Rolle’s theorem).

Let II\subseteq\mathbb{R} be an open interval, n0n\in\mathbb{N}_{0} and f:If\colon I\to\mathbb{R} be (n+1)(n+1)-times differentiable. Let aa, bIb\in I with a<ba<b and suppose

(5.36) (5.36) f(a)=f(a)==f(n)(a)=f(b)=0.f(a)=f^{\prime}(a)=\cdots=f^{(n)}(a)=f(b)=0.

Then there exists some c(a,b)c\in(a,b) such that f(n+1)(c)=0f^{(n+1)}(c)=0.

The hypotheses of Lemma 5.85 are slightly different compared to the simple case of Rolle’s theorem as stated in Theorem 5.37. In particular, we assume ff is defined on an open interval II rather than on some closed interval [a,b][a,b]. This is just to avoid technicalities concerning defining the derivative f(k)(a)f^{(k)}(a) at the endpoint aa.99 9 One could formulate a version of Lemma 5.85 for functions f:[a,b]f\colon[a,b]\to\mathbb{R}, but then one would have to work with right-derivatives at the endpoint aa. In any case, Theorem 5.37 implies the n=0n=0 case of Lemma 5.85 by applying Theorem 5.37 to the restricted function f|[a,b]:[a,b]f|_{[a,b]}\colon[a,b]\to\mathbb{R}.

Proof (of Lemma 5.85).

We prove the result by induction on n0n\in\mathbb{N}_{0}. We have already observed that the n=0n=0 follows from Theorem 5.37, which serves as the base case.

Let nn\in\mathbb{N} and suppose, as an induction hypothesis, that the lemma holds for the n1n-1 case. Let f:If\colon I\to\mathbb{R} be (n+1)(n+1)-times differentiable, fix aa, bIb\in I with a<ba<b and suppose (5.36) holds. Note that the restricted function f|[a,b]:[a,b]f|_{[a,b]}\colon[a,b]\to\mathbb{R} is continuous on [a,b][a,b] and differentiable on (a,b)(a,b) and satisfies f(a)=f(b)=0f(a)=f(b)=0. By the simple case of Rolle’s theorem from Theorem 5.37 applied to f|[a,b]f|_{[a,b]}, there exists some d(a,b)d\in(a,b) such that f(d)=0f^{\prime}(d)=0. Now, the derivative f:If^{\prime}\colon I\to\mathbb{R} is nn-times differentiable and satisfies

f(a)=(f)(a)==(f)(n1)(a)=f(d)=0.f^{\prime}(a)=(f^{\prime})^{\prime}(a)=\cdots=(f^{\prime})^{(n-1)}(a)=f^{% \prime}(d)=0.

We can therefore apply the induction hypothesis to f:If^{\prime}\colon I\to\mathbb{R} to conclude that there exists some c(a,d)c\in(a,d) such that (f)(n)(c)=0(f^{\prime})^{(n)}(c)=0. But in this case, c(a,b)c\in(a,b) and f(n+1)(c)=(f)(n)(c)=0f^{(n+1)}(c)=(f^{\prime})^{(n)}(c)=0, as required. This closes the induction and completes the proof. ∎

Lemma 5.86 (Higher-order mean value theorem).

Let II\subseteq\mathbb{R} be an open interval, n0n\in\mathbb{N}_{0} and f:If\colon I\to\mathbb{R} be (n+1)(n+1)-times differentiable. Let aa, bIb\in I with a<ba<b. Then there exists some c(a,b)c\in(a,b) such that

(5.37) (5.37) f(b)Pnf,a(b)=f(n+1)(c)(n+1)!(ba)n+1.f(b)-P_{n}^{f,a}(b)=\frac{f^{(n+1)}(c)}{(n+1)!}(b-a)^{n+1}.

We use the same idea to prove Lemma 5.86 as we did to prove the simple case of the mean value theorem from Theorem 5.41. Recall, for the proof of Theorem 5.41, we subtracted a linear polynomial from ff to reduce to a situation where Rolle’s theorem applies. Here we subtract a degree n+1n+1 polynomial from ff to reduce to a situation where the higher-order Rolle’s theorem applies.

Proof (of Lemma 5.86).

First consider the function g:Ig\colon I\to\mathbb{R} given by g(x):=f(x)Pnf,a(x)g(x):=f(x)-P_{n}^{f,a}(x) for all xIx\in I. We know from Exercise 5.73 that

(5.38) (5.38) g(k)(a)=f(k)(a)(Pnf,a)(k)(a)=0for all 0kn.g^{(k)}(a)=f^{(k)}(a)-(P_{n}^{f,a})^{(k)}(a)=0\qquad\text{for all $0\leq k\leq n% $.}

So the function gg satisfies all the hypotheses of the higher-order Rolle’s theorem except possibly the condition g(b)=0g(b)=0. We can fix this by considering the modified function

h:I,h(x):=f(x)Pnf,a(x)f(b)Pnf,a(b)(ba)n+1(xa)n+1for all xI,h\colon I\to\mathbb{R},\qquad h(x):=f(x)-P_{n}^{f,a}(x)-\frac{f(b)-P_{n}^{f,a}% (b)}{(b-a)^{n+1}}\cdot(x-a)^{n+1}\qquad\text{for all $x\in I$,}

since a simple computation shows that hh also satisfies h(b)=0h(b)=0. Furthermore, observe that dkdxk(xa)n+1|x=a=0\frac{\mathrm{d}^{k}}{\mathrm{d}x^{k}}(x-a)^{n+1}|_{x=a}=0 for 0kn0\leq k\leq n. Combining this with (5.38) we have

h(k)(a)=0for all 0kn.h^{(k)}(a)=0\qquad\text{for all $0\leq k\leq n$.}

Thus, hh satisfies all the hypotheses of the higher-order Rolle’s theorem, so we can apply Lemma 5.85 to deduce that there exists some c(a,b)c\in(a,b) such that h(n+1)(c)=0h^{(n+1)}(c)=0. However,

0=h(n+1)(c)=f(n+1)(c)(n+1)!f(b)Pnf,a(b)(ba)n+1.0=h^{(n+1)}(c)=f^{(n+1)}(c)-(n+1)!\cdot\frac{f(b)-P_{n}^{f,a}(b)}{(b-a)^{n+1}}.

Rearranging, we obtain the desired identity (5.37). ∎

Exercise 5.87.

Let II\subseteq\mathbb{R} be an open interval, n0n\in\mathbb{N}_{0} and f:If\colon I\to\mathbb{R} be (n+1)(n+1)-times differentiable. Let aa, bIb\in I with a<ba<b. Show that there exists some c(a,b)c\in(a,b) such that

f(a)Pnf,b(a)=f(n+1)(c)(n+1)!(ab)n+1.f(a)-P_{n}^{f,b}(a)=\frac{f^{(n+1)}(c)}{(n+1)!}(a-b)^{n+1}.

This is a version of Lemma 5.86 with the roles of aa and bb flipped in the identity.

Hint: Let J:={a+bx:xI}J:=\{a+b-x:x\in I\} and apply Lemma 5.86 to the function g:Jg\colon J\to\mathbb{R} given by g(x):=f(a+bx)g(x):=f(a+b-x) for xJx\in J.

Taylor’s theorem is an immediate consequence of the higher-order mean value theorem.

Proof (of Theorem 5.74).

We use the argument from Interpretation 3 of the mean value theorem.

Let II\subseteq\mathbb{R} be an open interval, f:If\colon I\to\mathbb{R} be (n+1)(n+1)-times differentiable and aIa\in I. For any xIx\in I with x>ax>a, the function ff satisfies the hypotheses of the higher-order mean value theorem with b:=xb:=x. If xIx\in I and x<ax<a, then the same holds with the roles of aa and xx swapped. From this, we deduce that for all xI{a}x\in I\setminus\{a\} there exists some cxIc_{x}\in I lying between aa and xx such that

f(x)=Pnf,a(x)+f(n+1)(cx)(n+1)!(xa)n+1.f(x)=P_{n}^{f,a}(x)+\frac{f^{(n+1)}(c_{x})}{(n+1)!}(x-a)^{n+1}.

Indeed, for x>ax>a this follows directly from Lemma 5.86; for x<ax<a this follows from the ‘flipped’ form of Lemma 5.86 from Exercise 5.87. However, the above is exactly the statement of Taylor’s theorem! ∎