6.6 The Mean-Value Theorem for Derivatives

There is an important relation between the increment quotient \({\displaystyle \frac{\Delta y}{\Delta x}}\) and \(\dfrac{dy}{dx}=f^\prime(x)\). In calculus, many properties of functions can be deduced from this relation, which is called the Mean-Value Theorem. Let’s first consider the geometric significance of this theorem.

Consider the graph of \(y=f(x)\) and two points \(A(a,f(a))\) and \(B(b,f(b))\) on it (Figure 1). Because \(AG=b-a\) and \(GB=f(b)-f(a)\), the slope of the chord \(AB\) is \[\tan(\angle{GAB})=\frac{GB}{AG}=\frac{f(b)-f(a)}{b-a}=\frac{\Delta f}{\Delta x}.\tag{i}\]

Figure 1: Geometric interpretation of the Mean-Value Theorem

As we can see, there is at least one point on the curve between \(A\) and \(B\) (as \(P\)) where the tangent to the curve \(y=f(x)\) is parallel to the chord \(AB\). If the \(x\)-value of \(P\) is \(c\), the slope at \(P\) is \[\tan\theta=f^\prime(c)=\tan(\angle{GAB}).\tag{ii}\] Equating (i) and (ii), we get \[\frac{f(b)-f(a)}{b-a}=f^\prime(c).\tag{iii}\]

In fact, we will find Eq. (iii) intuitive if we think of \(f(t)\) as the position of a particle (that moves on a straight line) at time \(t\). The left-hand side of Eq. (iii) represents the average velocity in the time interval \([a,b]\) and the right-hand side of Eq. (iii) is the instantaneous velocity at some time. This equation states that at some instant the instantaneous velocity must be equal to the average velocity. For example, if the average speed (or average velocity) of a train is 100 km/h (kilometers per hour), then its speedometer must register 100 km/h at least once during the trip.

The formal statement of the mean-value theorem is as follows.

Theorem 1.The MEAN-VALUE THEOREM FOR DERIVATIVES. If function \(f(x)\) is a function with the following properties:

\(f\) is continuous on a closed interval \([a,b]\) and
\(f\)is differentiable on the open interval \((a,b)\),

then there exists at least one point \(c\) in the open interval \((a,b)\) such that \[f^\prime(c)=\frac{f(b)-f(a)}{b-a},\tag{iii}\] or equivalently, \[f(b)-f(a)=f^\prime(c)(b-a).\tag{iv}\]

Show the proof …

The equation of the line that connects \(A\) and \(B\) is \[y-f(a)=\frac{f(b)-f(a)}{b-a}(x-a)\] or \[y=f(a)+\frac{f(b)-f(a)}{b-a}(x-a).\] If we, as shown in Figure 2, subtract the right hand side of the above equation from \(f(x)\) and define a new function \(h(x)\) \[h(x)=f(x)-\left[f(a)+\frac{f(b)-f(a)}{b-a}(x-a)\right],\] then \(h(x)\) satisfies the hypotheses of Rolle’s Theorem on \([a,b]\) because \(h\) is (1) continuous on \([a,b]\), (2) differentiable on \((a,b)\), and (3) \[h(a)=h(b).\] Therefore, there exists at least one point \(c\) in the interval \((a,b)\) such that \(h'(c)=0\). Because \[h'(x)=f(x)-\frac{f(b)-f(a)}{b-a},\] we have \[h'(c)=f^\prime(c)-\frac{f(b)-f(a)}{b-a}=0\Rightarrow f^\prime(c)=\frac{f(b)-f(a)}{b-a}.\]

Figure 2: There might be more than one point that satisfies Eq. (c).

The Mean-Value Theorem for derivatives is sometimes called Lagrange’s Mean-Value Theorem.
Note that the Mean-Value Theorem does not assert where the mean value \(c\) is located, except that it is somewhere between \(a\) and \(b\). In fact, except for special cases, it is often difficult to determine this point. However, many important properties can be deduced from the mere existence of such a point.
Similar to Rolle’s theorem where \(f^\prime\) vanishes, there might be more than one point that satisfies the Mean-Value Theorem. For example, see Figure 3 where two points satisfy Eq. (iii).
If at any point in the interval \((a,b)\), the derivative \(f^\prime\) fails to exists, the Mean-Value Theorem may not hold true.

Figure 3: There might be more than one point that satisfies Eq. (c).

We can express Eq. (iii) somewhat differently by noticing that the number \(c\) can be written as \[c=a+\theta(b-a),\] where \(\theta\) is a certain number between 0 and 1 (\(0<\theta<1\)). Now if we replace \(a\) by \(x\) and \(b\) by \(x+h\), we can express the Mean-Value Theorem by the formula \[\frac{f(x+h)-f(x)}{h}=f^\prime(x+\theta h),\] or \[f(x+h)=f(x)+hf^\prime(x+\theta h)\qquad\text{where }0<\theta<1.\]

Example 1

Verify the conclusion of the Mean-Value Theorem for \[f(x)=x^{2}-4x+3\] on the interval \([-2,3]\).

Solution 1

Because \(f\) is a polynomial, it is continuous and differentiable for all values of \(x\). Therefore, the hypotheses of the Mean-Value Theorem are satisfied for any closed interval. The slope of the secant line through the endpoint values (or the average change of \(f\) over the interval \([-2,3]\)) is

\[\frac{f(3)-f(-2)}{3-(-2)}=\frac{0-15}{5}=-3.\] The slope of the tangent line (or the instantaneous change of \(f\)) is \[f^\prime(x)=2x-4\] The Mean-Value Theorem says there exists some \(c\) in \((-2,3)\) such that \(f^\prime(c)=-3\): \[f^\prime(c)=2c-4=-3\] \[\Rightarrow c=\frac{1}{2}.\] Because \(1/2\) lies in the interval \((-2,3)\), the \(c\) value referred to in the conclusion of the Mean-Value Theorem is \(c=1/2\). See Figure 4.

Figure 4

Example 2

Show that
(a) If \(-1\leq f^\prime(x)\leq1\) for every \(x\), then \[\left|f(x)-f(y)\right|\leq|x-y|\] for every \(x\) and \(y\).
(b) For every \(x\) and \(y\) \[|\sin x-\sin y|\leq|x-y|.\]

Solution 2

(a) The existence of \(f^\prime(x)\) shows that \(f\) is differentiable and consequently continuous everywhere (see here). Therefore, the hypotheses of the Mean-Value Theorem are satisfied, and there exists some \(c\) between \(x\) and \(y\) such that \[f(x)-f(y)=f^\prime(c)(x-y).\] Taking the absolute value of both sides: \[|f(x)-f(y)|=|f^\prime(c)|\ |x-y|.\] Because \(-1\leq f^\prime(x)\leq1\) for every \(x\), \(|f^\prime(c)|\leq1\) and thus \[|f(x)-f(y)|\leq|x-y|.\] (b) Let \(f(x)=\sin x\). Then \[-1\leq f^\prime(x)=\cos x\leq1.\] It follows from part (a) that \[|f(x)-f(y)|\leq|x-y|\]

\[|\sin x-\sin y|\leq|x-y|.\]

Example 3

Show that if \(x>0\) \[\sqrt{1+x}<1+\frac{1}{2}x\]

Solution 3

Let \(f(x)=\sqrt{1+x}\). Then \[f^\prime(x)=\frac{1}{2\sqrt{1+x}}.\] The function \(f\) is continuous and differentiable for all \(x>-1\). Let \(a=0\) and \(b=x>0\). By the Mean-Value Theorem, there is a number \(c\) between \(0\) and \(x\) such that \[f(x)-f(0)=f^\prime(c)(x-0)\] or \[\sqrt{1+x}-1=f^\prime(c)x\tag{i }\] Now we claim that \(f^\prime(c)<\frac{1}{2}\), because for \(c>0\) \[1<\sqrt{1+c}\] and hence \[\frac{1}{\sqrt{1+c}}<1\] so \[f^\prime(c)=\frac{1}{2\sqrt{1+c}}<\frac{1}{2}.\tag{ii }\] Combining (i) and (ii), we obtain \[\sqrt{1+x}-1<\frac{1}{2}x\] or \[\sqrt{1+x}<1+\frac{1}{2}x.\]

Theorem 2. CAUCHY’S MEAN-VALUE FORMULA. Assume \(f(x)\) and \(g(x)\) are continuous on \([a,b]\) and are differentiable on \((a,b)\). Then there exists at least one point \(c\) in \((a,b)\) such that \[f^\prime(c)\left[g(b)-g(a)\right]=g^\prime(c)\left[f(b)-f(a)\right].\] If \(g^\prime(c)\neq0\) and \(g(a)\neq g(b)\), this is equivalent to: \[\frac{f(b)-f(a)}{g(b)-g(a)}=\frac{f^\prime(c)}{g^\prime(c)}.\]

Show the proof …

Similar to the proof of the Mean-Value theorem, we define a new function: \[h(x)=f(x)\left[g(b)-g(a)\right]-g(x)\left[f(b)-f(a)\right].\] Because \(f(x)\) and \(g(x)\) are both continuous in \([a,b]\) and differentiable in \((a,b)\), and \(g(b)-g(a)\) and \(f(b)-f(a)\) are just two numbers, it follows from Theorem 1 in Section on Continuity and Theorem 2 in Section on Differentiation Rules that \(h(x)\) is also continuous on \([a,b]\) and differentiatiable on \((a,b)\). We note that \[h(a)=h(b)=f(a)[g(b)-g(a)]-g(a)[f(b)-f(a)]=f(a)g(b)-f(b)g(a).\] Therefore, the conditions of Rolle’s theorem are satisfied. Applying Rolle’s theorem to \(h(x)\), we find that for some \(c\) in \((a,b)\), we have \[h'(c)=f^\prime(c)\left[g(b)-g(a)\right]-g^\prime(c)\left[f(b)-f(a)\right]=0,\] or \[f^\prime(c)\left[g(b)-g(a)\right]=g^\prime(c)\left[f(b)-f(a)\right].\]

Notice that the Mean-Value Theorem (Lagrange’s theorem) is the special case of Cauchy’s Mean-Value Formula when \(g(x)=x\).

Show the proof …

Hide the proof

Show the proof …

Hide the proof