Linearization of a Function
Recall the definition of \(f'(x_{0})\): \[\lim_{\Delta x\to0}\frac{f(x_{0}+\Delta x)-f(x_{0})}{\Delta x}=f'(x_{0}).\] From the definition of a limit, we know that for \(x\) close to \(x_{0}\), the fraction \([f(x_{0}+\Delta x)-f(x_{0})]/\Delta x\) must be close to \(f'(x_{0})\); that is, for \(x\) close to \(x_{0}\) \[\frac{f(x_{0}+\Delta x)-f(x_{0})}{\Delta x}\approx f'(x_{0}).\] Multiplying both sides by \(\Delta x\), we get \[f(x_{0}+\Delta x)-f(x_{0})\approx f'(x_{0})\Delta x.\] In terms of \(x=x_{0}+\Delta x\), we have \[f(x)-f(x_{0})\approx f'(x_{0})(x-x_{0})\] or \[f(x)\approx f(x_{0})+f'(x_{0})(x-x_{0}).\] Let’s denote the right-hand side by \(L(x)\): \[L(x)=f(x_{0})+f'(x_{0})(x-x_{0}).\] The graph of the function \(L\) is the tangent line to the curve \(y=f(x)\) at \(\left(x_{0},f(x_{0})\right)\).1 It is also clear from Figure 1 that the tangent line at \(\left(x_{0},f(x_{0})\right)\) is a good approximation to \(f(x)\) when \(x\) is close to \(x_{0}\).
- If we use the delta notation \(\Delta y=f(x)-f(x_{0})\) and \(\Delta x=x-x_{0}\), then we can write the linear approximation as \[\underbrace{\Delta y}_{f(x)-f(x_{0})}\approx f'(x)\underbrace{\Delta x}_{x-x_{0}}\]
Read more about the meaning of linearization
Hide the extra explanation
For example, if \(y=f(x)=x^{2}\), then \[f'(x)=2x\] and the linear approximation is \[\underbrace{f(x+\Delta x)}_{(x+\Delta x)^{2}}\approx\underbrace{f(x)}_{x^{2}}+2x\Delta x\] and by expanding \((x+\Delta x)^{2}\), we have \[x^{2}+2x\,\Delta x+\left(\Delta x\right)^{2}\approx x^{2}+2x\,\Delta x.\] This approximation tells us that if \(\Delta x\) is small, \((\Delta x)^{2}\) is negligible. This makes sense, because when \(\Delta x\) is small, \((\Delta x)^{2}\) is the square of a small number and this is much smaller than \(\Delta x\) itself. For example, if \(\Delta x=0.001\) then \((\Delta x)^{2}=0.000000001\).
For the geometrical interpretation of this approximation, consider a square having sides \(x\) by \(x\), and suppose that each side is increased to \(x+\Delta x\) (Figure 2). The enlarged square is made up of the original square of area \(x^{2}\), two rectangles at the top and on the right, each of which is of area \(x\cdot\Delta x\), and the little square at the top right corner of area \((\Delta x)^{2}\). We may ignore this little square and say that the area of the enlarged square is approximately \(x^{2}+2x\,\Delta x.\)
Is the Linear Approximation Always a Good Approximation?
Show the answer
Hide the answer
The linear approximation is based on the assumption that the tangent line remains close to the curve \(y=f(x)\) when \(x\) is close to \(x_{0}\). If the curve has a pronounced bend at \(\left(x_{0},f(x_{0})\right)\) (for example, see Figure 7), then the linear approximation does not do a good job unless \(x\) is really close to \(x_{0}\). The bending of a curve is measured by the second derivative. We will talk about the bending of curves in the next chapter.
Figure 7. The linear approximation does not do a good job if the curve is bending sharply near \((x_{0},f(x_{0}))\) unless \(x\) is really close to \(x_{0}\).
An Application of Linearization in Physics Problems
Show the application
Hide the application
Linearization can simplify some equations. Here is an example:
A simple pendulum is a body of mass \(m\) on the end of a massless string suspended from a point (Figure 8). When displaced a little bit and released, the pendulum swings back and forth. Let \(\theta\) denote the angle that the string makes with the vertical. The displacement of the body from equilibrium is then \(s=L\theta\), where \(L\) is the length of the string. The force along the direction of motion is \(-mg\sin\theta\) where \(g\) is acceleration due to gravity (\(g\approx9.81\) m/s\(^{2}\) or \(g\approx32\) ft/s\(^{2}\)). It follows from Newton’s second law that \[F=ma=-mg\sin\theta\Rightarrow a=-g\sin\theta.\] Because \(a=d^{2}s/dt^{2}\) and \(s=L\theta\) we have \[\underbrace{\frac{d^{2}}{dt^{2}}(L\theta)}_{a}=-g\sin\theta\] or \[L\frac{d^{2}\theta}{dt^{2}}=-g\sin\theta.\] To describe the oscillation of the pendulum, we look for a function \(\theta=f(t)\) that satisfies the following equation: \[\underbrace{f^{\prime\prime}(t)}_{\frac{d^{2}\theta}{dt^{2}}}+\frac{g}{L}\sin(\underbrace{f(t)}_{\theta})=0.\] There is no function (among the functions that we know) that can satisfy the above equation. However, we can use the linear approximation \(\sin\theta\approx\theta\) or \(\sin(f(t))\approx f(t)\) and rewrite the equation as \[f^{\prime\prime}(t)+\frac{g}{L}f(t)=0.\] We can show that the solution of this equation is \[f(t)=\theta_{0}\cos\left(\sqrt{\frac{g}{L}}\ t\right),\] where \(\theta_{0}=f(0)\) is initial angular displacement. As long as \(\theta_{0}\) is small (about \(22.5^{\circ}\) or less as we showed in the last example), the linear approximation is valid.
- Recall that the equation of a line through \((x_{0},y_{0})\) with slope \(m\) is \(y-y_{0}=m(x-x_{0})\) or \(y=y_{0}+m(x-x_{0}).\) So because the slope of the tangent line to the curve \(y=f(x)\) at \(\left(x_{0},f(x_{0})\right)\) is \(f'(x_{0})\), its equation is \(y=f(x_{0})+f'(x_{0})(x-x_{0}).\)↩︎
- Notice that \(L\) is not a linear function unless \(f(x_{0})=0\). However, even if \(f(x_{0})\neq0\), we may loosely call \(L\) a linear function because its graph is a line.↩︎