In the previous chapter, we learned the substitution rule for indefinite integrals. In summary, if we have an indefinite integral of the form \[ \int f(g(x))g'(x)dx \] we make the substitution \[ u=g(x),\qquad du=g'(x)dx \] and transform the integral to the form \[ \int f(\underbrace{g(x)}_{u})\underbrace{g'(x)dx}_{du}=\int f(u)du. \] In this section, we will learn how to evaluate definite integral of the form \[ \int _{a}^{b}f(g(x))g'(x)dx, \] by substitution. For example, suppose that we want to evaluate \[ \int _{1}^{e}\frac{\ln x}{x}dx. \] It follows from the second part of the Fundamental Theorem of Calculus that \[ \int _{1}^{e}\frac{\ln x}{x}dx=\left [\int \frac{\ln x}{x}dx\right ]_{1}^{e}. \] To evaluate the indefinite integral \(\int \frac{\ln x}{x}dx\), we make the substitution \[ u=\ln x,\qquad du=\frac{1}{x}dx \] which converts the indefinite integral to \begin{align*} \int \overbrace{\ln x}^{u}\overbrace{\frac{1}{x}dx}^{du} & =\int udu\\ & =\frac{1}{2}u^{2}+C\\ & =\frac{1}{2}(\ln x)^{2}+C. \end{align*} Therefore, \begin{align*} \int _{1}^{e}\frac{\ln x}{x}dx & =\left [\int \frac{\ln x}{x}dx\right ]_{1}^{e}\\ & =\bigg [\frac{1}{2}(\ln x)^{2}\bigg ]_{1}^{e}\\ & =\frac{1}{2}\left [(\ln e)^{2}-(\ln 1)^{2}\right ]=\frac{1}{2}\left [1^{2}-0^{2}\right ]\\ & =\frac{1}{2}. \end{align*} When integrating by the substitution of a new variable (\(u\) in the above example), it is sometimes rather troublesome to express the result in terms of the original variable (\(x\) in the above example). We may avoid the process of restoring the original variable if we change the limits of integration to correspond with the new variable. In the above example, the lower limit of integration is \(x=1\), so in terms of \(u\), the lower limit becomes \(u=\ln 1=0\). The upper limit of integration is \(x=e\), so the upper limit of integration in terms of \(u\) becomes \(u=\ln e=1\). Therefore, \[ \int _{1}^{e}\frac{\ln x}{x}dx=\int _{0}^{1}udu=\left [\frac{1}{2}u^{2}\right ]_{0}^{1}=\frac{1}{2}, \] and we get the same result as Geometrically, the equation \[ \int _{1}^{e}\frac{\ln x}{x}dx=\int _{0}^{1}udu \] means that the two different regions shown in Figure 1 have the same area. There are two methods to evaluate a definite integral by substitution: that is, \(F(g(x))\) is an antiderivative of \(f(g(x))g'(x)\). Therefore, it follows from the second part of the Fundamental Theorem of Calculus that \begin{align*} \int _{a}^{b}f(g(x))g'(x)dx & =F(g(x)\bigg ]_{x=a}^{x=b}\\ & =F(g(b))-F(g(a))\\ & =F(u)\bigg ]_{u=g(a)}^{u=g(b)}\\ & =\int _{g(a)}^{g(b)}f(u)du. \end{align*} So far, we have learned the substitution for definite integrals is \[ \int _{a}^{b}f(g(x))g'(x)dx=\int _{g(a)}^{g(b)}f(u)du.\tag{i} \] Instead of thinking of this rule as an algebraic consequence of the chain rule, let’s see what it means geometrically. For convenience, assume that: Let’s represent \(g\) with a separate domain and range, as in Figure 2. The function \(g\) maps the interval \([a,b]\) on the \(x\)-axis to the interval \([g(a),g(b)]\) on the \(u\)-axis. Now let’s subdivide the interval \([a,b]\) by \(a=x_{0},x_{1},\dots ,x_{n}=b\). The images of the points \(x_{0},\dots ,x_{n}\) under \(g\) are \[ u_{0}=g(x_{0}),u_{1}=g(x_{1}),\dots ,u_{n}=g(x_{n}) \] on the \(u\)-axis. Specifically, the image of the subinterval \([x_{i-1},x_{i}]\) under \(g\) is the subinterval \([g(x_{i-1}),g(x_{i})]\). Even if the division points \(x_{0},\dots ,x_{n}\) are equally spaced, those image subintervals on the \(u\)-axis will not all have the same width. In fact, the width of \([u,u+\Delta u]\) is approximately \(g'(x)\) times the width of \([x,x+\Delta x]\) because by using linear approximation (Figure 3) we have \[ \Delta u\approx g'(x)\Delta x. \] The approximation becomes equality and \(\Delta u\to du=g'(x)dx\) as \(\Delta x=dx\to 0\). If \(g'(x)>1\), then the image of each subinterval on the \(x\)-axis is stretched on the \(u\)-axis and if \(0<g'(x)<1\), the image of each subinterval is compressed on the \(u\)-axis. Now let’s compare the graphs of \(f\circ g\) and \(f\) (Figure 4). Notice that the value of \(f\circ g\) at \(x_{i}\) is the same as the value of \(f\) at \(u_{i}\) \begin{align*} f\circ g(x_{i}) & =f(g(x_{i}))\\ & =f(u_{i}).\tag{$u_{i}=g(x_{i})$} \end{align*} Therefore, the rectangle with the corners \((x_{i},0),(x_{i}+\Delta x,0),(x_{i},f\circ g(x_{i})),\) and \((x_{i}+\Delta x,f\circ g(x_{i}))\) under the graph of \(f\circ g\) and the rectangle with the corners \((u_{i},0),(u_{i}+\Delta u,0),(u_{i},f(u_{i}))\) and \((u_{i}+\Delta u,f(u_{i}))\) under the graph of \(f\) have the same height but the width of the second one is approximately \(g'(x_{i})\Delta x\) (Figure 5). So if we multiply the height of the first rectangle by \(g'(x_{i})\) then both rectangles will have the same area (Figure 5). In other words, the role of \(g'(x)\) in the formula \[ \int _{a}^{b}f(g(x))g'(x)dx=\int _{g(a)}^{g(b)}f(u)du \] is to cancel the change in the width by the change in the height such that the net signed areas remain the same (Figure 6). For example, let \(f(u)=1.5\) for \(0\leq u\leq 4\) and \(u=g(x)=x^{2}\). Then \(g\) maps the interval \([0,2]\) to \([0,4]\). Figure 7 shows the graphs of \(f(u),\) \(f\circ g(x)\), and \((f\circ g)(x)g'(x)\). Because \(f\) is a constant function, then \[ \int _{0}^{4}f(u)du=1.5\times 4=6 \] and \[ \int _{0}^{2}f(g(x))dx=1.5\times 2=3. \]
We notice that \(g'(x)=2x\) and \[ 0\leq g'(x)\leq 1,\qquad \text{when }0\leq x\leq \frac{1}{2}. \] The function \(g\) maps the interval \([0,\frac{1}{2}]\) on the \(x\)-axis to a smaller interval \([0,\frac{1}{4}]\) on the \(u\)-axis. In fact, \(g\) maps any subinterval of \([0,\frac{1}{2}]\) to a smaller interval because by the Mean-Value Theorem we have \[ |u_{2}-u_{1}|=|g(x_{2})-g(x_{1})|=\underbrace{|g'(c)|}_{\leq 1}\ |x_{2}-x_{1}|\leq |x_{2}-x_{1}|, \] for every \(x_{1},x_{2}\in [0,\frac{1}{2}]\) and some \(c\) between \(x_{1}\) and \(x_{2}\). The area under graph of \(f\circ g\) on the interval \([0,\frac{1}{2}]\) is twice the area under the graph of \(f\) on the interval \([0,\frac{1}{4}]\) (compare the shaded areas in Figure 7(a) and (b)). If we multiply \(f\circ g\) by \(g'(x)=2x\), then the graph \((f\circ g)g'(x)\) is a straight line whose area on the interval \([0,\frac{1}{2}]\) is exactly the same as the area under the graph of \(f\) on the interval \([0,\frac{1}{4}]\) (compare the shaded areas in Figure 7(a) and (c)). On the other hand, because \[ 1<g'(x)\qquad \text{when }x>\frac{1}{2}, \] for every \(x_{1},x_{2}\) in \([\frac{1}{2},2]\), the image of \([x_{1},x_{2}]\) under \(g\) is a larger interval \([g(x_{1}),g(x_{2})]\) \[ |u_{2}-u_{1}|=|g(x_{2})-g(x_{1})|=\underbrace{\left |g'(c^{*})\right |}_{>1}\,\left |x_{2}-x_{1}\right |\geq |x_{2}-x_{1}|. \] Therefore, the area under the graph of \(f\circ g\) on the interval \([x_{1},x_{2}]\) is less than the area under the graph of \(f\) on the interval \([g(x_{1}),g(x_{2})]\). Again multiplying \(f\circ g\) by \(g'(x)\) makes the area under \((f\circ g)g’\) on \([x_{1},x_{2}]\) equal to the area under \(f\) on \([g(x_{1}),g(x_{2})]\). Notice that \(g(-1)=h(1)=1\) and \(g(2)=h(2)=4\) (see Figure 8). Unlike \(h\), the function \(g\) is not one-to-one. If we integrate \(f(g(x))g'(x)\) on the domain of \(g\) and \(f(h(x))h'(x)\) on the domain of \(h\), both integrals will be equal to \(\int _{1}^{4}f(u)du\): \[ \int _{-1}^{2}f(g(x))g'(x)dx=\int_{-1}^{2}(1)(2x)dx=x^{2}\big |_{-1}^{2}=2^{2}-(-1)^{2}=3, \] \[ \int _{1}^{2}f(h(x))h'(x)dx=\int _{1}^{2}(1)(2x)dx=x^{2}\big |_{1}^{2}=2^{2}-1^{2}=3. \] We notice that \[ Dom(g)=Dom(h)\cup [-1,1], \]
the function \(g\) is decreasing on \([-1,0]\) and increasing on \([0,1]\), and \[ \int _{-1}^{0}f(g(x))g'(x)dx=-\int _{0}^{1}f(g(x))g'(x)dx \] \[ \Rightarrow \int _{-1}^{1}f(g(x))g'(x)dx=\int _{-1}^{0}f(g(x))g'(x)dx+\int _{0}^{1}f(g(x))g'(x)dx=0. \] Therefore, \[ \int _{-1}^{2}f(g(x))g'(x)dx=\overset{0}{\cancel{\int _{-1}^{1}f(g(x))g'(x)dx}}+\underbrace{\int _{1}^{2}f(g(x))g'(x)dx}_{g(x)=h(x)\text{ on }[1,2]}. \] In general, in Theorem 1, if \(g\) is not a one-to-one function, then the value of the integral on the intervals in which \(g\) is increasing and on the intervals in which \(g\) is decreasing balance each other such that the total value of the integral of \(f(g(x))g'(x)\) on the entire interval \([a,b]\) will be equal to \(\displaystyle \int _{g(a)}^{g(b)}f(u)du\). See Figure 9. Show the introductory explanations
Hide the introductory explanations
before.
the limits of integration in one step, as stated in the following theorem:Show the proof
Hide the proof
Proof. Let \(F\) be an integral (or an antiderivative) of \(f\). Then it follows from the chain rule that \begin{align*} \frac{d}{dx}F(g(x)) & =F'(g(x))g'(x)\\ & =f(g(x))g'(x); \end{align*}
Geometric Interpretation
Show the geometric interpretation
Hide the geometric interpretation
(a)
\(\int _{0}^{1}f(g(x))g'(x)dx\) cancel
out \(\int _{-1}^{0}f(g(x))g'(x)dx\)(b)
(c)
Figure 8: Both \(g\) and \(h\) map the endpoints of their domains to \(u=1\) and \(u=4\).