An important application of derivatives is solving optimization problems. We have learned how to determine the maximum or minimum values of functions of a single variable; in this section, we learn how to determine the extreme values of functions of two or more variables. The problem has features similar to the single-variable case, but it is often more complicated. So first let’s review what we already know about the maxima and minima of functions of one variable.
Review of Maxima and Minima of Single Variable Functions
Consider a function $f$ of a single variable $x$:
We say $f$ has a maximum (respectively minimum) if there exists $x_0$ in its domain such that for all $x$ in the domain of $f$, $f(x) \le f(x_0)$ [respectively $f(x) \ge f(x_0)$]. The value $f(x_0)$ is called the maximum value (the minimum value) of $f$. The word “extremum” refers to either a maximum or a minimum.
We say $f$ has a relative maximum (respectively relative minimum) at $x_0$ if $f(x) \le f(x_0)$ [respectively $f(x) \ge f(x_0)$] for all $x$ in the domain of $f$ that are sufficiently close to $x_0$.
If $f$ is continuous on a closed interval $[a,b]$, it takes on both its absolute maximum value and its absolute minimum value on that interval. If the interval is not closed or if $f$ is not continuous on that interval, there is no guarantee that the function takes on its extreme values on that interval.
If $f(x_0)$ is a relative or absolute extreme value of $f$, then the point $x_0$ is one of three kinds of points:
the point is a stationary point; that is, $f'(x_0) = 0$,
the point is a rough point; that is, $f'(x_0)$ does not exist, or
the point is one of the endpoints of the domain of $f$.
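If $f''(x) > 0$ for every $x$ in an interval $I$, then the graph of $f$ is concave up on $I$.
If $f''(x) < 0$ for every $x$ in an interval $I$, then the graph of $f$ is concave down on $I$.
If $f'(x_0) = 0$ and $f''(x_0) > 0$, then $f$ has a local minimum at $x_0$.
If $f'(x_0) = 0$ and $f''(x_0) < 0$, then $f$ has a local maximum at $x_0$.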
If $f'(x_0) = 0$ and $f''(x_0) = 0$, more information is required to conclude whether or not $f$ has a local extremum at $x_0$. The additional information is the behavior of the higher order derivatives. The complete theorem is as follows. Suppose $f'(x_0) = f''(x_0) = \cdots = f^{(n-1)}(x_0) = 0$ and $f^{(n)}(x_0) \neq 0$.
If $n$ is even and $f^{(n)}(x_0) > 0$, then $f$ has a local minimum at $x_0$.
If $n$ is even and $f^{(n)}(x_0) < 0$, then $f$ has a local maximum at $x_0$.
If $n$ is odd, then $f$ does not have an extremum at $x_0$.
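The higher-order derivative test above can be tried out on polynomials, whose derivatives are easy to compute exactly. The following is a minimal sketch, not part of the original text; the function names and the coefficient-list representation are assumptions made for illustration.

```python
def poly_derivative(coeffs):
    """Differentiate a polynomial given as [a0, a1, a2, ...] (a_k multiplies x**k)."""
    return [k * a for k, a in enumerate(coeffs)][1:]


def poly_eval(coeffs, x):
    """Evaluate the polynomial at x using Horner's rule."""
    result = 0
    for a in reversed(coeffs):
        result = result * x + a
    return result


def classify_stationary_point(coeffs, x0):
    """Apply the higher-order derivative test to a polynomial at x0."""
    deriv = poly_derivative(coeffs)
    if poly_eval(deriv, x0) != 0:
        return "not stationary"
    n = 1  # order of the derivative currently stored in deriv
    while deriv:
        value = poly_eval(deriv, x0)
        if value != 0:
            # First nonvanishing derivative: its order n decides the outcome.
            if n % 2 == 1:
                return "no extremum"
            return "local minimum" if value > 0 else "local maximum"
        deriv = poly_derivative(deriv)
        n += 1
    return "constant function"


print(classify_stationary_point([0, 0, 0, 0, 1], 0))  # x**4 at 0
print(classify_stationary_point([0, 0, 0, 1], 0))     # x**3 at 0
```

With integer coefficients and an integer point the comparisons with zero are exact; for floating-point input a tolerance would be needed.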
Definitions of Maxima and Minima for Multivariable Functions
Now we are ready to talk about finding maxima and minima for functions of two or more variables.
Consider a function $z = f(x,y)$ defined on a set $U$ in the $xy$-plane. We say $f$ has a maximum at the point $(x_0,y_0)$ of its domain if $f(x,y) \le f(x_0,y_0)$ for all $(x,y)$ in $U$. An absolute maximum corresponds to a highest point on the surface $z = f(x,y)$. We say $f$ has a relative maximum (or local maximum) at $(x_0,y_0)$ if $f(x,y) \le f(x_0,y_0)$ for all points $(x,y)$ of $U$ that are in a sufficiently small neighborhood of $(x_0,y_0)$. The value at a relative maximum does not have to be the greatest value of $f$ on all of $U$; it only has to be the greatest value of $f$ if we restrict ourselves to points that are sufficiently close to $(x_0,y_0)$. The definitions of minimum (or more specifically absolute minimum) and relative minimum are analogous. Consider Fig. 1.
Figure 1.
In a similar way we can define the maximum and minimum points for functions of three or more variables.
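Definition 1. Let $f: U \subseteq \mathbb{R}^n \to \mathbb{R}$.
We say $f$ has a maximum (or more specifically an absolute maximum) at the point $\mathbf{x}_0 \in U$ if $f(\mathbf{x}) \le f(\mathbf{x}_0)$ for all $\mathbf{x} \in U$.
We say $f$ has a relative maximum (or local maximum) at the point $\mathbf{x}_0 \in U$ if there is a neighborhood $V$ of $\mathbf{x}_0$ such that $f(\mathbf{x}) \le f(\mathbf{x}_0)$ for every $\mathbf{x} \in V \cap U$.
We say $f$ has a minimum (or more specifically an absolute minimum) at the point $\mathbf{x}_0 \in U$ if $f(\mathbf{x}) \ge f(\mathbf{x}_0)$ for all points $\mathbf{x} \in U$.
We say $f$ has a relative minimum (or local minimum) at the point $\mathbf{x}_0 \in U$ if there is a neighborhood $V$ of $\mathbf{x}_0$ such that $f(\mathbf{x}) \ge f(\mathbf{x}_0)$ for every $\mathbf{x} \in V \cap U$.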
A point which is either a (relative or absolute) maximum or minimum is called a (relative or absolute) extremum.
Every absolute maximum (respectively minimum) is also a relative maximum (minimum).
Bounded and Unbounded Sets
A set in $\mathbb{R}$ is bounded if it is contained in an interval of finite length, and is unbounded otherwise. A set in $\mathbb{R}^2$ is bounded if the entire set can be contained within a disk of finite radius, and is called unbounded if there is no disk that contains all the points of the set. Similarly, a set in $\mathbb{R}^3$ is bounded if the entire set can be contained within a sphere of finite radius, and is unbounded otherwise. In general, a set in $\mathbb{R}^n$ is bounded if all the points of the set are contained inside a ball of finite radius.
A bounded set in
An unbounded set (first quadrant) in
A bounded set in
The Extreme Value Theorem
The following theorem assures us that a continuous function on a closed and bounded set takes on its extreme values.
In the conditional statement “if $P$ then $Q$” or “$P$ implies $Q$” (written as $P \Rightarrow Q$), we say $P$ is a sufficient condition for $Q$ and $Q$ is a necessary condition for $P$. Also note that “if $P$ then $Q$” is equivalent to “if $Q$ is false, then $P$ is false.”
Theorem 1. (Sufficient Conditions for Extreme Values) If $U \subseteq \mathbb{R}^n$ is a closed and bounded set, and $f: U \to \mathbb{R}$ is continuous in $U$, then there is a point in $U$ at which $f$ attains its (absolute) maximum and there is a point in $U$ at which $f$ attains its (absolute) minimum.
Finding Extrema
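Calculus gives us the necessary conditions for an interior point to be a relative extremum. Let $f$ be a function of two variables $x$ and $y$ and let $(x_0,y_0)$ be an interior point of the domain of $f$. If $f$ has a relative maximum or minimum at $(x_0,y_0)$ and if $f_x(x_0,y_0)$ and $f_y(x_0,y_0)$ exist, then
$$f_x(x_0,y_0) = 0 \quad \text{and} \quad f_y(x_0,y_0) = 0.$$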
Figure 2. is an interior point, and is a boundary point.
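If we define a single-variable function $g(x) = f(x,y_0)$ (see Fig. 3(a)), then $g'(x_0) = f_x(x_0,y_0)$. If $(x_0,y_0)$ is a relative maximum point, then for all $(x,y)$ in the domain of $f$ that are in a sufficiently small neighborhood of $(x_0,y_0)$, $f(x,y) \le f(x_0,y_0)$. Consequently, in that neighborhood $g(x) = f(x,y_0) \le f(x_0,y_0) = g(x_0)$. This means $g$ has a relative maximum at $x_0$. It follows from single variable calculus that $g'(x_0) = 0$; that is, $f_x(x_0,y_0) = 0$ (Fig. 3(b)). The proof that $f_y(x_0,y_0) = 0$ is analogous.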
Figure 3.
At every relative extremum in the interior of the domain of a differentiable function we have
for and all and . Geometrically, means that the tangent plane at the point is horizontal (that is, perpendicular to the $z$-axis). See Fig. 4.
Figure 4.
We can easily generalize this result for functions of any number of independent variables. The proof of the following theorem is essentially the same as we discussed here, but is expressed in a different way.
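Theorem 2. Let $f: U \subseteq \mathbb{R}^n \to \mathbb{R}$ and let $\mathbf{x}_0$ be an interior point of $U$. If $f$ has a relative maximum or minimum at $\mathbf{x}_0$ and if the partial derivatives of $f$ at $\mathbf{x}_0$ exist, then
$$\nabla f(\mathbf{x}_0) = \mathbf{0}; \quad \text{that is,} \quad \frac{\partial f}{\partial x_i}(\mathbf{x}_0) = 0 \quad (i = 1, \dots, n).$$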
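Suppose $f$ has a relative maximum at $\mathbf{x}_0$. We need to show $\frac{\partial f}{\partial x_i}(\mathbf{x}_0) = 0$ for $i = 1, \dots, n$. By the definition of a partial derivative
$$\frac{\partial f}{\partial x_i}(\mathbf{x}_0) = \lim_{h \to 0} \frac{f(\mathbf{x}_0 + h\,\mathbf{e}_i) - f(\mathbf{x}_0)}{h},$$
where as usual $\mathbf{e}_i$ is the unit vector all of whose components are zero, except the $i$-th component, which is one.
Because $f$ has a relative maximum at $\mathbf{x}_0$, by Definition 1 we have $f(\mathbf{x}_0 + h\,\mathbf{e}_i) \le f(\mathbf{x}_0)$ whenever $|h|$ is small enough, so that $\mathbf{x}_0 + h\,\mathbf{e}_i$ is sufficiently close to $\mathbf{x}_0$. If $h \to 0^+$; that is, if $h$ approaches 0 from the right, then $\frac{f(\mathbf{x}_0 + h\,\mathbf{e}_i) - f(\mathbf{x}_0)}{h} \le 0$; therefore:
$$\frac{\partial f}{\partial x_i}(\mathbf{x}_0) = \lim_{h \to 0^+} \frac{f(\mathbf{x}_0 + h\,\mathbf{e}_i) - f(\mathbf{x}_0)}{h} \le 0. \tag{*}$$
If $h \to 0^-$; that is, if $h$ approaches 0 from the left, then $\frac{f(\mathbf{x}_0 + h\,\mathbf{e}_i) - f(\mathbf{x}_0)}{h} \ge 0$; therefore:
$$\frac{\partial f}{\partial x_i}(\mathbf{x}_0) = \lim_{h \to 0^-} \frac{f(\mathbf{x}_0 + h\,\mathbf{e}_i) - f(\mathbf{x}_0)}{h} \ge 0. \tag{**}$$
If $\frac{\partial f}{\partial x_i}(\mathbf{x}_0)$ exists, both inequalities (*) and (**) must hold. Therefore, we must have
$$\frac{\partial f}{\partial x_i}(\mathbf{x}_0) = 0.$$
The proof when $f$ has a relative minimum at $\mathbf{x}_0$ is very similar.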
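The sign argument in the proof can be observed numerically. The sketch below is an illustration only: the function `f`, its maximum point, and the helper `difference_quotient` are hypothetical choices, not taken from the text. At a relative maximum the difference quotient is nonpositive for small positive steps and nonnegative for small negative steps, so the only possible limit is zero.

```python
def f(x, y):
    # Hypothetical function with a relative (in fact absolute) maximum at (1, -2).
    return 4.0 - (x - 1.0) ** 2 - (y + 2.0) ** 2


def difference_quotient(func, point, i, h):
    """(func(point + h*e_i) - func(point)) / h for the i-th coordinate direction."""
    shifted = list(point)
    shifted[i] += h
    return (func(*shifted) - func(*point)) / h


x0 = (1.0, -2.0)
for h in (0.1, 0.01, 0.001):
    right = difference_quotient(f, x0, 0, h)   # h approaches 0 from the right
    left = difference_quotient(f, x0, 0, -h)   # h approaches 0 from the left
    print(h, right, left)
```

Both columns shrink toward zero from opposite sides as the step decreases, which is the numerical counterpart of combining the two one-sided inequalities.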
Definition 2. A point $\mathbf{x}_0$ is called a stationary point of $f$ if all first order partial derivatives of $f$ at that point exist and are equal to 0; that is, $\nabla f(\mathbf{x}_0) = \mathbf{0}$.
It follows from Theorem 2 that if a function has a relative extremum at an interior point of its domain and if its partial derivatives at that point exist, the point must be a stationary point of the function. Theorem 2 does not talk about the points where the partial derivatives do not exist and the points on the boundary. That is, it is possible for a function to assume its (relative or absolute) extreme value at a point where at least one of the first partial derivatives does not exist (Fig. 5(a)) or at a boundary point (Fig. 5(b)).
(a) Maximum occurs at a rough point.
(b) Extrema occur at two boundary points.
Figure 5.
A point at which at least one of the partial derivatives does not exist is called a rough point. In other words, at a rough point, the gradient does not exist.
Stationary points and rough points constitute critical points.
From the above discussion, we conclude that to determine the extreme values of a function, we should search for them among stationary points, rough points, and boundary points.
A function $f$ has a relative or absolute extremum at a point $\mathbf{x}_0$ of its domain only if $\mathbf{x}_0$ is one of the three types of points:
$\mathbf{x}_0$ is a stationary point of $f$; that is, $\nabla f(\mathbf{x}_0) = \mathbf{0}$,
$\mathbf{x}_0$ is a rough point of $f$; that is, $\nabla f(\mathbf{x}_0)$ does not exist, or
$\mathbf{x}_0$ is on the boundary of the domain of $f$.
Example 1
Find the extreme values of the function .
Solution
First we need to find the gradient of
The gradient exists everywhere; therefore, there is no rough point. Both components of the gradient are zero only at $(0,0)$; that is, $(0,0)$ is the only critical point of $f$. Because $f(0,0) = 0$ and the value of $f$ is always greater than or equal to zero, $f$ has an absolute minimum at the origin. The value of $f$ increases without bound as we move away from the origin, so there is no absolute maximum.
Graph of
Example 2
Find the extreme values of the function $f(x,y) = \sqrt{1 - x^2 - y^2}$.
Solution
The partial derivatives of $f$ are
$$f_x = \frac{-x}{\sqrt{1 - x^2 - y^2}}, \qquad f_y = \frac{-y}{\sqrt{1 - x^2 - y^2}},$$
and they become zero at the origin. The function has a maximum at the origin, because at all other points the quantity under the square root is less than 1, the value it takes at the origin.
The domain of $f$ is $1 - x^2 - y^2 \ge 0$ or $x^2 + y^2 \le 1$, which is a disk of radius 1 centered at the origin. On the boundary of the domain (i.e. $x^2 + y^2 = 1$), $f$ is zero. The absolute minimum value of $f$ occurs on the boundary because the minimum of the square root is zero.
Graph of
Example 3
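Find the extreme values of the function $f(x,y) = \sqrt{x^2 + y^2}$.
Solution
First we calculate the partial derivatives
$$f_x = \frac{x}{\sqrt{x^2 + y^2}}, \qquad f_y = \frac{y}{\sqrt{x^2 + y^2}}.$$
The partial derivatives are not both zero if $(x,y) \ne (0,0)$. Because $f_x(0,0)$ and $f_y(0,0)$ do not exist, $(0,0)$ is a rough point. The value of $f$ at $(0,0)$ is zero, which is the minimum value of the square root. Thus, $f$ has an absolute minimum at the origin. The graph of $f$ is a circular cone; $f$ increases without bound as $\sqrt{x^2 + y^2} \to \infty$ and does not have a maximum.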
Graph of
Example 4
Suppose we would like to build a distribution center to service three cities, and suppose the locations (the coordinates) of the three cities are $(x_1,y_1)$, $(x_2,y_2)$, and $(x_3,y_3)$. Locate the distribution center to minimize the cost
(a) if the cost is proportional to the sum of the squared distances from the center to the cities,
(b) if the cost is proportional to the sum of the distances from the center to the cities.
Solution
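Let $(x,y)$ be the coordinates of the distribution center. The distance from the distribution center to city $i$ is
$$d_i = \sqrt{(x - x_i)^2 + (y - y_i)^2}, \qquad i = 1, 2, 3.$$
For part (a) we wish to minimize
$$f(x,y) = d_1^2 + d_2^2 + d_3^2,$$
and for part (b), we wish to minimize
$$g(x,y) = d_1 + d_2 + d_3.$$
We will see that minimizing $f$ is easier than minimizing $g$.
(a) We have
$$f_x = 2(x - x_1) + 2(x - x_2) + 2(x - x_3), \qquad f_y = 2(y - y_1) + 2(y - y_2) + 2(y - y_3).$$
Solving $f_x = 0$ and $f_y = 0$ gives $x = \frac{x_1 + x_2 + x_3}{3}$ and $y = \frac{y_1 + y_2 + y_3}{3}$. Therefore, the best location to minimize the sum of the squared distances is the centroid of the triangle formed by the cities (Fig. 6).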
Figure 6.
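The centroid result of part (a) is easy to check numerically. The following minimal sketch uses made-up city coordinates (an assumption for illustration; the example itself keeps the coordinates symbolic) and verifies that the gradient of the sum of squared distances vanishes at the centroid and that nearby points cost more.

```python
# Hypothetical city coordinates, chosen only for illustration.
cities = [(0.0, 0.0), (6.0, 0.0), (3.0, 9.0)]


def sum_squared_distances(x, y):
    """Cost of part (a): sum of squared distances from (x, y) to the cities."""
    return sum((x - cx) ** 2 + (y - cy) ** 2 for cx, cy in cities)


def gradient(x, y):
    """Partial derivatives of the cost with respect to x and y."""
    fx = sum(2.0 * (x - cx) for cx, _ in cities)
    fy = sum(2.0 * (y - cy) for _, cy in cities)
    return fx, fy


centroid = (
    sum(cx for cx, _ in cities) / len(cities),
    sum(cy for _, cy in cities) / len(cities),
)
print("centroid:", centroid)
print("gradient at centroid:", gradient(*centroid))
print("cost at centroid:", sum_squared_distances(*centroid))
print("cost nearby:", sum_squared_distances(centroid[0] + 0.1, centroid[1]))
```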
(b) We have
$$g(x,y) = d_1 + d_2 + d_3 = \sum_{i=1}^{3} \sqrt{(x - x_i)^2 + (y - y_i)^2},$$
where $d_i$ is the distance from $(x,y)$ to city $i$.
Differentiating and equating to zero, we obtain
$$g_x = \sum_{i=1}^{3} \frac{x - x_i}{d_i} = 0, \qquad g_y = \sum_{i=1}^{3} \frac{y - y_i}{d_i} = 0.$$
These equations are complicated to solve. However, we note that
$$\nabla g = \nabla d_1 + \nabla d_2 + \nabla d_3,$$
where
$$\nabla d_i = \left( \frac{x - x_i}{d_i}, \; \frac{y - y_i}{d_i} \right).$$
Also we note that these gradient vectors are unit vectors, $|\nabla d_i| = 1$. Therefore, the vector sum of three unit vectors at the relative minimum has to be $\mathbf{0}$. The only way for this to happen is when the angles between them are $360^\circ/3 = 120^\circ$ (as (b) and (c) in the following figure). So the solution probably is where the roads from the cities to the center make $120^\circ$ angles. There are other possibilities. Each function $d_i$ has a rough point; that is, $g$ has three rough points. The graph of $d_i$ is a circular cone (see Example 3), which has been shifted $x_i$ units in the $x$-direction and $y_i$ units in the $y$-direction. Therefore, $d_i$ has a rough point at $(x_i,y_i)$. This means that if we build the distribution center in one of the cities, we might have minimized the cost.
If the triangle formed by the cities has an angle larger than $120^\circ$, then there is no point inside the triangle from which the roads to the cities make $120^\circ$ angles with one another. In this case, the best point is the city at the wide angle (see (c) in the following figure). Otherwise, a point inside the triangle is the solution.
(a)
(b)
(c)
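Because the equations of part (b) have no convenient closed-form solution, a numerical method is a natural way to explore them. The sketch below uses Weiszfeld's fixed-point iteration, which is a technique swapped in here for illustration and is not the method of the text; the city coordinates and all function names are hypothetical. Following the discussion above, the candidates compared are the stationary point found by the iteration and the three rough points (the cities themselves).

```python
import math


def total_distance(p, cities):
    """Cost of part (b): sum of distances from p to every city."""
    return sum(math.hypot(p[0] - cx, p[1] - cy) for cx, cy in cities)


def weiszfeld(cities, iterations=500):
    """Weiszfeld iteration started at the centroid of the cities."""
    x = sum(cx for cx, _ in cities) / len(cities)
    y = sum(cy for _, cy in cities) / len(cities)
    for _ in range(iterations):
        wx = wy = w = 0.0
        for cx, cy in cities:
            d = math.hypot(x - cx, y - cy)
            if d < 1e-12:
                return (cx, cy)  # reached a city, where the gradient is undefined
            wx += cx / d
            wy += cy / d
            w += 1.0 / d
        x, y = wx / w, wy / w
    return (x, y)


def best_location(cities):
    """Compare the rough points (the cities) with the iterated point."""
    candidates = list(cities) + [weiszfeld(cities)]
    return min(candidates, key=lambda p: total_distance(p, cities))


def angles_between_roads(p, cities):
    """Angles in degrees between each pair of roads from p to the cities."""
    dirs = []
    for cx, cy in cities:
        d = math.hypot(cx - p[0], cy - p[1])
        dirs.append(((cx - p[0]) / d, (cy - p[1]) / d))
    angles = []
    for i in range(len(dirs)):
        for j in range(i + 1, len(dirs)):
            dot = dirs[i][0] * dirs[j][0] + dirs[i][1] * dirs[j][1]
            angles.append(math.degrees(math.acos(max(-1.0, min(1.0, dot)))))
    return angles


# Hypothetical triangle with all angles below 120 degrees.
cities = [(0.0, 0.0), (4.0, 0.0), (1.0, 3.0)]
center = best_location(cities)
print(center, angles_between_roads(center, cities))

# Hypothetical triangle with a very wide angle at (0, 0).
wide = [(0.0, 0.0), (10.0, 0.0), (-10.0, 1.0)]
print(best_location(wide))
```

For the first triangle the three printed angles should all be close to 120 degrees; for the second, the comparison of candidates should select the city at the wide angle.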
Example 5
Find the extreme values of the function on the triangular region (see Fig. 6).
Figure 6: Region .
Solution
Because is closed and bounded, and the function is continuous in , takes on both its maximum and minimum values. Also because is a differentiable function, exists everywhere and there is no rough point. So to find the extreme values of , we should find the critical points and the extreme values on the boundary.
To find the critical point, we have to solve the following system of equations:
The value of at this point is . Now we find the extreme values on the boundary. Three line segments form the boundary. To look for an extremum on the line segment and , we set , so depends on alone. Then we find the best . Therefore, the critical point of is , which lies outside this line segment. We evaluate at the endpoints of this line segment, i.e. and . On the line segment and , to find the extreme value of , we set and find the best : Thus, has a critical point at . We evaluate at this point and the endpoints.
To follow the last segment of the boundary, i.e. and , we set and evaluate . Then we choose the best On this line segment when , , and .
To find the extreme values, we compare at all the points we found:
maximum
minimum
Therefore, the absolute maximum of in is and its absolute minimum in is , which occur at and , respectively.
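The procedure used in this example (interior critical points, then critical points on each boundary segment, then the corners, followed by a comparison of values) can be sketched in code. The function and the triangle below are hypothetical stand-ins chosen for illustration; they are not the function and region of Example 5. A coarse grid search is included only as a sanity check on the candidate list.

```python
def f(x, y):
    # Hypothetical objective, not the function from Example 5.
    return x * x + y * y - 2 * x - 2 * y


# Hypothetical closed triangular region: x >= 0, y >= 0, x + y <= 3.
candidates = {
    "interior critical point": (1.0, 1.0),      # 2x - 2 = 0 and 2y - 2 = 0
    "critical point on y = 0": (1.0, 0.0),      # d/dx (x^2 - 2x) = 0
    "critical point on x = 0": (0.0, 1.0),      # d/dy (y^2 - 2y) = 0
    "critical point on x + y = 3": (1.5, 1.5),  # d/dx (2x^2 - 6x + 3) = 0
    "vertex (0, 0)": (0.0, 0.0),
    "vertex (3, 0)": (3.0, 0.0),
    "vertex (0, 3)": (0.0, 3.0),
}

values = {name: f(*p) for name, p in candidates.items()}
abs_min = min(values.values())
abs_max = max(values.values())
for name, value in values.items():
    print(name, value)

# Sanity check: no grid point of the triangle beats the candidates.
n = 300
grid = [f(3 * i / n, 3 * j / n) for i in range(n + 1) for j in range(n + 1 - i)]
print(abs_min, min(grid), abs_max, max(grid))
```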
Saddle Points
Theorem 2 states necessary conditions (not sufficient ones). Not every critical point is a relative extremum. See the following example.
Example 6
Given $f(x,y) = xy$, find its extremum points.
Solution
Equating the gradient of $f$ to $\mathbf{0}$ tells us that the origin is a critical point. Does $f$ have a relative maximum or minimum at this point?
The answer is that the origin is neither a relative maximum nor a relative minimum. Any neighborhood of $(0,0)$, no matter how small, includes points from each quadrant; but $f(x,y) > 0$ for any point in the first (when $x > 0$, $y > 0$) and third (when $x < 0$, $y < 0$) quadrants and $f(x,y) < 0$ for any point in the second (when $x < 0$, $y > 0$) and fourth (when $x > 0$, $y < 0$) quadrants, while $f(0,0) = 0$. Such a point is called a saddle point.
(a) Graph of
(b) Contour plot of $f$. As we can see, every neighborhood of $(0,0)$ contains points with $f > 0$ and points with $f < 0$.
Figure 7.
A stationary point (a point where all the components of the gradient of the function vanish) that is neither a relative maximum nor a relative minimum is called a saddle point.
Saddle points are somewhat analogous to the points of inflection for functions of one variable.
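The quadrant argument for a saddle point can be checked directly. This minimal sketch assumes the function $f(x,y) = xy$, a standard example with exactly this sign pattern; the helper name `signs_near_origin` is hypothetical. However small the radius, the function takes both signs near the origin.

```python
def f(x, y):
    # Assumed example function with a saddle at the origin.
    return x * y


def signs_near_origin(r):
    """Value of f at one sample point in each quadrant, at distance scale r."""
    return {
        "first": f(r, r),
        "second": f(-r, r),
        "third": f(-r, -r),
        "fourth": f(r, -r),
    }


for r in (1e-1, 1e-3, 1e-6):
    print(r, signs_near_origin(r))
```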
Second Partials Test
To figure out whether a critical point is a maximum, a minimum, or a saddle point, we may graph the function. But what if we do not have access to a graphing application, or what if we need to classify the critical points of a function of three or more variables? Fortunately, we can systematically use what is called the “second partials test.” This test is similar to the second derivative test for functions of one variable. Because applying the second partials test to functions of three or more variables is rather laborious, here we restrict ourselves to functions of two variables.
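Theorem 3. (Sufficient Conditions for Extreme Values: Second Partials Test) Let $(x_0,y_0)$ be a critical point of $f$ and interior to the domain of $f$. Suppose $f$ has continuous second order partial derivatives in a neighborhood of $(x_0,y_0)$. Let
$$A = f_{xx}(x_0,y_0), \qquad B = f_{xy}(x_0,y_0), \qquad C = f_{yy}(x_0,y_0),$$
and let
$$D = AC - B^2.$$
Then we have:
(a) If $D > 0$ and $A > 0$, then $f$ has a relative minimum at $(x_0,y_0)$.
(b) If $D > 0$ and $A < 0$, then $f$ has a relative maximum at $(x_0,y_0)$.
(c) If $D < 0$, then $f$ has a saddle at $(x_0,y_0)$.
(d) If $D = 0$, this test gives no information.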
Let be a point (other than ) in a neighborhood of where the second order partial derivatives of are continuous. Using Taylor’s formula (Theorem 3), we can write
where , , and and are the second order partial derivatives of at some point for :
Because is a critical point, we have ; thus,
Case I: . It follows from that . Let’s define . We are given . Because , and are continuous in a neighborhood of , say , is also continuous in . Hence, there is a neighborhood of for some in which and has the same sign as . Now consider only points that lie in . In this case for also lies in ; hence and has the same sign as .
The expression inside the square brackets in (*) is a quadratic form of and . Because , by completing the squares, we may rewrite (*) as The expression inside the square brackets in (**) is the sum of two squares. Therefore, has the same sign as (or ). Therefore, if , then ; that is, has a relative minimum at . If , then ; that is, has a relative maximum at . This proves parts (a) and (b) of the above theorem.
Case II: . If , we consider in a neighborhood of where and has the same sign as . Again by completing the square, we can rewrite (*) as (**).
Because , the expression in square brackets in (**) is the difference of two squares. If we put and , has the same sign as (or, in turn, as ). Now if we put and , has the sign opposite to that of . Therefore has a saddle point at .
If but , again we can complete the square and rewrite (*) as Using the same argument as before (once putting and and once and ), we can show that has a saddle point at .
The last case we need to investigate is when , and . It follows from that . If we put , then Eq (*) becomes: Taking the limit, we have:
Therefore has the same sign as for sufficiently small . If we put , using the same argument, we can show that has the same sign as . This means we have shown has a saddle point at .
Case III: . Part (d) can be shown through examples.
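The four cases of the second partials test translate directly into a small decision function. The sketch below is an illustration under stated assumptions: the names `A`, `B`, `C` stand for the second order partial derivatives $f_{xx}$, $f_{xy}$, $f_{yy}$ at the critical point, `D = A*C - B*B`, and the sample function $x^3 + y^3 - 3xy$ is a hypothetical choice, not necessarily one used in the text.

```python
def second_partials_test(A, B, C):
    """Classify a critical point from A = f_xx, B = f_xy, C = f_yy there."""
    D = A * C - B * B
    if D > 0:
        # D > 0 forces A != 0, so only the sign of A matters here.
        return "relative minimum" if A > 0 else "relative maximum"
    if D < 0:
        return "saddle point"
    return "inconclusive"


# Hypothetical function f(x, y) = x**3 + y**3 - 3*x*y, with
# f_xx = 6x, f_xy = -3, f_yy = 6y and critical points (0, 0) and (1, 1).
for x, y in [(0, 0), (1, 1)]:
    print((x, y), second_partials_test(6 * x, -3, 6 * y))
```

When the function returns "inconclusive", the point has to be examined by other means, such as comparing function values near the critical point.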
Example 7
Given that is the function defined by , determine the relative extrema of if there are any.
Solution
To find the critical points, we need to find the first order partial derivatives of and set both of them equal to zero. We have:
It follows from the first equation that . If we plug into the second equation, we will arrive at , which gives us or . Thus, there are two critical points and . To determine if has relative extrema at these points, we use the second partials test. The second order partial derivatives are
At : Because
according to part (c) of Theorem 3, has a saddle point at .
At : Because
and , according to part (a) of Theorem 3, has a minimum at .
Figure 8. Contour plot of shows that has a relative minimum at and is a saddle point.
Example 8
Given that is the function defined by , determine the relative extrema of on the square and .
Solution
To find the critical points, first we calculate the first order partial derivatives and set them equal to zero:
These equations together imply that ; hence, .
If we plug (for ) into the first equation, we obtain We know .1 If we plug this identity into (*), we obtain: This is a quadratic in . Thus:
The equation has two solutions in the interval : or . Because lies between 0 and , we have to choose and get ; that is and are two critical points.
The solution of in the interval is . Again because lies between 0 and , we have to choose and find . The third critical point is
If we plug into the first equation, we obtain which is the same equation we had before.
Hence the critical points are and . For each of these points, we use the second partials test. We have:
At , we have:
Thus
Because and , we can conclude that has a maximum at .
At , we have:
Thus
Because and , we can conclude that has a minimum at .
At , we have:
Thus Because , the second partial test is inconclusive. In this example, to investigate whether or not has an extremum at , we can calculate the variation (increment) of near :
Now we can use Taylor’s expansion
To approximate , because and can be made as small as we desire, we just keep the dominant terms which are the terms with the lowest degree (i.e. the terms with the smallest power) and ignore the other terms as being negligible by comparison:
If and (for example and ), and if but (for example, and ), . Therefore, has a saddle point at .
The graph of is shown in Fig. 9.
(a) Graph of .
(b) Contour plot of .
Figure 9.
We mentioned that when , the second partial test is inconclusive, and the function may have a maximum, minimum, or a saddle point at the critical point. In the previous example, had a saddle point at a critical point where . Now consider the following functions: You can verify that the origin is a critical point of these functions; has a relative minimum, has a relative maximum, and has a saddle point at the origin. See Fig. 10.
Figure 10.
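When the second partials test is inconclusive, comparing function values around the critical point gives a quick numerical hint. The sketch below samples a small circle around the origin; the three quartic functions are hypothetical examples for which the test quantity vanishes at the origin, and they are not claimed to be the functions shown in the figure. Sampling a single circle is only suggestive, not a proof.

```python
import math


def classify_by_sampling(func, radius=1e-2, samples=360):
    """Compare func on a small circle around the origin with func(0, 0)."""
    base = func(0.0, 0.0)
    diffs = []
    for k in range(samples):
        t = 2.0 * math.pi * k / samples
        diffs.append(func(radius * math.cos(t), radius * math.sin(t)) - base)
    if any(d > 0 for d in diffs) and any(d < 0 for d in diffs):
        return "saddle point"
    if all(d > 0 for d in diffs):
        return "relative minimum"
    if all(d < 0 for d in diffs):
        return "relative maximum"
    return "undetermined"


# Hypothetical functions whose second order partial derivatives all vanish at (0, 0).
examples = {
    "x^4 + y^4": lambda x, y: x ** 4 + y ** 4,
    "-(x^4 + y^4)": lambda x, y: -(x ** 4 + y ** 4),
    "x^4 - y^4": lambda x, y: x ** 4 - y ** 4,
}
for name, func in examples.items():
    print(name, classify_by_sampling(func))
```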
1 Alternatively, we can transform the sum into a product using
The term “extreme value” refers to either a maximum value or a minimum value.
Maxima and minima are the Latin plurals of maximum and minimum, respectively.
A maximum (minimum) is also called an “absolute maximum” or “global maximum” (“absolute minimum” or “global minimum”), in contrast to a relative or local maximum (minimum).
Relative maxima and minima are sometimes called local maxima and local minima.
An interior point is a point that is not a boundary point of the domain.