Lagrange Multipliers

Constrained Optimization

In many applications, we must find the extrema of a function f(x,y) subject to a constraint, where a constraint is a curve of the form

g( x,y) = k

Such problems are called constrained optimization problems. Geometrically, constrained optimization is the problem of finding the points on the curve g( x,y) = k for which f( x,y) attains it largest and smallest values.

In particular, when g( x,y) = k is a closed curve parametrized by r( t) = á x( t) ,y(t) ñ for t in [ a,b] , then the absolute extrema of f( x,y) subject to g( x,y) = k must occur at the critical points of z( t) = f( x( t),y( t) ) .

Moreover, if g( x,y) = k and f( x,y) are both smooth, then the critical points of z( t) are the solutions to

= Ñf·v = 0

where v is the velocity of r( t) . That is, the critical points of z( t) occur when Ñf^v. And since Ñg^v, it follows that the extrema of f( x,y) subject to g( x,y) = k occur when Ñf is parallel to Ñg.

Let's view this in a different way. Let's suppose that as the levels of f( x,y) increase, short sections of level curves of f( x,y) form secant curves to g( x,y) = k

It follows that the highest level curve of f( x,y) intersecting g(x,y) = k must be tangent to the curve g( x,y) = k, which is possible only if their gradients are in the same direction, which is possible only if Ñf is a scalar multiple of Ñg

That is, Ñf is parallel to Ñg only if there is a number l for which

Ñf = lÑg

Thus, the extrema of f( x,y) subject to g( x,y) = k must occur at the points which are the solution to the system of equations

Ñf = lÑg, g( x,y) = k
(1)
We call (1) a Lagrange multiplier problem and we call l a Lagrange multiplier.

To solve a Lagrange multiplier problem, we usually solve for l in the first two equations in (1) to obtain

l = f_x

g_x
= f_y

g_y

Once l has been eliminated, we solve for x and y using the resulting 2 equations in 2 unknowns:

f_x

g_x
= f_y

g_y
, g( x,y) = k

Moreover, if the constraint is a closed curve, then the extrema must occur at the critical points. Thus, we simply test f( x,y) at the solutions to (1) in order to identify the extrema of f( x,y) over g( x,y) = k.

EXAMPLE 1    Find the extrema of f( x,y) = xy+14 subject to

x²+y² = 18

Solution: That is, we want to find the highest and lowest points on the surface z = xy+14 over the circle x² + y² = 18:
LiveGraphics3d Applet

If we let g( x,y) = x²+y², then the constraint is g(x,y) = 18. The gradients of f and g are respectively

Ñf = á y, x ñ         and        Ñg = á 2x, 2y ñ

As a result, Ñf = lÑg implies that y = l2x and x = l2y. Clearly, x = 0 only if y = 0, but ( 0,0) is not on the circle. Thus, x ¹ 0 and y ¹ 0, so that solving for l yields

l = y

2x
    and     l = x

2y
        Þ         y

2x
= x

2y

Cross-multiplying then yields 2y² = 2x², which is the same as y² = x². Thus, the constraint x²+y² = 18 becomes

x²+x² = 18,        x² = 9,        x = ±3

Moreover, y² = x² implies that either y = x or y = -x, so that the solutions to (1) are

( 3,3) ,  ( -3,3) ,  ( 3,-3) ,  (-3,-3)

However, f( 3,3) = f( -3,-3) = 23, while f(-3,3) = f( 3,-3) = 5. Thus, the maxima of f(x,y) = xy+14 over x²+y² = 18 occur at ( 3,3) and ( -3,-3) , while the minima of f( x,y) = xy+14 occur at ( -3,3) and ( 3,-3) .
LiveGraphics3d Applet

Check your Reading: What lines through the origin in the xy-plane contain the critical points in example 1?