The Chain Rule

In an introductory calculus course, it is shown that the chain rule is instrumental in both the theory and the applications of derivatives. In this section, we generalize the all-important chain rule to functions of two or more variables.

Let's suppose that w = f( x,y) is differentiable at (p,q) , and then let's suppose that x = h( t) , y = k(t) where h and k are differentiable at t₀ and (h( t₀) ,k( t₀) ) = ( p,q) . Then definition 3.2 in section 3 says that

Dw = f_x( p,q) Dx+f_y( p,q) Dy + o₁(Dx,Dy)Dx + o₂(Dx,Dy)Dx

so that if we divide by Dt and let Dt approach 0, then we obtain

lim
DtŽ 0
Dw

Dt
= f_x( p,q)
lim
DtŽ 0
Dx

Dt
+f_y( p,q)
lim
DtŽ 0
Dy

Dt
+
lim
DtŽ 0
o₁(Dx,Dy) Dx

Dt
+o₂(Dx,Dy) Dy

Dt

(1)
In order for dx/dt to exist, it is necessary that Dx approaches 0 as Dt approaches 0. Likewise, Dy approaches 0 as Dt approaches 0. Since o₁(Dx,Dy) and o₂(Dx,Dy) approach 0 as Dx and Dy approach 0, the last limit in (1) becomes

lim
DtŽ 0
o₁(Dx,Dy) Dx

Dt
+o₂(Dx,Dy) Dy

Dt
= lim
DtŽ 0
o₁(Dx,Dy) dx

dt
+ lim
DtŽ 0
o₂(Dx,Dy) dy

dt
= 0

Consequently, (1) reduces to

dw

dt
= f_x( p,q) dx

dt
+f_y( p,q) dy

dt

(2)
which is known as the chain rule for functions of 2 variables. Moreover, the chain rule can also be written in the form

dw

dt
= śf

śx
dx

dt
+ śf

śy
dy

dt

EXAMPLE 1    Find dw/dt given that w = x²+y³ and that x = cot( t) , y = sin( t) .
Solution: The first partial derivatives of w = x²+y³ are

św

śx
= 2x,        św

śy
= 3y²

As a result, the chain rule says that

dw

dt
= 2x dx

dt
+3y² dy

dt

and since dx/dt = -csc²( t) and dy/dt = cos( t), we have

dw

dt

=

2cot( t) · [ -csc²(t) ] +3[ sin( t) ]²cos(t)

=

-2cot( t) csc²( t) +3sin²(t) cos( t)


EXAMPLE 2    Find dw/dt given that w = cos( xy) and that x = pe^t and y = e^-t.
Solution: The first partial derivatives are

św

śx
= -ysin( xy) ,        św

śy
= -xsin( xy)

As a result, the chain rule says that

dw

dt
= -ysin( xy) dx

dt
-xsin( xy) dy

dt

and since dx/dt = pe^t and dy/dt = -e^-t, we have

dw

dt

=

-e^-tsin( pe^te^-t) ( pe^t) -pe^tsin( pe^te^-t) (-e^-t)

=

-pe^te^-tsin( pe^te^-t) +pe^te^-tsin( pe^te^-t)

=

-psin( p) +psin( p)

=

0

Check your Reading: Substitute x = pe^t and y = e^-t into w = cos( xy). Why would this also imply that dw/dt = 0?