]> Partial derivatives

Math Animated™
Mathematical Introduction for Physics and Engineering
by Samuel Dagan (Copyright © 2007-2020)

Chapter 3: Many Variables; Section 1: Differentiation; page 2

Home | Table of Contents | A-Z Index | Help

Previous topic: page 1 Functions of Many Variables

Next topic: page 3 Derivatives of Higher Order


Partial Derivatives

Definition

We define the partial derivative of a function

y=f( x 1 ,.., x k ,.., x n ) (3.1.2.1)

with respect to the variable xk, as the limit

y x k = f x k = lim Δ x k 0 f( x 1 ,.., x k1 , x k +Δ x k , x k+1 ,.., x n )f( x 1 ,.., x k ,.., x n ) Δ x k (3.1.2.2)

In other words, this is the regular derivative of the function with respect to the variable xk, while all the other variables are kept constant. The notation of rounded " " instead of the common "d" is used, in order to emphasize the partiality of the derivative. As it will be demonstrated, the partial derivatives differ from the regular one by their properties, and therefore the use of a different notation is essential. From the definition it follows that the number of the partial derivatives of a function is equal to the number of the independent variables. Each one of the partial derivatives is a function of these variables, or of part of them.

As an example

{ forf( x,y,z )=lnyexpx+az( y+b ) witha,b=constants f x =lnyexpx f y = expx y +az f z =a( y+b ) } (3.1.2.3)

Unlike the regular derivatives, the continuity of a function is not a necessary condition for the existence of the partial derivatives. This is equivalent to the statement that the existence of all the partial derivatives alone do not imply the continuity of the function.

This is illustrated in the example of the function of two variables

z= x 2 y 2 x 2 + y 2 (3.1.2.4)

which has a point of discontinuity at  x = y = 0 , as it was shown by the analysis of (3.1.1.15) on the previous page. The corresponding partial derivatives are

z x = 2x[ ( x 2 + y 2 )( x 2 y 2 ) ] ( x 2 + y 2 ) 2 = 4x y 2 ( x 2 + y 2 ) 2 z y = 2y[ ( x 2 + y 2 )+( x 2 y 2 ) ] ( x 2 + y 2 ) 2 = 4 x 2 y ( x 2 + y 2 ) 2 } (3.1.2.5)

Both partial derivatives exist and vanish for x→0 and y→0 , independently of the limit's order of variables, in spite of the discontinuity of the function. Although this seems a bit odd, it does not contradict the previously stated property.

On the other hand, there exists a more restrictive requirement for the partial derivatives: the continuity of all the partial derivatives implies the continuity of the function.

The application of this property for the function (3.1.2.4) means that we should check, if in this example, the partial derivatives (3.1.2.5) are continuous at the origin x=y=0 . If they are, we'll have a real contradiction. This will be done by repeating the procedure already applied on the previous page, namely we'll approach the origin from different directions, by the aid of the lines (3.1.1.17): y=kx, where k is a non-vanishing finite constant. This yields

lim x0 z x = lim x0 4 k 2 x 3 ( 1+ k 2 ) 2 x 4 ± lim x0 z x = lim x0 4k x 3 ( 1+ k 2 ) 2 x 4 } (3.1.2.6)

that contradicts the previously found result of vanishing derivatives (3.1.2.5) at the origin. Therefore the partial derivatives are not continuous at the origin, and the function (3.1.2.4) itself does not have to be continuous there either.

Total differential

In order to partially differentiate the function (3.1.2.1) with respect to xk, we used in (3.1.2.2) a partial increment of the function, where only xk was incremented, and the rest of the independent variables were not. We are going to define a total increment of the function (3.1.2.1), subject to the increments of all the independent variables:

Δy=f( x 1 +Δ x 1 ,..., x n +Δ x n )f( x 1 ,..., x n ) (3.1.2.7)

For the sake of simplicity, we'll use a function of two variables

z=f( x,y )yielding Δz=f( x+Δx,y+Δy )f( x,y ) } (3.1.2.8)

which can be rewritten as

Δz={ f( x+Δx,y+Δy )f( x,y+Δy ) +f( x,y+Δy )f( x,y )= ={ f( x+Δx,y+Δy )f( x,y+Δy ) Δx Δx+ + f( x,y+Δy )f( x,y ) Δy Δy } (3.1.2.9)

For Δx→0 and Δy→0 one obtains the following symbolic expression for the total differential of z :

dz= f x dx+ f y dy (3.1.2.10)

The name "total" was chosen, since the differential was obtained from a total increment. By the same procedure, the total differential of the function (3.1.2.1) is

dy= k=1 n f x k d x k (3.1.2.11)

The total differential has the same properties as the differential we know from the functions of one variable. It means for example that a division by another differential yields the appropriate derivative. For this reason the adjective "total" is omitted in some of the text-books.

As a physical example, we'll take the main law of thermodynamics for a system of fluids, expressed as a total differential. The internal energy of the system U can be written as a function of two variables: entropy S - related to the heat of the system and V - the volume. According to the definition (3.1.2.10), the total differential of U is

dU= ( U S ) V dS+ ( U V ) S dV (3.1.2.12)

where the increment of the total energy is due to the increment of the heat, and that of the mechanical work.

The main law of thermodynamics states that

dU=TdSpdV (3.1.2.13)

meaning that

T   ( temperature )= ( U S ) V p   ( pressure )= ( U V ) S } (3.1.2.14)

Function of functions

A function of functions with one variable is defined by

y=F[ f 1 ( x ),...., f n ( x ) ] (3.1.2.15)

The derivative of (3.1.2.15) was not studied so far, and we are now ready to deal with this problem. Since F is a function of the n variables fk, one obtains from (3.1.2.11)

dy= k=1 n F f k d f k (3.1.2.16)

which after division by dx yields the following chain rule for the derivative:

dy dx = k=1 n F f k d f k dx (3.1.2.17)

The following example shows the use of (3.1.2.17)

y=F[ f 1 ( x ), f 2 ( x ) ]= f 1 ( x ) f 2 ( x ) dy dx = F f 1 d f 1 dx + F f 2 d f 2 dx = = f 2 f 1 ( f 2 1 ) d f 1 dx + f 1 f 2 ln f 1 d f 2 dx = = f 1 ( f 2 1 ) ( f 2 d f 1 dx + f 1 ln f 1 d f 2 dx ) } (3.1.2.18)

For instance, one obtains from (3.1.2.18)

d dx x x1 = x x2 ( x1+xlnx )= x x1 ( x1 x +lnx ) (3.1.2.19)

the same result as (1.3.3.13), that we obtained indirectly in chapter 1 by differentiating the logarithm of the function.

Implicit functions

We already know from chapter1 how to obtain the derivative of an implicit function of one variable. The partial differentiation offer another way of calculating this derivative. By definition, an implicit function is presented by the general expression

F[ x,y( x ) ]=0 (3.1.2.20)

where y is the dependent variable, considered as a function of x.

In (3.1.2.20) F can be considered as a function of two variables, therefore according to the chain rule (3.1.2.16), one can write

dF dx = F x dx dx + F y dy dx = F x + F y dy dx =0 (3.1.2.21)

which yields

dy dx = F x F y (3.1.2.22)

Notice that (3.1.2.22) does not fulfill the arithmetic rules of the differentials. This should not be surprising, since on the right-hand side the ratio contains partial differentials.

As an example, let's obtain the derivative of the "Folium of Descartes" defined by

F( x,y )= x 3 3axy+ y 3 =0 (3.1.2.23)

The use of (3.1.2.22) yields in this case

dy dx = F x F y = 3 x 2 3ay 3 y 2 3ax = ay x 2 y 2 ax (3.1.2.24)

in agreement with (1.3.3.2).

An implicit function of n variables have the general form of

F[ x 1 ,...., x n ,y( x 1 ,...., x n ) ]=0 (3.1.2.25)

By taking into account that a partial derivative with respect to one of the variables is equivalent to a regular derivative, with constant values of the rest of the variables, we obtain similarly to (3.1.2.21)

F x k + F y y x k =0yielding y x k = F x k F y fork=1,....,n } (3.1.2.26)

For example, the partial derivatives of the implicit function z with respect to the variables x and y will be calculated for the following case:

F( x,y,z )= x 3 y+cosz z 2 =0 z x = F x F z = 3 x 2 y sinz+2z z y = F y F z = x 3 sinz+2z } (3.1.2.27)

First approximation

As known from chapter 1, the tangent line at a point x0 is used, as a first approximation (y1) of the function  y = y(x) , around this point:

y 1 = y 0 + ( dy dx ) 0 ( x x 0 ) (3.1.2.28)

where the index 0, denotes at point x0. For an implicit function (3.1.2.20), one can use the relation (3.1.2.22), in order to obtain the tangent line in a symmetric form:

( F y ) 0 ( y y 0 )+ ( F x ) 0 ( x x 0 )=0 (3.1.2.29)

The first approximation of a function of n variables

y=y( x 1 ,...., x n ) (3.1.2.30)

around the point

( x 1 ,...., x n ) 0 =( x 1,0 ,...., x n,0 ) (3.1.2.31)

has the form of:

y 1 = y 0 + k=1 n ( y x k ) 0 ( x k x k,0 ) (3.1.2.32)

since for the point (3.1.2.31)

y 1 = y 0 and y 1 x k = ( y x k ) 0 k=1,...,n } (3.1.2.33)

As seen from (3.1.2.32), the first approximation itself is in general a function of n variables. Care should be taken that the function and the derivatives are continuous at the point (3.1.2.31).

In the case of a function of two variables,

z=z( x,y ) (3.1.2.34)

the first approximation is the tangent plane:

z 1 = z 0 + ( z x ) 0 ( x x 0 )+ ( z y ) 0 ( y y 0 ) (3.1.2.35)

It is called "tangent", since its derivatives are equal to these of the function at the point (x, y, z)0, but is it a plane?

A plane in a three dimensional space is defined as any linear combination of the Cartesian coordinates:

Ax+By+Cz=D (3.1.2.36)

where at least one of the constant factors of the coordinates (A,B,C) does not vanish. Indeed any intersection of (3.1.2.36) with the planes of constant x, y or z, yields a straight line, as can be easily seen. From the definition (3.1.2.36) it follows that (3.1.2.35) is a plane.

According to (3.1.2.25) and (3.1.2.26), for an implicit function of two variables

F[ x,y,z( x,y ) ]=0 { z x = F x F z z y = F y F z } (3.1.2.37)

With the help of (3.1.2.37), one can transform the expression of the tangent plane (3.1.2.35) in the following symmetric form:

( F z ) 0 ( z z 0 )+ ( F y ) 0 ( y y 0 )+ ( F x ) 0 ( x x 0 )=0 (3.1.2.38)

A point of a function of many variables with vanishing and continuous derivatives is called a stationary point. As in the case of a function of a single variable, a stationary point indicates that for an infinitesimal deviation from this point, the function remains constant. In the case of two variables (3.1.2.34), the tangent plane at a stationary point is

z= z 0 where    z 0    is   z   at that point (3.1.2.39)

as can be seen from (3.1.2.38).

The following example will be used to illustrate the tangent plane.

z=3[ 1 ( x 4 ) 2 ( y 3 ) 2 ] (3.1.2.40)

The section of this function with a constant z plane is

( x 4 ) 2 + ( y 3 ) 2 =1 z c 3 with z c =constant (3.1.2.41)

which is an ellipse for zc<3, a point for zc=3 and non existing for zc>3. Therefore

x=y=0 z=3 }isapointofmaximum (3.1.2.42)

The sections with constant x or y values are parabolas:

z= y 2 3 +3[ 1 ( x c 4 ) 2 ] x c =constant z= 3 16 x 2 +[ 3 y c 2 3 ]    y c =constant } (3.1.2.43)

The shape is similar to a paraboloid of revolution, except that instead of circles there are ellipses, and therefore it is called an elliptic paraboloid.

The tangent plane at point (x,y,z)0, according to (3.1.2.38) is:

1 3 ( z z 0 )+ 2 y 0 9 ( y y 0 )+ x 0 8 ( x x 0 )=0 (3.1.2.44)

For the maximum (3.1.2.42), the tangent plane becomes

z=3 (3.1.2.45)

which corresponds to a stationary point (3.1.2.39).

This example (3.1.2.39) and some display of the tangent planes are shown in Fig. Elliptic paraboloid.

Exercises

Exercise 1. Calculate:


  1. d dx v( x ) w( x ) f( u )du

  2. d dx x exp( x ) x expx f( u ) du

Exercise 2. The function u of three (3) variables is defined as:

u( x,y,z )= x 2 y+ y 2 z z 2 x
  1. Calculate the partial derivatives

    u x , u y , u z

  2. What is the value of the function at the point

    ( x,y,z ) 0 =( 1,1,1 )

  3. Obtain the first approximation of the function, about this point!

Exercise 3. For the implicit function

F( x,y )=sin( xy )ln( x 2 + y 2 )=0
  1. Calculate the partial derivatives

    F x and F y

  2. What is the derivative

    dy dx

  3. Obtain the symmetric form of the tangent line at point (x, y)0 !
  4. What are the two values of y for x=0 ?
  5. What are the tangent lines at x0=0 ?

Exercise 4. The function z of two variables x and y is presented in the implicit form:

F( x,y,z )=x z 5 +y z 2 x y 2 z1=0
  1. Calculate F z , F y and F x

  2. What are z x and z y

  3. What is the value of z for the point (x, y)=(1, 0) ?
  4. Obtain the tangent plane at

    ( x,y ) 0 =( 1,0 )

Previous topic: page 1 Functions of Many Variables

Next topic: page 3 Derivatives of Higher Order

Home | Table of Contents | A-Z Index | Help