|
|
|
|
|
The gradient is a first-order differential operator that maps scalar functions to vector fields. It is a generalization of the ordinary derivative, and as such conveys information about the rate of change of a function relative to small variations in the independent variables. The gradient of a function is customarily denoted by or by
.
Let
be continuously differentiable. The gradient of , denoted by , is defined by the property:
for all vectors
. |
(1) |
The middle dot is the dot product, and
is the directional derivative with respect to
.
If
are Euclidean coordinates, corresponding to the orthonormal basis
, then
 |
(2) |
The formula (2) is sometimes given as the definition of . We prefer to define by the coordinate-free formula (1) instead, because then the geometric interpretations (see below) become obvious, and (1) also indicates how we would go about calculating the gradient in other curvilinear coordinate systems. Formula (1) also makes it clear that the gradient is a
physical vector, depending only on the inner product structure of
, and not on the specific coordinate system used to calculate it.
There is the issue of whether the as defined by (1) exists; but this is proved easily enough, by substituting the concrete expression (2) and seeing that it satisfies (1).
The gradient can be considered to be a vector-valued differential operator, written as
or, in the context of Euclidean 3-space, as
where
are the unit vectors lying along the positive direction of the axes, respectively.
- (a)
- The direction of the vector
is the direction of the greatest positive change, or increase, in . The magnitude of is the magnitude of this increase. This follows immediately from (1):
where is the angle between and
. So among all unit directions
of change, if
is perpendicular to then the change
is zero; if
is parallel to then the change is maximized.
Similarly, is the direction of the greatest negative change, or decrease, in .
- (b)
- If
is the hypersurface in
defined by
then
is the normal to the hypersurface at the point . For
is the tangent space
to at , that is,
for all
, and by definition (1),
must be perpendicular to all
.
Note that
is equivalent to
. Consequently, also gives an orientation to the hypersurface .
For example, if
for
, is the -dimensional sphere of unit radius, embedded in
. Its normal,
, as one would expect, points outward radially.
- (c)
- As a simple case of (b), consider the surface
in
, with Cartesian coordinates . Think of this surface as describing a hill, with height . Then the direction of the gradient vector is the direction of steepest ascent of the hill, while its magnitude
is the slope or steepness in that direction.
If a ball is placed on the hill at a point , theoretically it should roll down the hill in the direction of the gradient vector
. This may be easily derived by considering the mechanical forces on the ball. The direction of
is, in fact, the projection to the xy-plane of an outward normal vector to the hill at ; the normal vector is involved because the movement of the ball arises from the normal force from the hill.
- (d)
- Suppose the surface
in (c) describes a bowl instead of a hill, and we place a marble at any point on this bowl. We would expect the marble to roll down to a local minimum point of . Since the marble should roll down in the direction of , we might hope that we can find local minima of a given function by following the path mapped out by the gradients . Formally, this method of finding local extrema (with some modifications) is called gradient descent.
- (e)
- If
is the potential function corresponding to a conservative physical force, then
is the corresponding force field.
Consequently, the gradient theorem,
simply gives the formula for the change in the potential energy when an object “does work” along a path in a conservative force field
.
It is obvious how (1) can be generalized to the setting of Riemannian manifolds: the dot product of
must be replaced by the Riemannian metric, and the analogue of
is the directional derivative
, for tangent vectors
on the Riemannian manifold. Thus for a smooth scalar-valued function on a Riemannian manifold,
![$\displaystyle \mathbf{X}= \operatorname{grad}f \: \Leftrightarrow \: df_p(\mathbf{v}) = \mathbf{v}[f] = \langle \mathbf{X}, \mathbf{v}\rangle_p\,.$ $\displaystyle \mathbf{X}= \operatorname{grad}f \: \Leftrightarrow \: df_p(\mathbf{v}) = \mathbf{v}[f] = \langle \mathbf{X}, \mathbf{v}\rangle_p\,.$](http://images.planetmath.org:8080/cache/objects/892/l2h/img89.png) |
(3) |
We can calculate
explicitly as follows. If are local coordinates on the manifold (not necessarily orthonormal), set
(the Einstein summation convention is being used). Let be the metric tensor, and be its inverse. Then from (3),
and taking inverses,
 |
(4) |
Definitions (1) and (3) exhibit as the vector field dual to the differential form . The isomorphism is given by applying the inner product or Riemannian metric. This isomorphism is, of course, linear; in particular it leads to the identity
 |
(5) |
which is the dual to the standard formula of differential one-forms:
Using (3) and (4), we have
 |
(6) |
So the isomorphism between vector fields and one-forms is expressed by changing the 's in (6) to 's, and vice versa. That is,
 |
(7) |
It is commonly said that this isomorphism is expressed by “raising and lowering the indices of a tensor field, using contractions with and ”.
Notice that when are orthonormal coordinates on
, equation (5) reduces to equation (2), because
(Kronecker delta).
The formulae presented in this section are useful in the Euclidean setting as well, for deriving the formulae for the gradient in various curvilinear coordinate systems.
Several properties of the one-dimensional derivative generalize to a multi-dimensional setting
 |
 |
Linearity |
|
 |
 |
Product rule |
|
 |
 |
Chain rule |
|
 |
 |
Another Chain rule |
|
The function is
. The notation
denotes the transpose of the Jacobian matrix, in Euclidean coordinates, of
. In the abstract setting,
is the adjoint to the tangent map
between the tangent bundles of two Riemannian manifolds.
These identities can be proved directly from the definition, but the first three are really just the duals of the following well-known identities for differential forms:
and so may be derived by changing the 's here to 's! (Though the third identity may take a bit of thought.)
The following identity
is a special case of the differential forms identity . Conversely, if
on a simply connected domain, then there is such that
. See laminar field for details.
(This discussion does not really belong here, but should be moved to the nabla entry.)
Using the formalism, the divergence operator can be expressed as
, the curl operator as
, and the Laplacian operator as . To wit, for a given vector field
and a given function we have
- 1
- Michael Spivak. A Comprehensive Introduction to Differential Geometry, Volume I. Publish or Perish, 1979.
|
"gradient" is owned by stevecheng. [ full author list (3) | owner history (2) ]
|
|
(view preamble)
See Also: derivative, Riemannian manifold, gradient in curvilinear coordinates, nabla, differential form, first order operators in Riemannian geometry, vector field, Hessian matrix, derivative notation, Jacobian matrix, partial derivative, tilt curve
|
|
Cross-references: Laplacian, curl, operator, divergence, Formalism, nabla, domain, simply connected, tangent bundles, tangent map, adjoint, Jacobian matrix, transpose, section, Kronecker delta, equation, identity, isomorphism, differential form, definitions, inverse, metric tensor, Einstein summation convention, orthonormal, manifold, local coordinates, smooth, tangent vectors, Riemannian metric, Riemannian manifolds, object, field, conservative, potential, modifications, extrema, path, local minima, local minimum, place, normal vector, projection, forces, ball, slope, height, Cartesian coordinates, surface, simple, radius, sphere, orientation, equivalent, tangent space, point, normal, hypersurface, negative, parallel, perpendicular, unit, angle, vector, positive, unit vectors, expression, calculate, structure, inner product, physical vector, coordinate systems, obvious, interpretations, orthonormal basis, coordinates, Euclidean, directional derivative, dot product, property, continuously differentiable, variables, independent, variations, information, derivative, vector fields, functions, scalar, maps, differential operator
There are 34 references to this entry.
This is version 15 of gradient, born on 2001-11-16, modified 2007-04-12.
Object id is 892, canonical name is Gradient.
Accessed 58272 times total.
Classification:
| AMS MSC: | 26B10 (Real functions :: Functions of several variables :: Implicit function theorems, Jacobians, transformations with several variables) | | | 26B12 (Real functions :: Functions of several variables :: Calculus of vector functions) | | | 58A10 (Global analysis, analysis on manifolds :: General theory of differentiable manifolds :: Differential forms) |
|
|
|
|
|
|
Pending Errata and Addenda
|
|
|
|
|
|
|
|
|
|
|