User:Egm6936.f09/Gradient of vector: Two tensor conventions
The Jacobian matrix of a vector-valued function of several variables is a collection of partial derivatives that can be ordered in many different ways, among which there are generally two logical orderings of these partial derivatives (see Two conventions for Jacobian matrix further below). Not surprisingly, in parallel, there are also two conventions for writing the gradient of a vector.
It is important to know these conventions to avoid the confusion in writing the term in the vorticity equation (e.g., Batchelor 1967, p.267[1]) that corresponds to the vorticity production due to vortex-line stretching (e.g., Tritton 1988, p.86[2]) as
or as
.
Both ways of writing this term could be correct, depending on which convention one uses for writing the gradient of a vector in terms of the basis vectors. It is important to note that, for any physical relation, regardless of the various possible tensor forms, there is only a unique component form; an example is the vorticity equation itself. There are two ways (conventions) to express the gradient of a vector, i.e., a second order tensor, in terms of the basis vectors.
Here, as in the article Kolmogorov scales,
designates a (spatial velocity) vector field defined on the domain
, i.e.,
. (Ignore the time dependence of
here.)
Contents |
First convention [edit]
The first convention, often used in continuum mechanics (e.g., Truesdell & Noll 2004, p.15[3]; Gurtin 1981, p.30[4]; Gurtin et al. 2010, p.45[5]; Naghdi 2001, p.21, Eq.5.2[6]; Gonzalez & Stuart 2008, p.48[7]), is
-

(1)
in which the index
of the vector component in the numerator is summed with the index
of the first basis vector
, and the index
of the coordinate
in the denominator is summed with the index
of the second basis vector
.
Nabla is not a "vector" [edit]
In Eq.(1), "grad" is a differential operator; nabla
is simply an abbreviation for "grad", and is not a "vector" in this first convention. In the second convention, on the other hand, nabla is considered as a "vector", and is written here in boldface, i.e.,
to distinguish from the above
differential operator in lightface.
Directional derivative [edit]
The first convention is likely to have its root in the definition of the gradient of a vector field
by the directional derivative of
at
in the direction
(e.g., Misner et al. 1973, p.59[8]; Truesdell & Noll 2004, p.15[3]; Gurtin 1981, p.29[4]; Ciarlet 1988, p.41[9]):
-
![\displaystyle
\left. \frac{d \mathbf U (x + \epsilon \mathbf h)}{d \epsilon} \right|_{\epsilon = 0} = \lim_{\epsilon \rightarrow 0} \frac{\mathbf U (x + \epsilon \mathbf h) - \mathbf U (x)}{\epsilon}= D \mathbf U (x) \cdot \mathbf h = [{\rm grad} \mathbf U (x)] \cdot \mathbf h \equiv [\nabla \mathbf U (x)] \cdot \mathbf h](//upload.wikimedia.org/math/7/3/e/73e2768c5c2a3b094fd84a10c9784c1b.png)
(1b)
In some works, the component form of
in Eq.(1) was not given explicitly, but can be deduced implicitly from Eq.(1b).
Second convention, transpose of the first [edit]
The second convention for gradient of a vector, often used in fluid mechanics (e.g., Batchelor 1967[1], Bird et al. 2006[10], Pope 2000[11], Tritton 1988[2]), corresponds to the transpose of the gradient in the first convention.
For this second convention, since boldface nabla is difficult to distinguish from lightface nabla, we use
(nabla with subscript *) to denote the gradient (of a vector) to distinguish with the symbol
(plain lightface nabla) used for the first convention:
-

(2)
in which the right-arrow nabla
in Malvern 1969, p.58[12], was introduced.
Nabla is a "vector" [edit]
In this second convention, the gradient operator
is thought of as the vector
(Pope 2000, p.651[11]), i.e.,
-

(2b)
that acts on the argument (vector ) on its right side (cf. Pope 2000, p.14[11]).
Note that boldface
was also used to designate gradient as a vector (e.g., Landau & Lifshitz 1987, p.3, Eq.(2.2) [13] ), i.e.,
-

(2c)
Again, since it is difficult to distinguish the boldface
from the lightface
, we will avoid using these boldface symbols, but use
instead.
Thus equivalently
-

(3)
where we also introduced the "left-arrow nabla" symbol
used in Malvern 1969, p.58[12]. The left arrow is used to indicate that the differential operator in
acts on the argument (vector
) on its left side. On the other hand, as a tensor product (or dyadic), vector
is on the left followed by "vector"
. Thus
-

(3b)
which is identical to Eq.(1).
Directional derivative [edit]
With the second convention, the directional derivative is written a bit awkwardly by having to move the direction
in front of the gradient
, i.e.,
-
![\displaystyle
D \mathbf U (x) \cdot \mathbf h = \mathbf h \cdot [\nabla_* \mathbf U (x)]](//upload.wikimedia.org/math/d/6/3/d63274eb591570ae7695909b65717d48.png)
(3c)
An example of the use of Eq.(3c) is Pope 2000, p.14, Eq.(2.16) [11], i.e.,
-

(3d)
Two conventions for Jacobian matrix [edit]
Not surprisingly, the above two conventions for writing the gradient of a vector parallels the two similar conventions of writing the Jacobian matrix containing the derivative of
, with respect to
. The first convention, which is the most used convention (e.g., Battin 1999, p.xxxi[14]; Ciarlet 1988, pp.28-29[9]; Hughes 1987, p.119[15]; cf. Malvern 1969, p.58[12]; Rappaz et al. 2010, p.13[16]), is to write
-
![\displaystyle
\mathbf J = \left[ \frac{\partial U_i}{\partial x_j} \right] = \left[ \displaystyle \begin{array}{lll} \frac{\partial U_1}{\partial x_1} & \frac{\partial U_1}{\partial x_2} & \frac{\partial U_1}{\partial x_3} \\ \frac{\partial U_2}{\partial x_1} & \frac{\partial U_2}{\partial x_2} & \frac{\partial U_2}{\partial x_3} \\ \frac{\partial U_3}{\partial x_1} & \frac{\partial U_3}{\partial x_2} & \frac{\partial U_3}{\partial x_3} \end{array} \right]](//upload.wikimedia.org/math/a/2/6/a260b7c37d0a2356dc3f3c8e1171b55f.png)
(4)
The second convention corresponds to the transpose of the above convention, with
being the column index, and
the row index (e.g., Kellogg 1929, p.35[17]; Zienkiewicz & Taylor 2005, p.20[18]; Fish & Belytschko 2007, p.167[19]). Thus if
designates the Jacobian matrix in the second convention, then we have (cf. Malvern 1969, p.58[12])
-
![\displaystyle
\mathbf J_* = \mathbf J^T = \left[ \frac{\partial U_i}{\partial x_j} \right] = \left[ \displaystyle \begin{array}{lll} \frac{\partial U_1}{\partial x_1} & \frac{\partial U_2}{\partial x_1} & \frac{\partial U_3}{\partial x_1} \\ \frac{\partial U_1}{\partial x_2} & \frac{\partial U_2}{\partial x_2} & \frac{\partial U_3}{\partial x_2} \\ \frac{\partial U_1}{\partial x_3} & \frac{\partial U_2}{\partial x_3} & \frac{\partial U_3}{\partial x_3} \end{array} \right]](//upload.wikimedia.org/math/7/9/3/793701d1a3beb983ceb4341a80abd43e.png)
(5)
Matrix decomposition [edit]
From the matrix algebra of the component form, we can deduce the corresponding tensor form. Let's take the case of a 2x2 matrix as an example.
-
![\displaystyle
\hat{\mathbf A} = \left[ \begin{array}{ll} A^1_1 & A^1_2 \\ A^2_1 & A^2_2 \end{array} \right] = A^1_1 \left\{ \begin{array}{l} 1 \\ 0 \end{array}\right\} \left\lfloor 1 \ 0 \right\rfloor + A^1_2 \left\{ \begin{array}{l} 1 \\ 0 \end{array}\right\} \left\lfloor 0 \ 1 \right\rfloor + A^2_1 \left\{ \begin{array}{l} 0 \\ 1 \end{array}\right\} \left\lfloor 1 \ 0 \right\rfloor + A^2_2 \left\{ \begin{array}{l} 0 \\ 1 \end{array}\right\} \left\lfloor 0 \ 1 \right\rfloor = \sum_{i,j} A^i_j \hat{\mathbf e}_i \hat{\mathbf e}_j ^T](//upload.wikimedia.org/math/f/8/0/f80e9a7e77f16a0fc5cf14c46fe69973.png)
(6)
where the superscript denotes the row index and the subscript the column index, and where
is a column matrix and
is its transpose, i.e., the corresponding row matrix, as shown below:
-

(7)
If we think of
as the matrix of components of the vector
, then the matrix of components of the tensor product
is
; this association is consistent with the following rule of tensor algebra among 3 vectors
(e.g., Gurtin 1981, p.4[4])
-

(8)
since in terms of matrices of components of these vectors, we have
-

(9)
which is the matrix of components of the vector
. Thus the tensor
with the above matrix of components
can be written as
-

(10)
With the above explanation, it is then clear that the first convention of writing the gradient of a vector, i.e.,
in Eq.(1), corresponds to the usual convention of writing the Jacobian matrix
in Eq.(4), whereas the second convention for the gradient of a vector, i.e.,
in Eq.(2), corresponds to the second (rare) convention of writing the Jacobian matrix (which is the transpose of the usual Jacobian matrix).
In addition, the main rationale for the second convention in writing
, as explained above, is to think of the gradient operator as a "vector", which can then be formed freely with another vector (e.g.,
) into a dyadic (or tensor product).
Vorticity production by vortex-line stretching [edit]
Now let's return to the vorticity production by vortex-line stretching; the following relationships hold:
-

(11)
We can write:
-

(12)
where parentheses were put around
to emphasize the meaning of the gradient of a vector (e.g., Pope 2000, p.14[11]). Many authors (e.g., Aris 1962, p.57, p.79[20]; King et al. 2003, p.50[21]) put the parentheses around
, i.e.,
-
![\displaystyle
( \boldsymbol \omega \cdot \overset{\rightarrow}{\nabla} ) \mathbf U = \left[ (\omega_k \mathbf e_k) \cdot \left( \frac{\partial}{\partial x_j} \mathbf e_j \right ) \right] \left( U_i \mathbf e_i \right ) \equiv \boldsymbol \omega \cdot ( \overset{\rightarrow}{\nabla} \mathbf U ) \equiv \boldsymbol \omega \cdot \overset{\rightarrow}{\nabla} \mathbf U](//upload.wikimedia.org/math/1/3/5/135de992ff14e3a804d814e853bb54f8.png)
(13)
which could originate from the writing of the material time derivative operator as (Aris 1962, p.114[20]; cf. Pope 2000, p.13[11])
-

(14)
Even though the same component form would result from such notation, the physical meaning of the gradient of a vector field in
is lost. Besides,
is not the divergence of
; its mathematical meaning is a differential operator waiting to operate on
; one could interpret
as the gradient projected along the direction
.
Many other authors simply omit the parentheses to write the vortex stretching term as
(e.g., Batchelor 1967, p.267[1]; Pope 2000, p.22[11]).
Other uses and notations [edit]
On the other hand, there is no point of avoiding
, since the strain rate is
-
![\displaystyle
\boldsymbol \epsilon = \frac{1}{2}\left[ \nabla \mathbf U + \nabla^T \mathbf U \right] = \frac{1}{2}\left[ {\mathbf U} \overset{\leftarrow}{\nabla} + \overset{\rightarrow}{\nabla} \mathbf U \right] = \frac{1}{2}\left[ \nabla_*^T \mathbf U + \nabla_* \mathbf U \right]](//upload.wikimedia.org/math/b/6/f/b6fd48d677a9acd024525b93aa16be79.png)
(15)
Note that other authors (e.g., Dhont 2004, p.20[22]) even put the tensor product between nabla
and
(perhaps to emphasize the 2nd-order nature of the resulting tensor) to write
-

(16)
Here again, as mentioned above, the nabla
is viewed as a vector, as in Eq.(2b).
Closure [edit]
As mentioned above, since there is a unique component form for any physical relation, even though there could be several ways to write the same relation in tensor form, some authors write exclusively in component form (e.g., Berdichevsky 2009[23], Wyngaard 2010[24]). The tensor form has its advantage of being concise, compact, and is thus easier to grasp the relation among the different tensor quantities, without being bogged down by the indices and their summations. To avoid confusion, however, the corresponding key components form should always be given, such as the component form of
in Eq.(1).
Most continuum mechanics literature uses the first convention, whereas most fluid mechanics literature uses the second convention. Malvern 1969[12] is a rare exception and an excellent book that presents both conventions, as the book was intended for a course that graduate students would take before taking more specialized topics such as linear elasticity or fluid mechanics.
Unfortunately, these days, the original vision of Malvern (even at the last university where he taught for more than 20 years) is no longer followed perhaps due to practical reasons: Solid mechanics students would take the more specialized linear elasticity before taking continuum mechanics, which includes nonlinear behaviors, whereas fluid mechanics students would shun continuum mechanics completely, as the kinematics of deformation, which is primarily useful for solids, would not be useful for fluids.
References (order of appearance, with links) [edit]
- ↑ 1.0 1.1 1.2 Batchelor, G.K., An introduction to fluid dynamics, Cambridge University Press, 1967.
- ↑ 2.0 2.1 Tritton, D.J., Physical fluid dynamics, 2nd edition, Oxford Science Publications, 1988.
- ↑ 3.0 3.1 Truesdell, C., Noll, W., The nonlinear field theories of mechanics, ed. by S. Antman, 3rd edition, Springer Verlag, 2004.
- ↑ 4.0 4.1 4.2 Gurtin, M.E., An introduction to continuum mechanics, Academic Press, 1981.
- ↑ Gurtin, M.E., Fried, E., Anand, L., The mechanics and thermodynamics of continua, Cambridge University Press, 2010.
- ↑ Naghdi, P.M., ME 185 Lecture notes on continuum mechanics, University of California at Berkeley, Mechanical Engineering, edited by J. Casey, 2001.
- ↑ Gonzalez, O., Stuart, A.M., A first course in continuum mechanics, Cambridge University Press, 2008.
- ↑ Misner, C.W., Thorne, K.S., Wheeler, J.A., Gravitation, Freeman and Co., 1973.
- ↑ 9.0 9.1 Ciarlet, P.G., Mathematical elasticity, Vol.1: Three-dimensional elasticity, Elsevier, 1988.
- ↑ Bird, R.B., Stewart, W.E., Lightfood, E.N., Transport phenomena, Second edition, Wiley, 2006.
- ↑ 11.0 11.1 11.2 11.3 11.4 11.5 11.6 Stephen B. Pope, Turbulent Flows, Cambridge U. Press, 2000. Amazon
- ↑ 12.0 12.1 12.2 12.3 12.4 Malvern, L.E., Introduction to the mechanics of a continuous medium, Prentice Hall, 1969 (hard cover), 1977 (paperback).
- ↑ Landau, L.D., Lifshitz, E.M., Fluid mechanics, 2nd edition, Pergamon Press, 1987.
- ↑ Battin, R.H., An introduction to the mathematics and methods of astrodynamics, revised edition, AIAA, 1999.
- ↑ Hughes, T.J.R., The finite element method: Linear static and dynamic finite element analysis, Prentice Hall, NJ, 1987.
- ↑ Rappaz, M., Bellet, M., Deville, M., Numerical modeling in materials science and engineering, Springer, 2010.
- ↑ Kellogg, O.D., Foundations of potential theory, Frederick Ungar Publishing Co., New York, 1929; Dover, 2010.
- ↑ Zienkiewicz, O.C., Taylor, R.L., The Finite element method, 6th edition, Butterworth-Heineman, MA, 2005.
- ↑ Fish, J., Belytschko, T., A first course in finite elements, Wiley, 2007.
- ↑ 20.0 20.1 Aris, R., Vectors, tensors, and the basic equations of fluid mechanics, Prentice Hall, NJ, 1962; Dover, 1989.
- ↑ King, A.C., Billingham, J., Otto, S.R., Differential equations: Linear, nonlinear, ordinary, partial, Cambridge University Press, 2003.
- ↑ Dhont, G., The finite element method for three-dimensional thermomechanical applications, Wiley, 2004.
- ↑ Berdichevsky, V.L., Variational principles of continuum mechanics, Vol.II: Applications, Springer Verlag, 2009.
- ↑ Wyngaard, J.C., Turbulence in the atmosphere, Cambridge University Press, 2010.
References (alphabetical order) [edit]
Aris, R., Vectors, tensors, and the basic equations of fluid mechanics, Prentice Hall, NJ, 1962; Dover, 1989.
Batchelor, G.K., An introduction to fluid dynamics, Cambridge University Press, 1967.
Battin, R.H., An introduction to the mathematics and methods of astrodynamics, revised edition, AIAA, 1999.
Berdichevsky, V.L., Variational principles of continuum mechanics, Vol.II: Applications, Springer Verlag, 2009.
Bird, R.B., Stewart, W.E., Lightfood, E.N., Transport phenomena, Second edition, Wiley, 2006.
Ciarlet, P.G., Mathematical elasticity, Vol.1: Three-dimensional elasticity, Elsevier, 1988.
Dhont, G., The finite element method for three-dimensional thermomechanical applications, Wiley, 2004.
Fish, J., Belytschko, T., A first course in finite elements, Wiley, 2007.
Gonzalez, O., Stuart, A.M., A first course in continuum mechanics, Cambridge University Press, 2008.
Gurtin, M.E., An introduction to continuum mechanics, Academic Press, 1981.
Gurtin, M.E., Fried, E., Anand, L., The mechanics and thermodynamics of continua, Cambridge University Press, 2010.
Hughes, T.J.R., The finite element method: Linear static and dynamic finite element analysis, Prentice Hall, NJ, 1987.
Kellogg, O.D., Foundations of potential theory, Frederick Ungar Publishing Co., New York, 1929; Dover, 2010.
King, A.C., Billingham, J., Otto, S.R., Differential equations: Linear, nonlinear, ordinary, partial, Cambridge University Press, 2003.
Landau, L.D., Lifshitz, E.M., Fluid mechanics, 2nd edition, Pergamon Press, 1987.
Malvern, L.E., Introduction to the mechanics of a continuous medium, Prentice Hall, 1969 (hard cover), 1977 (paperback).
Misner, C.W., Thorne, K.S., Wheeler, J.A., Gravitation, Freeman and Co., 1973.
Naghdi, P.M., ME 185 Lecture notes on continuum mechanics, University of California at Berkeley, Mechanical Engineering, edited by J. Casey, 2001.
Rappaz, M., Bellet, M., Deville, M., Numerical modeling in materials science and engineering, Springer, 2010.
Stephen B. Pope, Turbulent Flows, Cambridge U. Press, 2000.
Tritton, D.J., Physical fluid dynamics, 2nd edition, Oxford Science Publications, 1988.
Truesdell, C., Noll, W., The nonlinear field theories of mechanics, ed. by S. Antman, 3rd edition, Springer Verlag, 2004.
Wyngaard, J.C., Turbulence in the atmosphere, Cambridge University Press, 2010.
Zienkiewicz, O.C., Taylor, R.L., The Finite element method, 6th edition, Butterworth-Heineman, MA, 2005.

![\displaystyle
\left. \frac{d \mathbf U (x + \epsilon \mathbf h)}{d \epsilon} \right|_{\epsilon = 0} = \lim_{\epsilon \rightarrow 0} \frac{\mathbf U (x + \epsilon \mathbf h) - \mathbf U (x)}{\epsilon}= D \mathbf U (x) \cdot \mathbf h = [{\rm grad} \mathbf U (x)] \cdot \mathbf h \equiv [\nabla \mathbf U (x)] \cdot \mathbf h](http://upload.wikimedia.org/math/7/3/e/73e2768c5c2a3b094fd84a10c9784c1b.png)





![\displaystyle
D \mathbf U (x) \cdot \mathbf h = \mathbf h \cdot [\nabla_* \mathbf U (x)]](http://upload.wikimedia.org/math/d/6/3/d63274eb591570ae7695909b65717d48.png)

![\displaystyle
\mathbf J = \left[ \frac{\partial U_i}{\partial x_j} \right] = \left[ \displaystyle \begin{array}{lll} \frac{\partial U_1}{\partial x_1} & \frac{\partial U_1}{\partial x_2} & \frac{\partial U_1}{\partial x_3} \\ \frac{\partial U_2}{\partial x_1} & \frac{\partial U_2}{\partial x_2} & \frac{\partial U_2}{\partial x_3} \\ \frac{\partial U_3}{\partial x_1} & \frac{\partial U_3}{\partial x_2} & \frac{\partial U_3}{\partial x_3} \end{array} \right]](http://upload.wikimedia.org/math/a/2/6/a260b7c37d0a2356dc3f3c8e1171b55f.png)
![\displaystyle
\mathbf J_* = \mathbf J^T = \left[ \frac{\partial U_i}{\partial x_j} \right] = \left[ \displaystyle \begin{array}{lll} \frac{\partial U_1}{\partial x_1} & \frac{\partial U_2}{\partial x_1} & \frac{\partial U_3}{\partial x_1} \\ \frac{\partial U_1}{\partial x_2} & \frac{\partial U_2}{\partial x_2} & \frac{\partial U_3}{\partial x_2} \\ \frac{\partial U_1}{\partial x_3} & \frac{\partial U_2}{\partial x_3} & \frac{\partial U_3}{\partial x_3} \end{array} \right]](http://upload.wikimedia.org/math/7/9/3/793701d1a3beb983ceb4341a80abd43e.png)
![\displaystyle
\hat{\mathbf A} = \left[ \begin{array}{ll} A^1_1 & A^1_2 \\ A^2_1 & A^2_2 \end{array} \right] = A^1_1 \left\{ \begin{array}{l} 1 \\ 0 \end{array}\right\} \left\lfloor 1 \ 0 \right\rfloor + A^1_2 \left\{ \begin{array}{l} 1 \\ 0 \end{array}\right\} \left\lfloor 0 \ 1 \right\rfloor + A^2_1 \left\{ \begin{array}{l} 0 \\ 1 \end{array}\right\} \left\lfloor 1 \ 0 \right\rfloor + A^2_2 \left\{ \begin{array}{l} 0 \\ 1 \end{array}\right\} \left\lfloor 0 \ 1 \right\rfloor = \sum_{i,j} A^i_j \hat{\mathbf e}_i \hat{\mathbf e}_j ^T](http://upload.wikimedia.org/math/f/8/0/f80e9a7e77f16a0fc5cf14c46fe69973.png)






![\displaystyle
( \boldsymbol \omega \cdot \overset{\rightarrow}{\nabla} ) \mathbf U = \left[ (\omega_k \mathbf e_k) \cdot \left( \frac{\partial}{\partial x_j} \mathbf e_j \right ) \right] \left( U_i \mathbf e_i \right ) \equiv \boldsymbol \omega \cdot ( \overset{\rightarrow}{\nabla} \mathbf U ) \equiv \boldsymbol \omega \cdot \overset{\rightarrow}{\nabla} \mathbf U](http://upload.wikimedia.org/math/1/3/5/135de992ff14e3a804d814e853bb54f8.png)

![\displaystyle
\boldsymbol \epsilon = \frac{1}{2}\left[ \nabla \mathbf U + \nabla^T \mathbf U \right] = \frac{1}{2}\left[ {\mathbf U} \overset{\leftarrow}{\nabla} + \overset{\rightarrow}{\nabla} \mathbf U \right] = \frac{1}{2}\left[ \nabla_*^T \mathbf U + \nabla_* \mathbf U \right]](http://upload.wikimedia.org/math/b/6/f/b6fd48d677a9acd024525b93aa16be79.png)
