Elasticity/Tensors

Tensors in Solid Mechanics[edit | edit source]

A sound understanding of tensors and tensor operation is essential if you want to read and understand modern papers on solid mechanics and finite element modeling of complex material behavior. This brief introduction gives you an overview of tensors and tensor notation. For more details you can read A Brief on Tensor Analysis by J. G. Simmonds, the appendix on vector and tensor notation from Dynamics of Polymeric Liquids - Volume 1 by R. B. Bird, R. C. Armstrong, and O. Hassager, and the monograph by R. M. Brannon. An introduction to tensors in continuum mechanics can be found in An Introduction to Continuum Mechanics by M. E. Gurtin. Most of the material in this page is based on these sources.

Notation[edit | edit source]

The following notation is usually used in the literature:

{\begin{aligned}s&=~{\text{scalar (lightface italic small)}}\\\mathbf {v} &=~{\text{vector (boldface roman small)}}\\{\boldsymbol {\sigma }}&=~{\text{second-order tensor (boldface Greek)}}\\{\boldsymbol {A}}&=~{\text{third-order tensor (boldface italic capital)}}\\{\boldsymbol {\mathsf {A}}}&=~{\text{fourth-order tensor (sans-serif capital)}}\end{aligned}}

Motivation[edit | edit source]

A force $\mathbf {f} \,$ has a magnitude and a direction, can be added to another force, be multiplied by a scalar and so on. These properties make the force $\mathbf {f} \,$ a vector.

Similarly, the displacement $\mathbf {u}$ is a vector because it can be added to other displacements and satisfies the other properties of a vector.

However, a force cannot be added to a displacement to yield a physically meaningful quantity. So the physical spaces that these two quantities lie on must be different.

Recall that a constant force $\mathbf {f}$ moving through a displacement $\mathbf {u} \,$ does $\mathbf {f} \bullet \mathbf {u}$ units of work. How do we compute this product when the spaces of $\mathbf {f} \,$ and $\mathbf {u} \,$ are different? If you try to compute the product on a graph, you will have to convert both quantities to a single basis and then compute the scalar product.

An alternative way of thinking about the operation $\mathbf {f} \bullet \mathbf {u}$ is to think of $\mathbf {f} \,$ as a linear operator that acts on $\mathbf {u}$ to produce a scalar quantity (work). In the notation of sets we can write

\mathbf {f} \bullet \mathbf {u} ~~~\equiv ~~~\mathbf {f} :\mathbf {u} \rightarrow \mathbb {R} ^{}~.

A first order tensor is a linear operator that sends vectors to scalars.

Next, assume that the force $\mathbf {f} \,$ acts at a point $\mathbf {x} \,$ . The moment of the force about the origin is given by $\mathbf {x} \times \mathbf {f} \,$ which is a vector. The vector product can be thought of as an linear operation too. In this case the effect of the operator is to convert a vector into another vector.

A second order tensor is a linear operator that sends vectors to vectors.

According to Simmonds, "the name tensor comes from elasticity theory where in a loaded elastic body the stress tensor acting on a unit vector normal to a plane through a point delivers the tension (i.e., the force per unit area) acting across the plane at that point."

Examples of second order tensors are the stress tensor, the deformation gradient tensor, the velocity gradient tensor, and so on.

Another type of tensor that we encounter frequently in mechanics is the fourth order tensor that takes strains to stresses. In elasticity, this is the stiffness tensor.

A fourth order tensor is a linear operator that sends second order tensors to second order tensors.

Tensor algebra[edit | edit source]

A tensor ${\boldsymbol {A}}\,$ is a linear transformation from a vector space ${\mathcal {V}}$ to ${\mathcal {V}}$ . Thus, we can write

{\boldsymbol {A}}:\mathbf {u} \in {\mathcal {V}}\rightarrow \mathbf {v\in {\mathcal {V}}} ~.

More often, we use the following notation:

\mathbf {v} ={\boldsymbol {A}}\mathbf {u} \equiv {\boldsymbol {A}}(\mathbf {u} )\equiv {\boldsymbol {A}}\bullet \mathbf {u} ~.

I have used the "dot" notation in this handout. None of the above notations is obviously superior to the others and each is used widely.

Addition of tensors[edit | edit source]

Let ${\boldsymbol {A}}\,$ and ${\boldsymbol {B}}\,$ be two tensors. Then the sum $({\boldsymbol {A}}+{\boldsymbol {B}})\,$ is another tensor ${\boldsymbol {C}}\,$ defined by

{\boldsymbol {C}}={\boldsymbol {A}}+{\boldsymbol {B}}\implies {\boldsymbol {C}}\bullet \mathbf {v} =({\boldsymbol {A}}+{\boldsymbol {B}})\bullet \mathbf {v} ={\boldsymbol {A}}\bullet \mathbf {v} +{\boldsymbol {B}}\bullet \mathbf {v} ~.

Multiplication of a tensor by a scalar[edit | edit source]

Let ${\boldsymbol {A}}\,$ be a tensor and let $\lambda \,$ be a scalar. Then the product ${\boldsymbol {C}}=\lambda {\boldsymbol {A}}\,$ is a tensor defined by

{\boldsymbol {C}}=\lambda {\boldsymbol {A}}\implies {\boldsymbol {C}}\bullet \mathbf {v} =(\lambda {\boldsymbol {A}})\bullet \mathbf {v} =\lambda ({\boldsymbol {A}}\bullet \mathbf {v} )~.

Zero tensor[edit | edit source]

The zero tensor ${\boldsymbol {\mathit {0}}}\,$ is the tensor which maps every vector $\mathbf {v} \,$ into the zero vector.

{\boldsymbol {\mathit {0}}}\bullet \mathbf {v} =\mathbf {0} ~.

Identity tensor[edit | edit source]

The identity tensor ${\boldsymbol {\mathit {I}}}\,$ takes every vector $\mathbf {v} \,$ into itself.

{\boldsymbol {\mathit {I}}}\bullet \mathbf {v} =\mathbf {v} ~.

The identity tensor is also often written as ${\boldsymbol {\mathit {1}}}\,$ .

Product of two tensors[edit | edit source]

Let ${\boldsymbol {A}}\,$ and ${\boldsymbol {B}}\,$ be two tensors. Then the product ${\boldsymbol {C}}={\boldsymbol {A}}\bullet {\boldsymbol {B}}$ is the tensor that is defined by

{\boldsymbol {C}}={\boldsymbol {A}}\bullet {\boldsymbol {B}}\implies {\boldsymbol {C}}\bullet \mathbf {v} =({\boldsymbol {A}}\bullet {\boldsymbol {B}})\bullet {\mathbf {v} }={\boldsymbol {A}}\bullet ({\boldsymbol {B}}\bullet {\mathbf {v} })~.

In general ${\boldsymbol {A}}\bullet {\boldsymbol {B}}\neq {\boldsymbol {B}}\bullet {\boldsymbol {A}}$ .

Transpose of a tensor[edit | edit source]

The transpose of a tensor ${\boldsymbol {A}}\,$ is the unique tensor ${\boldsymbol {A}}^{T}\,$ defined by

({\boldsymbol {A}}\bullet \mathbf {u} )\bullet \mathbf {v} =\mathbf {u} \bullet ({\boldsymbol {A}}^{T}\bullet \mathbf {v} )~.

The following identities follow from the above definition:

{\begin{aligned}({\boldsymbol {A}}+{\boldsymbol {B}})^{T}&={\boldsymbol {A}}^{T}+{\boldsymbol {B}}^{T}~,\\({\boldsymbol {A}}\bullet {\boldsymbol {B}})^{T}&={\boldsymbol {B}}^{T}\bullet {\boldsymbol {A}}^{T}~,\\({\boldsymbol {A}}^{T})^{T}&={\boldsymbol {A}}~.\end{aligned}}

Symmetric and skew tensors[edit | edit source]

A tensor ${\boldsymbol {A}}\,$ is symmetric if

{\boldsymbol {A}}={\boldsymbol {A}}^{T}~.

A tensor ${\boldsymbol {A}}\,$ is skew if

{\boldsymbol {A}}=-{\boldsymbol {A}}^{T}~.

Every tensor ${\boldsymbol {A}}\,$ can be expressed uniquely as the sum of a symmetric tensor ${\boldsymbol {E}}\,$ (the symmetric part of ${\boldsymbol {A}}\,$ ) and a skew tensor ${\boldsymbol {W}}\,$ (the skew part of ${\boldsymbol {A}}\,$ ).

{\boldsymbol {A}}={\boldsymbol {E}}+{\boldsymbol {W}}~;~~{\boldsymbol {E}}={\cfrac {{\boldsymbol {A}}+{\boldsymbol {A}}^{T}}{2}}~;~~{\boldsymbol {W}}={\cfrac {{\boldsymbol {A}}-{\boldsymbol {A}}^{T}}{2}}~.

Tensor product of two vectors[edit | edit source]

The tensor (or dyadic) product $\mathbf {a} \mathbf {b} \,$ (also written $\mathbf {a} \otimes \mathbf {b} \,$ ) of two vectors $\mathbf {a} \,$ and $\mathbf {b} \,$ is a tensor that assigns to each vector $\mathbf {v} \,$ the vector $(\mathbf {b} \bullet \mathbf {v} )\mathbf {a}$ .

(\mathbf {a} \mathbf {b} )\bullet \mathbf {v} =(\mathbf {a} \otimes \mathbf {b} )\bullet \mathbf {v} =(\mathbf {b} \bullet \mathbf {v} )\mathbf {a} ~.

Notice that all the above operations on tensors are remarkably similar to matrix operations.

Spectral theorem[edit | edit source]

The spectral theorem for tensors is widely used in mechanics. We will start off by definining eigenvalues and eigenvectors.

Eigenvalues and eigenvectors[edit | edit source]

Let ${\boldsymbol {S}}$ be a second order tensor. Let $\lambda$ be a scalar and $\mathbf {n}$ be a vector such that

{\boldsymbol {S}}\cdot \mathbf {n} =\lambda ~\mathbf {n}

Then $\lambda$ is called an eigenvalue of ${\boldsymbol {S}}$ and $\mathbf {n}$ is an eigenvector .

A second order tensor has three eigenvalues and three eigenvectors, since the space is three-dimensional. Some of the eigenvalues might be repeated. The number of times an eigenvalue is repeated is called multiplicity.

In mechanics, many second order tensors are symmetric and positive definite. Note the following important properties of such tensors:

If ${\boldsymbol {S}}$ is positive definite, then $\lambda >0$ .
If ${\boldsymbol {S}}$ is symmetric, the eigenvectors $\mathbf {n}$ are mutually orthogonal.

For more on eigenvalues and eigenvectors see Applied linear operators and spectral methods.

Spectral theorem[edit | edit source]

Let ${\boldsymbol {S}}$ be a symmetric second-order tensor. Then

the normalized eigenvectors $\mathbf {n} _{1},\mathbf {n} _{2},\mathbf {n} _{3}$ form an orthonormal basis.
if $\lambda _{1},\lambda _{2},\lambda _{3}$ are the corresponding eigenvalues then ${\boldsymbol {S}}=\sum _{i=1}^{3}\lambda _{i}\mathbf {n} _{i}\otimes \mathbf {n} _{i}$ .

This relation is called the spectral decomposition of ${\boldsymbol {S}}$ .

Polar decomposition theorem[edit | edit source]

Let ${\boldsymbol {F}}$ be second order tensor with $\det {\boldsymbol {F}}>0$ . Then

there exist positive definite, symmetric tensors ${\boldsymbol {U}}$ , ${\boldsymbol {V}}$ and a rotation (orthogonal) tensor ${\boldsymbol {R}}$ such that ${\boldsymbol {F}}={\boldsymbol {R}}\cdot {\boldsymbol {U}}={\boldsymbol {V}}\cdot {\boldsymbol {R}}$ .
also each of these decompositions is unique.

Principal invariants of a tensor[edit | edit source]

Let ${\boldsymbol {S}}$ be a second order tensor. Then the determinant of ${\boldsymbol {S}}-\lambda ~{\boldsymbol {\mathit {I}}}$ can be expressed as

\det({\boldsymbol {S}}-\lambda ~{\boldsymbol {\mathit {I}}})=-\lambda ^{3}+I_{1}({\boldsymbol {S}})~\lambda ^{2}-I_{2}({\boldsymbol {S}})~\lambda +I_{3}({\boldsymbol {S}})

The quantities $I_{1},I_{2},I_{3}\,$ are called the principal invariants of ${\boldsymbol {S}}$ . Expressions of the principal invariants are given below.

Principal invariants of ${\boldsymbol {S}}$

{\begin{aligned}I_{1}&={\text{tr}}~{\boldsymbol {S}}=\lambda _{1}+\lambda _{2}+\lambda _{3}\\I_{2}&={\cfrac {1}{2}}\left[({\text{tr}}~{\boldsymbol {S}})^{2}-{\text{tr}}({\boldsymbol {S^{2}}})\right]=\lambda _{1}~\lambda _{2}+\lambda _{2}~\lambda _{3}+\lambda _{3}~\lambda _{1}\\I_{3}&=\det {\boldsymbol {S}}=\lambda _{1}~\lambda _{2}~\lambda _{3}\end{aligned}}

Note that $\lambda$ is an eigenvalue of ${\boldsymbol {S}}$ if and only if

\det({\boldsymbol {S}}-\lambda ~{\boldsymbol {\mathit {I}}})=0

The resulting equations is called the characteristic equation and is usually written in expanded form as

\lambda ^{3}-I_{1}({\boldsymbol {S}})~\lambda ^{2}+I_{2}({\boldsymbol {S}})~\lambda -I_{3}({\boldsymbol {S}})=0

Cayley-Hamilton theorem[edit | edit source]

The Cayley-Hamilton theorem is a very useful result in continuum mechanics. It states that

Cayley-Hamilton theorem

If ${\boldsymbol {S}}$ is a second order tensor then it satisfies its own characteristic equation

{\boldsymbol {S}}^{3}-I_{1}({\boldsymbol {S}})~{\boldsymbol {S}}^{2}+I_{2}({\boldsymbol {S}})~{\boldsymbol {S}}-I_{3}({\boldsymbol {S}})~{\boldsymbol {\mathit {1}}}={\boldsymbol {\mathit {0}}}

Index notation[edit | edit source]

All the equations so far have made no mention of the coordinate system. When we use vectors and tensor in computations we have to express them in some coordinate system (basis) and use the components of the object in that basis for our computations.

Commonly used bases are the Cartesian coordinate frame, the cylindrical coordinate frame, and the spherical coordinate frame.

A Cartesian coordinate frame consists of an orthonormal basis $(\mathbf {e} _{1},\mathbf {e} _{2},\mathbf {e} _{3})\,$ together with a point $\mathbf {o} \,$ called the origin. Since these vectors are mutually perpendicular, we have the following relations:

{\begin{aligned}{\text{(1)}}\qquad \mathbf {e} _{1}\bullet \mathbf {e} _{1}&=1~;~~\mathbf {e} _{1}\bullet \mathbf {e} _{2}=0~;~~\mathbf {e} _{1}\bullet \mathbf {e} _{3}=0~;\\\mathbf {e} _{2}\bullet \mathbf {e} _{1}&=0~;~~\mathbf {e} _{2}\bullet \mathbf {e} _{2}=1~;~~\mathbf {e} _{2}\bullet \mathbf {e} _{3}=0~;\\\mathbf {e} _{3}\bullet \mathbf {e} _{1}&=0~;~~\mathbf {e} _{3}\bullet \mathbf {e} _{2}=0~;~~\mathbf {e} _{3}\bullet \mathbf {e} _{3}=1~.\end{aligned}}

Kronecker delta[edit | edit source]

To make the above relations more compact, we introduce the Kronecker delta symbol

{\delta _{ij}={\begin{cases}1&~{\rm {{if}~i=j~.}}\\0&~{\rm {{if}~i\neq j~.}}\end{cases}}}

Then, instead of the nine equations in (1) we can write (in index notation)

\mathbf {e} _{i}\bullet \mathbf {e} _{j}=\delta _{ij}~.

Einstein summation convention[edit | edit source]

Recall that the vector $\mathbf {u} \,$ can be written as

{\text{(2)}}\qquad \mathbf {u} =u_{1}\mathbf {e} _{1}+u_{2}\mathbf {e} _{2}+u_{3}\mathbf {e} _{3}=\sum _{i=1}^{3}u_{i}\mathbf {e} _{i}~.

In index notation, equation (2) can be written as

{\mathbf {u} =u_{i}\mathbf {e} _{i}~.}

This convention is called the Einstein summation convention. If indices are repeated, we understand that to mean that there is a sum over the indices.

Components of a vector[edit | edit source]

We can write the Cartesian components of a vector $\mathbf {u} \,$ in the basis $(\mathbf {e} _{1},\mathbf {e} _{2},\mathbf {e} _{3})\,$ as

u_{i}=\mathbf {e} _{i}\bullet \mathbf {u} ~,~~~i=1,2,3~.

Components of a tensor[edit | edit source]

Similarly, the components $A_{ij}\,$ of a tensor ${\boldsymbol {A}}\,$ are defined by

{A_{ij}=\mathbf {e} _{i}\bullet ({\boldsymbol {A}}\bullet \mathbf {e} _{j})~.}

Using the definition of the tensor product, we can also write

{\boldsymbol {A}}=\sum _{i,j=1}^{3}A_{ij}\mathbf {e} _{i}\mathbf {e} _{j}\equiv \sum _{i,j=1}^{3}A_{ij}\mathbf {e} _{i}\otimes \mathbf {e} _{j}~.

Using the summation convention,

{{\boldsymbol {A}}=A_{ij}\mathbf {e} _{i}\mathbf {e} _{j}\equiv A_{ij}\mathbf {e} _{i}\otimes \mathbf {e} _{j}~.}

In this case, the bases of the tensor are $\{\mathbf {e} _{i}\otimes \mathbf {e} _{j}\}$ and the components are $A_{ij}\,$ .

Operation of a tensor on a vector[edit | edit source]

From the definition of the components of tensor ${\boldsymbol {A}}\,$ , we can also see that (using the summation convention)

{\mathbf {v} ={\boldsymbol {A}}\bullet \mathbf {u} ~~~\equiv ~~~v_{i}=A_{ij}u_{j}~.}

Dyadic product[edit | edit source]

Similarly, the dyadic product can be expressed as

{(\mathbf {a} \mathbf {b} )_{ij}\equiv (\mathbf {a} \otimes \mathbf {b} )_{ij}=a_{i}b_{j}~.}

Matrix notation[edit | edit source]

We can also write a tensor ${\boldsymbol {A}}$ in matrix notation as

{\boldsymbol {A}}=A_{ij}\mathbf {e} _{i}\mathbf {e} _{j}=A_{ij}\mathbf {e} _{i}\otimes \mathbf {e} _{j}\implies \mathbf {A} ={\begin{bmatrix}A_{11}&A_{12}&A_{13}\\A_{21}&A_{22}&A_{23}\\A_{31}&A_{32}&A_{33}\end{bmatrix}}~.

Note that the Kronecker delta represents the components of the identity tensor in a Cartesian basis. Therefore, we can write

{\boldsymbol {I}}=\delta _{ij}\mathbf {e} _{i}\mathbf {e} _{j}=\delta _{ij}\mathbf {e} _{i}\otimes \mathbf {e} _{j}\implies \mathbf {I} ={\begin{bmatrix}1&0&0\\0&1&0\\0&0&1\end{bmatrix}}~.

Tensor inner product[edit | edit source]

The inner product ${\boldsymbol {A}}:{\boldsymbol {B}}\,$ of two tensors ${\boldsymbol {A}}\,$ and ${\boldsymbol {B}}\,$ is an operation that generates a scalar. We define (summation implied)

{{\boldsymbol {A}}:{\boldsymbol {B}}=A_{ij}B_{ij}~.}

The inner product can also be expressed using the trace :

{{\boldsymbol {A}}:{\boldsymbol {B}}=Tr({\boldsymbol {A^{T}}}\bullet {\boldsymbol {B}})~.}

Proof using the definition of the trace below :

{Tr({\boldsymbol {A^{T}}}\bullet {\boldsymbol {B}})={\boldsymbol {I}}:({\boldsymbol {A^{T}}}\bullet {\boldsymbol {B}})=\delta _{ij}\mathbf {e} _{i}\otimes \mathbf {e} _{j}:(A_{lk}\mathbf {e} _{k}\otimes \mathbf {e} _{l}\bullet B_{mn}\mathbf {e} _{m}\otimes \mathbf {e} _{n})=\delta _{ij}\mathbf {e} _{i}\otimes \mathbf {e} _{j}:(A_{mk}B_{mn}\mathbf {e} _{k}\otimes \mathbf {e} _{n})=}

{A_{mk}B_{mn}\delta _{ij}\delta _{in}\delta _{jk}=A_{mk}B_{mi}\delta _{ij}\delta _{jk}=A_{mk}B_{mj}\delta _{jk}=A_{mj}B_{mj}=A:B}

Trace of a tensor[edit | edit source]

The trace of a tensor is the scalar given by

{\text{Tr}}({\boldsymbol {A}})={\boldsymbol {I}}:{\boldsymbol {A}}=\delta _{ij}\mathbf {e} _{i}\otimes \mathbf {e} _{j}:A_{mn}\mathbf {e} _{m}\otimes \mathbf {e} _{n}=\delta _{ij}\delta _{im}\delta _{jn}A_{mn}=A_{ii}

The trace of an N x N-matrix is the sum of the components on the downward-sloping diagonal.

Magnitude of a tensor[edit | edit source]

The magnitude of a tensor ${\boldsymbol {A}}\,$ is defined by

\Vert {\boldsymbol {A}}\Vert ={\sqrt {{\boldsymbol {A}}:{\boldsymbol {A}}}}\equiv {\sqrt {A_{ij}A_{ij}}}~.

Tensor product of a tensor with a vector[edit | edit source]

Another tensor operation that is often seen is the tensor product of a tensor with a vector. Let ${\boldsymbol {A}}\,$ be a tensor and let $\mathbf {v} \,$ be a vector. Then the tensor cross product gives a tensor ${\boldsymbol {C}}\,$ defined by

{{\boldsymbol {C}}={\boldsymbol {A}}\times \mathbf {v} \implies C_{ij}=e_{klj}A_{ik}v_{l}~.}

Permutation symbol[edit | edit source]

The permutation symbol $e_{ijk}\,$ is defined as

{e_{ijk}={\begin{cases}1&~{\text{if}}~ijk=123,231,~{\text{or}}~312\\-1&~{\text{if}}~ijk=321,132,~{\text{or}}~213\\0&~{\text{if any two indices are alike}}\end{cases}}}

Identities in tensor algebra[edit | edit source]

Let ${\boldsymbol {A}}$ , ${\boldsymbol {B}}$ and ${\boldsymbol {C}}$ be three second order tensors. Then

{\boldsymbol {A}}:({\boldsymbol {B}}\cdot {\boldsymbol {C}})=({\boldsymbol {C}}\cdot {\boldsymbol {A}}^{T}):{\boldsymbol {B}}^{T}=({\boldsymbol {B}}^{T}\cdot {\boldsymbol {A}}):{\boldsymbol {C}}

Proof:

It is easiest to show these relations by using index notation with respect to an orthonormal basis. Then we can write

${\boldsymbol {A}}:({\boldsymbol {B}}\cdot {\boldsymbol {C}})\equiv A_{ij}(B_{ik}~C_{kj})=C_{kj}~A_{ji}^{T}~B_{ki}^{T}\equiv ({\boldsymbol {C}}\cdot {\boldsymbol {A}}^{T}):{\boldsymbol {B}}^{T}$

Similarly,

${\boldsymbol {A}}:({\boldsymbol {B}}\cdot {\boldsymbol {C}})\equiv A_{ij}(B_{ik}~C_{kj})=B_{ki}^{T}~A_{ij}~C_{kj}\equiv ({\boldsymbol {B}}^{T}\cdot {\boldsymbol {A}}):{\boldsymbol {C}}$

Tensor calculus[edit | edit source]

Recall that the vector differential operator (with respect to a Cartesian basis) is defined as

{\boldsymbol {\nabla }}{}={\cfrac {\partial }{\partial x_{1}}}\mathbf {e} _{1}+{\cfrac {\partial }{\partial x_{2}}}\mathbf {e} _{2}+{\cfrac {\partial }{\partial x_{3}}}\mathbf {e} _{3}\equiv {\cfrac {\partial }{\partial x_{i}}}\mathbf {e} _{i}~.

In this section we summarize some operations of ${\boldsymbol {\nabla }}{}$ on vectors and tensors.

The gradient of a vector field[edit | edit source]

The dyadic product ${\boldsymbol {\nabla }}{\mathbf {v} }\,$ (or ${\boldsymbol {\nabla }}{}\otimes \mathbf {v}$ ) is called the gradient of the vector field $\mathbf {v} \,$ . Therefore, the quantity ${\boldsymbol {\nabla }}{\mathbf {v} }$ is a tensor given by

{{\boldsymbol {\nabla }}{\mathbf {v} }=\sum _{i}\sum _{j}{\cfrac {\partial v_{j}}{\partial x_{i}}}\mathbf {e} _{i}\mathbf {e} _{j}\equiv v_{j,i}\mathbf {e} _{i}\mathbf {e} _{j}~.}

In the alternative dyadic notation,

{{\boldsymbol {\nabla }}{\mathbf {v} }\equiv {\boldsymbol {\nabla }}{}\otimes \mathbf {v} =\sum _{i}\sum _{j}{\cfrac {\partial v_{j}}{\partial x_{i}}}\mathbf {e} _{i}\otimes \mathbf {e} _{j}\equiv v_{j,i}\mathbf {e} _{i}\otimes \mathbf {e} _{j}~.}

'Warning: Some authors define the $ij$ component of ${\boldsymbol {\nabla }}{\mathbf {v} }$ as $\partial v_{i}/\partial x_{j}=v_{i,j}$ .

The divergence of a tensor field[edit | edit source]

Let ${\boldsymbol {A}}\,$ be a tensor field. Then the divergence of the tensor field is a vector ${\boldsymbol {\nabla }}\bullet {\boldsymbol {A}}$ given by

{{\boldsymbol {\nabla }}\bullet {\boldsymbol {A}}=\sum _{j}\left[\sum _{i}{\cfrac {\partial A_{ij}}{\partial x_{i}}}\right]\mathbf {e} _{j}\equiv {\cfrac {\partial A_{ij}}{\partial x_{i}}}\mathbf {e} _{j}=A_{ij,i}\mathbf {e} _{j}~.}

To fix the definition of divergence of a general tensor field (possibly of higher order than 2), we use the relation

({\boldsymbol {\nabla }}\bullet {\boldsymbol {A}})\bullet \mathbf {c} ={\boldsymbol {\nabla }}\bullet ({\boldsymbol {A}}\bullet \mathbf {c} )

where $\mathbf {c}$ is an arbitrary constant vector.

The Laplacian of a vector field[edit | edit source]

The Laplacian of a vector field is given by

{\nabla ^{2}{\mathbf {v} }={\boldsymbol {\nabla }}\bullet {{\boldsymbol {\nabla }}{\mathbf {v} }}=\sum _{j}\left[\sum _{i}{\cfrac {\partial ^{2}v_{j}}{\partial x_{i}^{2}}}\right]\mathbf {e} _{j}\equiv v_{j,ii}\mathbf {e} _{j}~.}

Tensor Identities[edit | edit source]

Some important identities involving tensors are:

${\boldsymbol {\nabla }}\bullet {{\boldsymbol {\nabla }}{\mathbf {v} }}={\boldsymbol {\nabla }}{({\boldsymbol {\nabla }}\bullet {\mathbf {v} })}-{\boldsymbol {\nabla }}\times {({\boldsymbol {\nabla }}\times {\mathbf {v} })}$ .
$\mathbf {v} \bullet {\boldsymbol {\nabla }}{\mathbf {v} }={\frac {1}{2}}{\boldsymbol {\nabla }}{(\mathbf {v} \bullet \mathbf {v} )}-\mathbf {v} \times ({\boldsymbol {\nabla }}\times {\mathbf {v} )}$ .
${\boldsymbol {\nabla }}\bullet {(\mathbf {v} \otimes \mathbf {w} )}=\mathbf {v} \bullet {\boldsymbol {\nabla }}{\mathbf {w} }+\mathbf {w} ({\boldsymbol {\nabla }}\bullet {\mathbf {v} })$ .
${\boldsymbol {\nabla }}\bullet {(\varphi {\boldsymbol {A}})}={\boldsymbol {\nabla }}{\varphi }\bullet {\boldsymbol {A}}+\varphi {\boldsymbol {\nabla }}\bullet {\boldsymbol {A}}$ .
${\boldsymbol {\nabla }}{(\mathbf {v} \bullet \mathbf {w} )}=({\boldsymbol {\nabla }}{\mathbf {v} })\bullet \mathbf {w} +({\boldsymbol {\nabla }}{\mathbf {w} })\bullet \mathbf {v}$ .
${\boldsymbol {\nabla }}\bullet {({\boldsymbol {A}}\bullet \mathbf {w} )}=({\boldsymbol {\nabla }}\bullet {\boldsymbol {A}})\bullet \mathbf {w} +{\boldsymbol {A}}^{T}:({\boldsymbol {\nabla }}{\mathbf {w} })$ .

Integral theorems[edit | edit source]

The following integral theorems are useful in continuum mechanics and finite elements.

The Gauss divergence theorem[edit | edit source]

If $\Omega$ is a region in space enclosed by a surface $\Gamma \,$ and ${\boldsymbol {A}}\,$ is a tensor field, then

{\int _{\Omega }{\boldsymbol {\nabla }}\bullet {\boldsymbol {A}}~dV=\int _{\Gamma }\mathbf {n} \bullet {\boldsymbol {A}}~dA}

where $\mathbf {n} \,$ is the unit outward normal to the surface.

The Stokes curl theorem[edit | edit source]

If $\Gamma \,$ is a surface bounded by a closed curve ${\mathcal {C}}$ , then

\int _{\Gamma }\mathbf {n} \bullet ({\boldsymbol {\nabla }}\times {{\boldsymbol {A}})}~dA=\oint _{\mathcal {C}}\mathbf {t} \bullet {\boldsymbol {A}}~ds

where ${\boldsymbol {A}}\,$ is a tensor field, $\mathbf {n} \,$ is the unit normal vector to $\Gamma \,$ in the direction of a right-handed screw motion along ${\mathcal {C}}$ , and $\mathbf {t} \,$ is a unit tangential vector in the direction of integration along ${\mathcal {C}}$ .

The Leibniz formula[edit | edit source]

Let $\Omega$ be a closed moving region of space enclosed by a surface $\Gamma \,$ . Let the velocity of any surface element be $\mathbf {v} \,$ . Then if ${\boldsymbol {A}}(\mathbf {x} ,t)\,$ is a tensor function of position and time,

{\cfrac {\partial }{\partial t}}\int _{\Omega }{\boldsymbol {A}}~dV=\int _{\Omega }{\cfrac {\partial {\boldsymbol {A}}}{\partial t}}~dV+\int _{\Gamma }{\boldsymbol {A}}(\mathbf {v} \bullet \mathbf {n} )~dA

where $\mathbf {n} \,$ is the outward unit normal to the surface $\Gamma \,$ .

Directional derivatives[edit | edit source]

We often have to find the derivatives of vectors with respect to vectors and of tensors with respect to vectors and tensors. The directional directive provides a systematic way of finding these derivatives.

The definitions of directional derivatives for various situations are given below. It is assumed that the functions are sufficiently smooth that derivatives can be taken.

Derivatives of scalar valued functions of vectors[edit | edit source]

Let $f(\mathbf {v} )$ be a real valued function of the vector $\mathbf {v}$ . Then the derivative of $f(\mathbf {v} )$ with respect to $\mathbf {v}$ (or at $\mathbf {v}$ ) in the direction $\mathbf {u}$ is the vector defined as

{\frac {\partial f}{\partial \mathbf {v} }}\cdot \mathbf {u} =Df(\mathbf {v} )[\mathbf {u} ]=\left[{\frac {\partial }{\partial \alpha }}~f(\mathbf {v} +\alpha ~\mathbf {u} )\right]_{\alpha =0}

for all vectors $\mathbf {u}$ .

Properties:

1) If $f(\mathbf {v} )=f_{1}(\mathbf {v} )+f_{2}(\mathbf {v} )$ then ${\frac {\partial f}{\partial \mathbf {v} }}\cdot \mathbf {u} =\left({\frac {\partial f_{1}}{\partial \mathbf {v} }}+{\frac {\partial f_{2}}{\partial \mathbf {v} }}\right)\cdot \mathbf {u}$

2) If $f(\mathbf {v} )=f_{1}(\mathbf {v} )~f_{2}(\mathbf {v} )$ then ${\frac {\partial f}{\partial \mathbf {v} }}\cdot \mathbf {u} =\left({\frac {\partial f_{1}}{\partial \mathbf {v} }}\cdot \mathbf {u} \right)~f_{2}(\mathbf {v} )+f_{1}(\mathbf {v} )~\left({\frac {\partial f_{2}}{\partial \mathbf {v} }}\cdot \mathbf {u} \right)$

3) If $f(\mathbf {v} )=f_{1}(f_{2}(\mathbf {v} ))$ then ${\frac {\partial f}{\partial \mathbf {v} }}\cdot \mathbf {u} ={\frac {\partial f_{1}}{\partial f_{2}}}~{\frac {\partial f_{2}}{\partial \mathbf {v} }}\cdot \mathbf {u}$

Derivatives of vector valued functions of vectors[edit | edit source]

Let $\mathbf {f} (\mathbf {v} )$ be a vector valued function of the vector $\mathbf {v}$ . Then the derivative of $\mathbf {f} (\mathbf {v} )$ with respect to $\mathbf {v}$ (or at $\mathbf {v}$ ) in the direction $\mathbf {u}$ is the second order tensor defined as

{\frac {\partial \mathbf {f} }{\partial \mathbf {v} }}\cdot \mathbf {u} =D\mathbf {f} (\mathbf {v} )[\mathbf {u} ]=\left[{\frac {\partial }{\partial \alpha }}~\mathbf {f} (\mathbf {v} +\alpha ~\mathbf {u} )\right]_{\alpha =0}

for all vectors $\mathbf {u}$ .

Properties:

1) If $\mathbf {f} (\mathbf {v} )=\mathbf {f} _{1}(\mathbf {v} )+\mathbf {f} _{2}(\mathbf {v} )$ then ${\frac {\partial \mathbf {f} }{\partial \mathbf {v} }}\cdot \mathbf {u} =\left({\frac {\partial \mathbf {f} _{1}}{\partial \mathbf {v} }}+{\frac {\partial \mathbf {f} _{2}}{\partial \mathbf {v} }}\right)\cdot \mathbf {u}$

2) If $\mathbf {f} (\mathbf {v} )=\mathbf {f} _{1}(\mathbf {v} )\times \mathbf {f} _{2}(\mathbf {v} )$ then ${\frac {\partial \mathbf {f} }{\partial \mathbf {v} }}\cdot \mathbf {u} =\left({\frac {\partial \mathbf {f} _{1}}{\partial \mathbf {v} }}\cdot \mathbf {u} \right)\times \mathbf {f} _{2}(\mathbf {v} )+\mathbf {f} _{1}(\mathbf {v} )\times \left({\frac {\partial \mathbf {f} _{2}}{\partial \mathbf {v} }}\cdot \mathbf {u} \right)$

3) If $\mathbf {f} (\mathbf {v} )=\mathbf {f} _{1}(\mathbf {f} _{2}(\mathbf {v} ))$ then ${\frac {\partial \mathbf {f} }{\partial \mathbf {v} }}\cdot \mathbf {u} ={\frac {\partial \mathbf {f} _{1}}{\partial \mathbf {f} _{2}}}\cdot \left({\frac {\partial \mathbf {f} _{2}}{\partial \mathbf {v} }}\cdot \mathbf {u} \right)$

Derivatives of scalar valued functions of tensors[edit | edit source]

Let $f({\boldsymbol {S}})$ be a real valued function of the second order tensor ${\boldsymbol {S}}$ . Then the derivative of $f({\boldsymbol {S}})$ with respect to ${\boldsymbol {S}}$ (or at ${\boldsymbol {S}}$ ) in the direction ${\boldsymbol {T}}$ is the second order tensor defined as

{\frac {\partial f}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}=Df({\boldsymbol {S}})[{\boldsymbol {T}}]=\left[{\frac {\partial }{\partial \alpha }}~f({\boldsymbol {S}}+\alpha ~{\boldsymbol {T}})\right]_{\alpha =0}

for all second order tensors ${\boldsymbol {T}}$ .

Properties:

1) If $f({\boldsymbol {S}})=f_{1}({\boldsymbol {S}})+f_{2}({\boldsymbol {S}})$ then ${\frac {\partial f}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}=\left({\frac {\partial f_{1}}{\partial {\boldsymbol {S}}}}+{\frac {\partial f_{2}}{\partial {\boldsymbol {S}}}}\right):{\boldsymbol {T}}$

2) If $f({\boldsymbol {S}})=f_{1}({\boldsymbol {S}})~f_{2}({\boldsymbol {S}})$ then ${\frac {\partial f}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}=\left({\frac {\partial f_{1}}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}\right)~f_{2}({\boldsymbol {S}})+f_{1}({\boldsymbol {S}})~\left({\frac {\partial f_{2}}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}\right)$

3) If $f({\boldsymbol {S}})=f_{1}(f_{2}({\boldsymbol {S}}))$ then ${\frac {\partial f}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}={\frac {\partial f_{1}}{\partial f_{2}}}~\left({\frac {\partial f_{2}}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}\right)$

Derivatives of tensor valued functions of tensors[edit | edit source]

Let ${\boldsymbol {F}}({\boldsymbol {S}})$ be a second order tensor valued function of the second order tensor ${\boldsymbol {S}}$ . Then the derivative of ${\boldsymbol {F}}({\boldsymbol {S}})$ with respect to ${\boldsymbol {S}}$ (or at ${\boldsymbol {S}}$ ) in the direction ${\boldsymbol {T}}$ is the fourth order tensor defined as

{\frac {\partial {\boldsymbol {F}}}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}=D{\boldsymbol {F}}({\boldsymbol {S}})[{\boldsymbol {T}}]=\left[{\frac {\partial }{\partial \alpha }}~{\boldsymbol {F}}({\boldsymbol {S}}+\alpha ~{\boldsymbol {T}})\right]_{\alpha =0}

for all second order tensors ${\boldsymbol {T}}$ .

Properties:

1) If ${\boldsymbol {F}}({\boldsymbol {S}})={\boldsymbol {F}}_{1}({\boldsymbol {S}})+{\boldsymbol {F}}_{2}({\boldsymbol {S}})$ then ${\frac {\partial {\boldsymbol {F}}}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}=\left({\frac {\partial {\boldsymbol {F}}_{1}}{\partial {\boldsymbol {S}}}}+{\frac {\partial {\boldsymbol {F}}_{2}}{\partial {\boldsymbol {S}}}}\right):{\boldsymbol {T}}$

2) If ${\boldsymbol {F}}({\boldsymbol {S}})={\boldsymbol {F}}_{1}({\boldsymbol {S}})\cdot {\boldsymbol {F}}_{2}({\boldsymbol {S}})$ then ${\frac {\partial {\boldsymbol {F}}}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}=\left({\frac {\partial {\boldsymbol {F}}_{1}}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}\right)\cdot {\boldsymbol {F}}_{2}({\boldsymbol {S}})+{\boldsymbol {F}}_{1}({\boldsymbol {S}})\cdot \left({\frac {\partial {\boldsymbol {F}}_{2}}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}\right)$

3) If ${\boldsymbol {F}}({\boldsymbol {S}})={\boldsymbol {F}}_{1}({\boldsymbol {F}}_{2}({\boldsymbol {S}}))$ then ${\frac {\partial {\boldsymbol {F}}}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}={\frac {\partial {\boldsymbol {F}}_{1}}{\partial {\boldsymbol {F}}_{2}}}:\left({\frac {\partial {\boldsymbol {F}}_{2}}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}\right)$

3) If $f({\boldsymbol {S}})=f_{1}({\boldsymbol {F}}_{2}({\boldsymbol {S}}))$ then ${\frac {\partial f}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}={\frac {\partial f_{1}}{\partial {\boldsymbol {F}}_{2}}}:\left({\frac {\partial {\boldsymbol {F}}_{2}}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}\right)$

Derivative of the determinant of a tensor[edit | edit source]

Derivative of the determinant of a tensor

The derivative of the determinant of a second order tensor ${\boldsymbol {A}}$ is given by

{\frac {\partial }{\partial {\boldsymbol {A}}}}\det({\boldsymbol {A}})=\det({\boldsymbol {A}})~[{\boldsymbol {A}}^{-1}]^{T}~.

In an orthonormal basis the components of ${\boldsymbol {A}}$ can be written as a matrix $\mathbf {A}$ . In that case, the right hand side corresponds the cofactors of the matrix.

Proof:

Let ${\boldsymbol {A}}$ be a second order tensor and let $f({\boldsymbol {A}})=\det({\boldsymbol {A}})$ . Then, from the definition of the derivative of a scalar valued function of a tensor, we have

${\begin{aligned}{\frac {\partial f}{\partial {\boldsymbol {A}}}}:{\boldsymbol {T}}&=\left.{\cfrac {d}{d\alpha }}\det({\boldsymbol {A}}+\alpha ~{\boldsymbol {T}})\right|_{\alpha =0}\\&=\left.{\cfrac {d}{d\alpha }}\det \left[\alpha ~{\boldsymbol {A}}\left({\cfrac {1}{\alpha }}~{\boldsymbol {\mathit {1}}}+{\boldsymbol {A}}^{-1}\cdot {\boldsymbol {T}}\right)\right]\right|_{\alpha =0}\\&=\left.{\cfrac {d}{d\alpha }}\left[\alpha ^{3}~\det({\boldsymbol {A}})~\det \left({\cfrac {1}{\alpha }}~{\boldsymbol {\mathit {1}}}+{\boldsymbol {A}}^{-1}\cdot {\boldsymbol {T}}\right)\right]\right|_{\alpha =0}~.\end{aligned}}$

Recall that we can expand the determinant of a tensor in the form of a characteristic equation in terms of the invariants $I_{1},I_{2},I_{3}$ using (note the sign of $\lambda$ )

$\det(\lambda ~{\boldsymbol {\mathit {1}}}+{\boldsymbol {A}})=\lambda ^{3}+I_{1}({\boldsymbol {A}})~\lambda ^{2}+I_{2}({\boldsymbol {A}})~\lambda +I_{3}({\boldsymbol {A}})~.$

Using this expansion we can write

${\begin{aligned}{\frac {\partial f}{\partial {\boldsymbol {A}}}}:{\boldsymbol {T}}&=\left.{\cfrac {d}{d\alpha }}\left[\alpha ^{3}~\det({\boldsymbol {A}})~\left({\cfrac {1}{\alpha ^{3}}}+I_{1}({\boldsymbol {A}}^{-1}\cdot {\boldsymbol {T}})~{\cfrac {1}{\alpha ^{2}}}+I_{2}({\boldsymbol {A}}^{-1}\cdot {\boldsymbol {T}})~{\cfrac {1}{\alpha }}+I_{3}({\boldsymbol {A}}^{-1}\cdot {\boldsymbol {T}})\right)\right]\right|_{\alpha =0}\\&=\left.\det({\boldsymbol {A}})~{\cfrac {d}{d\alpha }}\left[1+I_{1}({\boldsymbol {A}}^{-1}\cdot {\boldsymbol {T}})~\alpha +I_{2}({\boldsymbol {A}}^{-1}\cdot {\boldsymbol {T}})~\alpha ^{2}+I_{3}({\boldsymbol {A}}^{-1}\cdot {\boldsymbol {T}})~\alpha ^{3}\right]\right|_{\alpha =0}\\&=\left.\det({\boldsymbol {A}})~\left[I_{1}({\boldsymbol {A}}^{-1}\cdot {\boldsymbol {T}})+2~I_{2}({\boldsymbol {A}}^{-1}\cdot {\boldsymbol {T}})~\alpha +3~I_{3}({\boldsymbol {A}}^{-1}\cdot {\boldsymbol {T}})~\alpha ^{2}\right]\right|_{\alpha =0}\\&=\det({\boldsymbol {A}})~I_{1}({\boldsymbol {A}}^{-1}\cdot {\boldsymbol {T}})~.\end{aligned}}$

Recall that the invariant $I_{1}$ is given by

$I_{1}({\boldsymbol {A}})={\text{tr}}{\boldsymbol {A}}~.$

Hence,

${\frac {\partial f}{\partial {\boldsymbol {A}}}}:{\boldsymbol {T}}=\det({\boldsymbol {A}})~{\text{tr}}({\boldsymbol {A}}^{-1}\cdot {\boldsymbol {T}})=\det({\boldsymbol {A}})~[{\boldsymbol {A}}^{-1}]^{T}:{\boldsymbol {T}}~.$

Invoking the arbitrariness of ${\boldsymbol {T}}$ we then have

${\frac {\partial f}{\partial {\boldsymbol {A}}}}=\det({\boldsymbol {A}})~[{\boldsymbol {A}}^{-1}]^{T}~.$

Derivatives of the invariants of a tensor[edit | edit source]

Derivatives of the principal invariants of a tensor

The principal invariants of a second order tensor are

{\begin{aligned}I_{1}({\boldsymbol {A}})&={\text{tr}}{\boldsymbol {A}}\\I_{2}({\boldsymbol {A}})&={\frac {1}{2}}\left[({\text{tr}}{\boldsymbol {A}})^{2}-{\text{tr}}{{\boldsymbol {A}}^{2}}\right]\\I_{3}({\boldsymbol {A}})&=\det({\boldsymbol {A}})\end{aligned}}

The derivatives of these three invariants with respect to ${\boldsymbol {A}}$ are

{\begin{aligned}{\frac {\partial I_{1}}{\partial {\boldsymbol {A}}}}&={\boldsymbol {\mathit {1}}}\\{\frac {\partial I_{2}}{\partial {\boldsymbol {A}}}}&=I_{1}~{\boldsymbol {\mathit {1}}}-{\boldsymbol {A}}^{T}\\{\frac {\partial I_{3}}{\partial {\boldsymbol {A}}}}&=\det({\boldsymbol {A}})~[{\boldsymbol {A}}^{-1}]^{T}=I_{2}~{\boldsymbol {\mathit {1}}}-{\boldsymbol {A}}^{T}~(I_{1}~{\boldsymbol {\mathit {1}}}-{\boldsymbol {A}}^{T})=({\boldsymbol {A}}^{2}-I_{1}~{\boldsymbol {A}}+I_{2}~{\boldsymbol {\mathit {1}}})^{T}\end{aligned}}

Proof:

From the derivative of the determinant we know that

${\frac {\partial I_{3}}{\partial {\boldsymbol {A}}}}=\det({\boldsymbol {A}})~[{\boldsymbol {A}}^{-1}]^{T}~.$

For the derivatives of the other two invariants, let us go back to the characteristic equation

$\det(\lambda ~{\boldsymbol {\mathit {1}}}+{\boldsymbol {A}})=\lambda ^{3}+I_{1}({\boldsymbol {A}})~\lambda ^{2}+I_{2}({\boldsymbol {A}})~\lambda +I_{3}({\boldsymbol {A}})~.$

Using the same approach as for the determinant of a tensor, we can show that

${\frac {\partial }{\partial {\boldsymbol {A}}}}\det(\lambda ~{\boldsymbol {\mathit {1}}}+{\boldsymbol {A}})=\det(\lambda ~{\boldsymbol {\mathit {1}}}+{\boldsymbol {A}})~[(\lambda ~{\boldsymbol {\mathit {1}}}+{\boldsymbol {A}})^{-1}]^{T}~.$

Now the left hand side can be expanded as

${\begin{aligned}{\frac {\partial }{\partial {\boldsymbol {A}}}}\det(\lambda ~{\boldsymbol {\mathit {1}}}+{\boldsymbol {A}})&={\frac {\partial }{\partial {\boldsymbol {A}}}}\left[\lambda ^{3}+I_{1}({\boldsymbol {A}})~\lambda ^{2}+I_{2}({\boldsymbol {A}})~\lambda +I_{3}({\boldsymbol {A}})\right]\\&={\frac {\partial I_{1}}{\partial {\boldsymbol {A}}}}~\lambda ^{2}+{\frac {\partial I_{2}}{\partial {\boldsymbol {A}}}}~\lambda +{\frac {\partial I_{3}}{\partial {\boldsymbol {A}}}}~.\end{aligned}}$

Hence

${\frac {\partial I_{1}}{\partial {\boldsymbol {A}}}}~\lambda ^{2}+{\frac {\partial I_{2}}{\partial {\boldsymbol {A}}}}~\lambda +{\frac {\partial I_{3}}{\partial {\boldsymbol {A}}}}=\det(\lambda ~{\boldsymbol {\mathit {1}}}+{\boldsymbol {A}})~[(\lambda ~{\boldsymbol {\mathit {1}}}+{\boldsymbol {A}})^{-1}]^{T}$

or,

$(\lambda ~{\boldsymbol {\mathit {1}}}+{\boldsymbol {A}})^{T}\cdot \left[{\frac {\partial I_{1}}{\partial {\boldsymbol {A}}}}~\lambda ^{2}+{\frac {\partial I_{2}}{\partial {\boldsymbol {A}}}}~\lambda +{\frac {\partial I_{3}}{\partial {\boldsymbol {A}}}}\right]=\det(\lambda ~{\boldsymbol {\mathit {1}}}+{\boldsymbol {A}})~{\boldsymbol {\mathit {1}}}~.$

Expanding the right hand side and separating terms on the left hand side gives

$(\lambda ~{\boldsymbol {\mathit {1}}}+{\boldsymbol {A}}^{T})\cdot \left[{\frac {\partial I_{1}}{\partial {\boldsymbol {A}}}}~\lambda ^{2}+{\frac {\partial I_{2}}{\partial {\boldsymbol {A}}}}~\lambda +{\frac {\partial I_{3}}{\partial {\boldsymbol {A}}}}\right]=\left[\lambda ^{3}+I_{1}~\lambda ^{2}+I_{2}~\lambda +I_{3}\right]{\boldsymbol {\mathit {1}}}$

or,

${\begin{aligned}\left[{\frac {\partial I_{1}}{\partial {\boldsymbol {A}}}}~\lambda ^{3}\right.&\left.+{\frac {\partial I_{2}}{\partial {\boldsymbol {A}}}}~\lambda ^{2}+{\frac {\partial I_{3}}{\partial {\boldsymbol {A}}}}~\lambda \right]{\boldsymbol {\mathit {1}}}+{\boldsymbol {A}}^{T}\cdot {\frac {\partial I_{1}}{\partial {\boldsymbol {A}}}}~\lambda ^{2}+{\boldsymbol {A}}^{T}\cdot {\frac {\partial I_{2}}{\partial {\boldsymbol {A}}}}~\lambda +{\boldsymbol {A}}^{T}\cdot {\frac {\partial I_{3}}{\partial {\boldsymbol {A}}}}\\&=\left[\lambda ^{3}+I_{1}~\lambda ^{2}+I_{2}~\lambda +I_{3}\right]{\boldsymbol {\mathit {1}}}~.\end{aligned}}$

If we define $I_{0}:=1$ and $I_{4}:=0$ , we can write the above as

${\begin{aligned}\left[{\frac {\partial I_{1}}{\partial {\boldsymbol {A}}}}~\lambda ^{3}\right.&\left.+{\frac {\partial I_{2}}{\partial {\boldsymbol {A}}}}~\lambda ^{2}+{\frac {\partial I_{3}}{\partial {\boldsymbol {A}}}}~\lambda +{\frac {\partial I_{4}}{\partial {\boldsymbol {A}}}}\right]{\boldsymbol {\mathit {1}}}+{\boldsymbol {A}}^{T}\cdot {\frac {\partial I_{0}}{\partial {\boldsymbol {A}}}}~\lambda ^{3}+{\boldsymbol {A}}^{T}\cdot {\frac {\partial I_{1}}{\partial {\boldsymbol {A}}}}~\lambda ^{2}+{\boldsymbol {A}}^{T}\cdot {\frac {\partial I_{2}}{\partial {\boldsymbol {A}}}}~\lambda +{\boldsymbol {A}}^{T}\cdot {\frac {\partial I_{3}}{\partial {\boldsymbol {A}}}}\\&=\left[I_{0}~\lambda ^{3}+I_{1}~\lambda ^{2}+I_{2}~\lambda +I_{3}\right]{\boldsymbol {\mathit {1}}}~.\end{aligned}}$

Collecting terms containing various powers of $\lambda$ , we get

${\begin{aligned}\lambda ^{3}&\left(I_{0}~{\boldsymbol {\mathit {1}}}-{\frac {\partial I_{1}}{\partial {\boldsymbol {A}}}}~{\boldsymbol {\mathit {1}}}-{\boldsymbol {A}}^{T}\cdot {\frac {\partial I_{0}}{\partial {\boldsymbol {A}}}}\right)+\lambda ^{2}\left(I_{1}~{\boldsymbol {\mathit {1}}}-{\frac {\partial I_{2}}{\partial {\boldsymbol {A}}}}~{\boldsymbol {\mathit {1}}}-{\boldsymbol {A}}^{T}\cdot {\frac {\partial I_{1}}{\partial {\boldsymbol {A}}}}\right)+\\&\qquad \qquad \lambda \left(I_{2}~{\boldsymbol {\mathit {1}}}-{\frac {\partial I_{3}}{\partial {\boldsymbol {A}}}}~{\boldsymbol {\mathit {1}}}-{\boldsymbol {A}}^{T}\cdot {\frac {\partial I_{2}}{\partial {\boldsymbol {A}}}}\right)+\left(I_{3}~{\boldsymbol {\mathit {1}}}-{\frac {\partial I_{4}}{\partial {\boldsymbol {A}}}}~{\boldsymbol {\mathit {1}}}-{\boldsymbol {A}}^{T}\cdot {\frac {\partial I_{3}}{\partial {\boldsymbol {A}}}}\right)=0~.\end{aligned}}$

Then, invoking the arbitrariness of $\lambda$ , we have

${\begin{aligned}I_{0}~{\boldsymbol {\mathit {1}}}-{\frac {\partial I_{1}}{\partial {\boldsymbol {A}}}}~{\boldsymbol {\mathit {1}}}-{\boldsymbol {A}}^{T}\cdot {\frac {\partial I_{0}}{\partial {\boldsymbol {A}}}}&=0\\I_{1}~{\boldsymbol {\mathit {1}}}-{\frac {\partial I_{2}}{\partial {\boldsymbol {A}}}}~{\boldsymbol {\mathit {1}}}-I_{2}~{\boldsymbol {\mathit {1}}}-{\frac {\partial I_{3}}{\partial {\boldsymbol {A}}}}~{\boldsymbol {\mathit {1}}}-{\boldsymbol {A}}^{T}\cdot {\frac {\partial I_{2}}{\partial {\boldsymbol {A}}}}&=0\\I_{3}~{\boldsymbol {\mathit {1}}}-{\frac {\partial I_{4}}{\partial {\boldsymbol {A}}}}~{\boldsymbol {\mathit {1}}}-{\boldsymbol {A}}^{T}\cdot {\frac {\partial I_{3}}{\partial {\boldsymbol {A}}}}&=0~.\end{aligned}}$

This implies that

${\begin{aligned}{\frac {\partial I_{1}}{\partial {\boldsymbol {A}}}}&={\boldsymbol {\mathit {1}}}\\{\frac {\partial I_{2}}{\partial {\boldsymbol {A}}}}&=I_{1}~{\boldsymbol {\mathit {1}}}-{\boldsymbol {A}}^{T}\\{\frac {\partial I_{3}}{\partial {\boldsymbol {A}}}}&=I_{2}~{\boldsymbol {\mathit {1}}}-{\boldsymbol {A}}^{T}~(I_{1}~{\boldsymbol {\mathit {1}}}-{\boldsymbol {A}}^{T})=({\boldsymbol {A}}^{2}-I_{1}~{\boldsymbol {A}}+I_{2}~{\boldsymbol {\mathit {1}}})^{T}\end{aligned}}$

Derivative of the identity tensor[edit | edit source]

Let ${\boldsymbol {\mathit {1}}}$ be the second order identity tensor. Then the derivative of this tensor with respect to a second order tensor ${\boldsymbol {A}}$ is given by

{\frac {\partial {\boldsymbol {\mathit {1}}}}{\partial {\boldsymbol {A}}}}:{\boldsymbol {T}}={\boldsymbol {\mathsf {0}}}:{\boldsymbol {T}}={\boldsymbol {\mathit {0}}}

This is because ${\boldsymbol {\mathit {1}}}$ is independent of ${\boldsymbol {A}}$ .

Derivative of a tensor with respect to itself[edit | edit source]

Let ${\boldsymbol {A}}$ be a second order tensor. Then

{\frac {\partial {\boldsymbol {A}}}{\partial {\boldsymbol {A}}}}:{\boldsymbol {T}}=\left[{\frac {\partial }{\partial \alpha }}({\boldsymbol {A}}+\alpha ~{\boldsymbol {T}})\right]_{\alpha =0}={\boldsymbol {T}}={\boldsymbol {\mathsf {I}}}:{\boldsymbol {T}}

Therefore,

{\frac {\partial {\boldsymbol {A}}}{\partial {\boldsymbol {A}}}}={\boldsymbol {\mathsf {I}}}

Here ${\boldsymbol {\mathsf {I}}}$ is the fourth order identity tensor. In index notation with respect to an orthonormal basis

{\boldsymbol {\mathsf {I}}}=\delta _{ik}~\delta _{jl}~\mathbf {e} _{i}\otimes \mathbf {e} _{j}\otimes \mathbf {e} _{k}\otimes \mathbf {e} _{l}

This result implies that

{\frac {\partial {\boldsymbol {A}}^{T}}{\partial {\boldsymbol {A}}}}:{\boldsymbol {T}}={\boldsymbol {\mathsf {I}}}^{T}:{\boldsymbol {T}}={\boldsymbol {T}}^{T}

where

{\boldsymbol {\mathsf {I}}}^{T}=\delta _{jk}~\delta _{il}~\mathbf {e} _{i}\otimes \mathbf {e} _{j}\otimes \mathbf {e} _{k}\otimes \mathbf {e} _{l}

Therefore, if the tensor ${\boldsymbol {A}}$ is symmetric, then the derivative is also symmetric and we get

{\frac {\partial {\boldsymbol {A}}}{\partial {\boldsymbol {A}}}}={\frac {\partial {\frac {1}{2}}({\boldsymbol {A}}+{\boldsymbol {A}}^{T})}{\partial {\boldsymbol {A}}}}={\frac {1}{2}}~({\boldsymbol {\mathsf {I}}}+{\boldsymbol {\mathsf {I}}}^{T})={\boldsymbol {\mathsf {I}}}^{(s)}

where the symmetric fourth order identity tensor is

{\boldsymbol {\mathsf {I}}}^{(s)}={\frac {1}{2}}~(\delta _{ik}~\delta _{jl}+\delta _{il}~\delta _{jk})~\mathbf {e} _{i}\otimes \mathbf {e} _{j}\otimes \mathbf {e} _{k}\otimes \mathbf {e} _{l}

Derivative of the inverse of a tensor[edit | edit source]

Derivative of the inverse of a tensor

Let ${\boldsymbol {A}}$ and ${\boldsymbol {T}}$ be two second order tensors, then

{\frac {\partial }{\partial {\boldsymbol {A}}}}\left({\boldsymbol {A}}^{-1}\right):{\boldsymbol {T}}=-{\boldsymbol {A}}^{-1}\cdot {\boldsymbol {T}}\cdot {\boldsymbol {A}}^{-1}

In index notation with respect to an orthonormal basis

{\frac {\partial A_{ij}^{-1}}{\partial A_{kl}}}~T_{kl}=-A_{ik}^{-1}~T_{kl}~A_{lj}^{-1}\implies {\frac {\partial A_{ij}^{-1}}{\partial A_{kl}}}=-A_{ik}^{-1}~A_{lj}^{-1}

We also have

{\frac {\partial }{\partial {\boldsymbol {A}}}}\left({\boldsymbol {A}}^{-T}\right):{\boldsymbol {T}}=-{\boldsymbol {A}}^{-T}\cdot {\boldsymbol {T}}\cdot {\boldsymbol {A}}^{-T}

In index notation

{\frac {\partial A_{ji}^{-1}}{\partial A_{kl}}}~T_{kl}=-A_{jk}^{-1}~T_{kl}~A_{li}^{-1}\implies {\frac {\partial A_{ji}^{-1}}{\partial A_{kl}}}=-A_{li}^{-1}~A_{jk}^{-1}

If the tensor ${\boldsymbol {A}}$ is symmetric then

{\frac {\partial A_{ij}^{-1}}{\partial A_{kl}}}=-{\cfrac {1}{2}}\left(A_{ik}^{-1}~A_{jl}^{-1}+A_{il}^{-1}~A_{jk}^{-1}\right)

Proof:

Recall that

${\frac {\partial {\boldsymbol {\mathit {1}}}}{\partial {\boldsymbol {A}}}}:{\boldsymbol {T}}={\boldsymbol {\mathit {0}}}$

Since ${\boldsymbol {A}}^{-1}\cdot {\boldsymbol {A}}={\boldsymbol {\mathit {1}}}$ , we can write

${\frac {\partial }{\partial {\boldsymbol {A}}}}({\boldsymbol {A}}^{-1}\cdot {\boldsymbol {A}}):{\boldsymbol {T}}={\boldsymbol {\mathit {0}}}$

Using the product rule for second order tensors

${\frac {\partial }{\partial {\boldsymbol {S}}}}[{\boldsymbol {F}}_{1}({\boldsymbol {S}})\cdot {\boldsymbol {F}}_{2}({\boldsymbol {S}})]:{\boldsymbol {T}}=\left({\frac {\partial {\boldsymbol {F}}_{1}}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}\right)\cdot {\boldsymbol {F}}_{2}+{\boldsymbol {F}}_{1}\cdot \left({\frac {\partial {\boldsymbol {F}}_{2}}{\partial {\boldsymbol {S}}}}:{\boldsymbol {T}}\right)$

we get

${\frac {\partial }{\partial {\boldsymbol {A}}}}({\boldsymbol {A}}^{-1}\cdot {\boldsymbol {A}}):{\boldsymbol {T}}=\left({\frac {\partial {\boldsymbol {A}}^{-1}}{\partial {\boldsymbol {A}}}}:{\boldsymbol {T}}\right)\cdot {\boldsymbol {A}}+{\boldsymbol {A}}^{-1}\cdot \left({\frac {\partial {\boldsymbol {A}}}{\partial {\boldsymbol {A}}}}:{\boldsymbol {T}}\right)={\boldsymbol {\mathit {0}}}$

or,

$\left({\frac {\partial {\boldsymbol {A}}^{-1}}{\partial {\boldsymbol {A}}}}:{\boldsymbol {T}}\right)\cdot {\boldsymbol {A}}=-{\boldsymbol {A}}^{-1}\cdot {\boldsymbol {T}}$

Therefore,

${\frac {\partial }{\partial {\boldsymbol {A}}}}\left({\boldsymbol {A}}^{-1}\right):{\boldsymbol {T}}=-{\boldsymbol {A}}^{-1}\cdot {\boldsymbol {T}}\cdot {\boldsymbol {A}}^{-1}$

Remarks[edit | edit source]

The boldface notation that I've used is called the Gibbs notation. The index notation that I have used is also called Cartesian tensor notation.

Elasticity

Elasticity/Tensors

Tensors in Solid Mechanics[edit | edit source]

Notation[edit | edit source]

Motivation[edit | edit source]

Tensor algebra[edit | edit source]

Addition of tensors[edit | edit source]

Multiplication of a tensor by a scalar[edit | edit source]

Zero tensor[edit | edit source]

Identity tensor[edit | edit source]

Product of two tensors[edit | edit source]

Transpose of a tensor[edit | edit source]

Symmetric and skew tensors[edit | edit source]

Tensor product of two vectors[edit | edit source]

Spectral theorem[edit | edit source]

Eigenvalues and eigenvectors[edit | edit source]

Spectral theorem[edit | edit source]

Polar decomposition theorem[edit | edit source]

Principal invariants of a tensor[edit | edit source]

Cayley-Hamilton theorem[edit | edit source]

Index notation[edit | edit source]

Kronecker delta[edit | edit source]

Einstein summation convention[edit | edit source]

Components of a vector[edit | edit source]

Components of a tensor[edit | edit source]

Operation of a tensor on a vector[edit | edit source]

Dyadic product[edit | edit source]

Matrix notation[edit | edit source]

Tensor inner product[edit | edit source]

Trace of a tensor[edit | edit source]

Magnitude of a tensor[edit | edit source]

Tensor product of a tensor with a vector[edit | edit source]

Permutation symbol[edit | edit source]

Identities in tensor algebra[edit | edit source]

Tensor calculus[edit | edit source]

The gradient of a vector field[edit | edit source]

The divergence of a tensor field[edit | edit source]

The Laplacian of a vector field[edit | edit source]

Tensor Identities[edit | edit source]

Integral theorems[edit | edit source]

The Gauss divergence theorem[edit | edit source]

The Stokes curl theorem[edit | edit source]

The Leibniz formula[edit | edit source]

Directional derivatives[edit | edit source]

Derivatives of scalar valued functions of vectors[edit | edit source]

Derivatives of vector valued functions of vectors[edit | edit source]

Derivatives of scalar valued functions of tensors[edit | edit source]

Derivatives of tensor valued functions of tensors[edit | edit source]

Derivative of the determinant of a tensor[edit | edit source]

Derivatives of the invariants of a tensor[edit | edit source]

Derivative of the identity tensor[edit | edit source]

Derivative of a tensor with respect to itself[edit | edit source]

Derivative of the inverse of a tensor[edit | edit source]

Remarks[edit | edit source]

Navigation menu

Search