Trace (linear algebra)
From Wikipedia, the free encyclopedia
In linear algebra, the trace of an n-by-n square matrix A is defined to be the sum of the elements on the main diagonal (the diagonal from the upper left to the lower right) of A, i.e.,
where aij represents the entry on the ith row and jth column of A. Equivalently, the trace of a matrix is the sum of its eigenvalues, making it an invariant with respect to a change of basis. This characterization can be used to define the trace for a linear operator in general.
Note that the trace is only defined for a square matrix (i.e. n×n).
The use of the term trace arises from the German term Spur (cognate with the English spoor), which, as a function in mathematics, is often abbreviated to "Sp".
Contents |
[edit] Examples
Let T be a linear operator represented by the matrix
Then tr(T) = −2 + 1 − 1 = −2.
The trace of the identity matrix is the dimension of the space. More generally, the trace of a projection (i.e., P2 = P) is the rank of the projection. The trace of a nilpotent matrix is zero.
If A and B are positive semi-definite matrices of the same order then
[edit] Properties
The trace is a linear map. That is,
- tr(A + B) = tr(A) + tr(B),
- tr(cA) = ctr(A),
for all square matrices A and B, and all scalars c.
If A is an m×n matrix and B is an n×m matrix, then
- tr(AB) = tr(BA).[1]
Note however that order does matter in taking traces: in general,
In other words, we can only interchange the two halves of the expression, albeit repeatedly. This means that the trace is invariant under cyclic permutations, i.e.,
- tr(ABCD) = tr(BCDA) = tr(CDAB) = tr(DABC).
Unlike the determinant, the trace of the product is not the product of traces.
Conversely, the above properties characterize the trace completely in the sense as follows. Let f be a linear functional on the space of square matrices satisfying f(xy) = f(yx). Then f and tr are proportional.[2]
The trace is similarity-invariant, which means that A and P−1AP have the same trace. This is because
When both A and B are n by n, the trace of the commutator of A and B vanishes: tr([A, B]) = 0. In particular, using similarity invariance, it follows that the identity matrix is never similar to the commutator of any pair of matrices.
Conversely, any square matrix with zero trace is the commutator of some pair of matrices.[3]
A matrix and its transpose have the same trace:
[edit] Trace of a linear operator
Given some linear map f : V → V (V is a finite-dimensional vector space) generally, we can define the trace of this map by considering the trace of matrix representation of f, that is, choosing a basis for V and describing f as a matrix relative to this basis, and taking the trace of this square matrix. The result will not depend on the basis chosen, since different bases will give rise to similar matrices, allowing for the possibility of a basis independent definition for the trace of a linear map.
Such a definition can be given using the canonical isomorphism between the space End(V) of linear maps on V and V⊗V*, where V* is the dual space of V. Let v be in V and let f be in V*. Then the trace of the decomposable element v⊗f is defined to be f(v); the trace of a general element is defined by linearity. Using an explicit basis for V and the corresponding dual basis for V*, one can show that this gives the same definition of the trace as given above.
[edit] Eigenvalue relationships
If A is a square n-by-n matrix with real or complex entries and if λ1,...,λn are the (complex and distinct) eigenvalues of A (listed according to their algebraic multiplicities), then
This follows from the fact that A is always similar to its Jordan form, an upper triangular matrix having λ1,...,λn on the main diagonal. In contrast, the determinant of A is the product of its eigenvalues; i.e.,
[edit] Derivatives
The trace is the derivative of the determinant: it is the Lie algebra analog of the (Lie group) map of the determinant. This is made precise in Jacobi's formula for the derivative of the determinant (see under determinant). As a particular case,
: the trace is the derivative of the determinant at the identity. From this (or from the connection between the trace and the eigenvalues), one can derive a connection between the trace function, the exponential map between a Lie algebra and its Lie group (or concretely, the matrix exponential function), and the determinant:
- det(exp(A)) = exp(tr(A)).
For example, consider the one-parameter family of linear transformations given by rotation through angle θ,
These transformations all have determinant 1, so they preserve area. The derivative of this family at θ = 0 is the antisymmetric matrix
which clearly has trace zero, indicating that this matrix represents an infinitesimal transformation which preserves area.
A related characterization of the trace applies to linear vector fields. Given a matrix A, define a vector field F on Rn by F(x) = Ax. The components of this vector field are linear functions (given by the rows of A). The divergence div F is a constant function, whose value is equal to tr(A). By the divergence theorem, one can interpret this in terms of flows: if F(x) represents the velocity of a fluid at the location x, and U is a region in Rn, the net flow of the fluid out of U is given by tr(A)· vol(U), where vol(U) is the volume of U.
The trace is a linear operator, hence its derivative is constant:
[edit] Applications
The trace is used to define characters of group representations. Two representations
of a group G are equivalent (up to change of basis on V) if
for all
.
The trace also plays a central role in the distribution of quadratic forms.
[edit] Lie algebra
A matrix whose trace is zero is said to be traceless or tracefree, and these matrices form the simple Lie algebra sln, which is the Lie algebra of the special linear group of matrices with determinant 1. The special linear group consists of the matrices which do not change volume, while the special linear algebra is the matrices which infinitesimally do not change volume.
The bilinear form
is called the Killing form, which is used for the classification of Lie algebras.
The trace defines a bilinear form:
(x, y square matrices).
The form is symmetric, non-degenerate[4] and associative in the sense that:
In a simple Lie algebra (e.g.,
), every such bilinear form is proportional to each other; in particular, to the Killing form.
Two matrices x and y are said to be trace orthogonal if
[edit] Inner product
For an m-by-n matrix A with complex (or real) entries and * being the conjugate transpose, we have
with equality if and only if A = 0. The assignment
yields an inner product on the space of all complex (or real) m-by-n matrices.
If m = n then the norm induced by the above inner product is called the Frobenius norm of a square matrix. Indeed it is simply the Euclidean norm if the matrix is considered as a vector of length n2.
[edit] Generalization
The concept of trace of a matrix is generalised to the trace class of compact operators on Hilbert spaces, and the analog of the Frobenius norm is called the Hilbert-Schmidt norm.
The partial trace is another generalization of the trace that is operator-valued.
If A is a general associative algebra over a field k, then a trace on A is often defined to be any map tr: A → k which vanishes on commutators: tr([a, b]) = 0 for all a, b in A. Such a trace is not uniquely defined; it can always at least be modified by multiplication by a nonzero scalar.
A supertrace is the generalization of a trace to the setting of superalgebras.
The operation of tensor contraction generalizes the trace to arbitrary tensors.
[edit] Coordinate-free definition
We can identify the space of linear operators on a vector space V with the space
, where
. We also have a canonical bilinear function
that consists of applying an element of V * to an element of V to get an element of F. This induces a linear function on the tensor product (by its universal property), which, as it turns out, when that tensor product is viewed as the space of operators, is equal to the trace.
For V finite-dimensional, with basis {ei} and dual basis {ei}, then
is the ij entry of the matrix of the operator with respect to that basis. Any operator A is therefore a sum of the form
. With t defined as above,
. The latter, however, is just the Kronecker delta, being 1 if i=j and 0 otherwise. This shows that t(A) is simply the sum of the coefficients along the diagonal. This method, however, makes coordinate invariance an immediate consequence of the definition.
[edit] See also
[edit] Notes
- ^ This is immediate from the definition of matrix multiplication.
- ^ Proof:
- f(eij) = 0 if and only if
and f(ejj) = f(e11) (with the standard basis eij),
.
- f(eij) = 0 if and only if
- ^ Proof:
is a semisimple Lie algebra and thus every element in it is the commutator of some pair of elements. - ^ This follows from the fact that
if and only if A = 0











![B(x, y) = \operatorname{trace}(\operatorname{ad}(x)\operatorname{ad}(y))\text{ where }\operatorname{ad}(x)y = [x, y] = xy - yx](http://upload.wikimedia.org/math/2/2/2/22293217973f3260888a1cf7092c6765.png)

![\operatorname{tr}(x[y, z]) = \operatorname{tr}([x, y]z). \,](http://upload.wikimedia.org/math/c/b/0/cb0283f6961f0dcf25bbf8d2a916d02f.png)





