Matrix similarity


In linear algebra, two n-by-n matrices and are called similar if there exists an invertible n-by-n matrix such that
Similar matrices represent the same linear map under two different bases, with being the change of basis matrix.
A transformation is called a similarity transformation or conjugation of the matrix. In the general linear group, similarity is therefore the same as conjugacy, and similar matrices are also called conjugate; however in a given subgroup of the general linear group, the notion of conjugacy may be more restrictive than similarity, since it requires that be chosen to lie in.

Motivating example

When defining a linear transformation, it can be the case that a change of basis can result in a simpler form of the same transformation. For example, the matrix representing a rotation in when the axis of rotation is not aligned with the coordinate axis can be complicated to compute. If the axis of rotation were aligned with the positive -axis, then it would simply be
where is the angle of rotation. In the new coordinate system, the transformation would be written as
where and are respectively the original and transformed vectors in a new basis containing a vector parallel to the axis of rotation. In the original basis, the transform would be written as
where vectors and and the unknown transform matrix are in the original basis. To write in terms of the simpler matrix, we use the change-of-basis matrix that transforms and as and :
Thus, the matrix in the original basis is given by. The transform in the original basis is found to be the product of three easy-to-derive matrices. In effect, the similarity transform operates in three steps: change to a new basis, perform the simple transformation, and change back to the old basis.

Properties

Similarity is an equivalence relation on the space of square matrices.
Because matrices are similar if and only if they represent the same linear operator with respect to different bases, similar matrices share all properties of their shared underlying operator:
Because of this, for a given matrix A, one is interested in finding a simple "normal form" B which is similar to A—the study of A then reduces to the study of the simpler matrix B. For example, A is called diagonalizable if it is similar to a diagonal matrix. Not all matrices are diagonalizable, but at least over the complex numbers, every matrix is similar to a matrix in Jordan form. Neither of these forms is unique so they are not really normal forms; moreover their determination depends on being able to factor the minimal or characteristic polynomial of A. The rational canonical form does not have these drawbacks: it exists over any field, is truly unique, and it can be computed using only arithmetic operations in the field; A and B are similar if and only if they have the same rational canonical form. The rational canonical form is determined by the elementary divisors of A; these can be immediately read off from a matrix in Jordan form, but they can also be determined directly for any matrix by computing the Smith normal form, over the ring of polynomials, of the matrix . Note that this Smith normal form is not a normal form of A itself; moreover it is not similar to either, but obtained from the latter by left and right multiplications by different invertible matrices.
Similarity of matrices does not depend on the base field: if L is a field containing K as a subfield, and A and B are two matrices over K, then A and B are similar as matrices over K if and only if they are similar as matrices over L. This is so because the rational canonical form over K is also the rational canonical form over L. This means that one may use Jordan forms that only exist over a larger field to determine whether the given matrices are similar.
In the definition of similarity, if the matrix P can be chosen to be a permutation matrix then A and B are permutation-similar; if P can be chosen to be a unitary matrix then A and B are unitarily equivalent. The spectral theorem says that every normal matrix is unitarily equivalent to some diagonal matrix. Specht's theorem states that two matrices are unitarily equivalent if and only if they satisfy certain trace equalities.