Matrix (mathematics)
In mathematics, a matrix (plural: matrices) is a rectangular arrangement (table) of elements, usually mathematical objects such as numbers. One can then calculate with these objects in a well-defined way, for instance by adding matrices or multiplying them with one another.
Matrices are a key concept of linear algebra and appear in almost all areas of mathematics. They concisely represent relationships in which linear combinations play a role and thereby simplify calculations and reasoning. They are used in particular to represent linear maps and to describe and solve systems of linear equations. The name matrix was introduced in 1850 by James Joseph Sylvester.
The elements are arranged in rows and columns, as in the general schema shown below. The generalization to more than two indices is also called a hypermatrix.
Concepts and first properties
Notation
As notation, the arrangement of the elements in rows and columns between two large opening and closing brackets has become established. As a rule, round brackets are used, but square brackets also occur. For example,
- $\begin{pmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \end{pmatrix}$ and $\begin{bmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \end{bmatrix}$
denote matrices with two rows and three columns. Matrices are usually designated with capital letters (sometimes in bold or, handwritten, singly or doubly underlined), preferably $A$. A matrix with $m$ rows and $n$ columns:
- $A = \begin{pmatrix} a_{11} & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ a_{m1} & a_{m2} & \cdots & a_{mn} \end{pmatrix} = \left(a_{ij}\right)_{i=1,\ldots,m;\ j=1,\ldots,n}$.
Elements of the matrix
The elements of the matrix are also called entries or components of the matrix. They come from a set $K$, usually a field or a ring. One speaks of a matrix over $K$. If one chooses the real numbers for $K$, one speaks of a real matrix; in the case of complex numbers, of a complex matrix.
A particular element is identified by two indices: usually the element in the first row and the first column is denoted by $a_{11}$. In general, $a_{ij}$ denotes the element in the $i$-th row and the $j$-th column. When indexing, the row index is always given first and the column index second (mnemonic: row first, column second). If there is a risk of confusion, the two indices are separated by a comma. For example, the matrix element in the first row and the eleventh column is denoted by $a_{1,11}$.
Individual rows and columns are often called row or column vectors. An example:
- for $A = \begin{pmatrix} 1 & 2 \\ 3 & 4 \end{pmatrix}$, the columns (column vectors) are $\begin{pmatrix} 1 \\ 3 \end{pmatrix}$ and $\begin{pmatrix} 2 \\ 4 \end{pmatrix}$, and the rows (row vectors) are $\begin{pmatrix} 1 & 2 \end{pmatrix}$ and $\begin{pmatrix} 3 & 4 \end{pmatrix}$.
For individual row and column vectors of a matrix, the index that does not vary is sometimes omitted. For a more compact representation, column vectors are sometimes written as transposed row vectors, so
- $a = \begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix}$ is also written as $a = (a_1,\, a_2,\, a_3)^T$ or $a = (a_1\ a_2\ a_3)^T$.
Type
The type of a matrix is given by the number of its rows and columns. A matrix with $m$ rows and $n$ columns is called an $m \times n$ matrix (read: m-by-n matrix). If the numbers of rows and columns coincide, one speaks of a square matrix.
A matrix consisting of only one column or only one row is usually understood as a vector. A vector with $n$ elements can be represented, depending on the context, as a single-column $n \times 1$ matrix or a single-row $1 \times n$ matrix. In addition to the terms column vector and row vector, the terms column matrix and row matrix are also used. A $1 \times 1$ matrix is both a column matrix and a row matrix and is regarded as a scalar.
Formal representation
A matrix is a doubly indexed family $(a_{ij})_{i,j}$. Formally, this is a function
- $A \colon \{1, \ldots, m\} \times \{1, \ldots, n\} \to K, \quad (i, j) \mapsto a_{ij},$
which assigns the entry $a_{ij}$ to each index pair $(i, j)$ as function value. For example, the entry $a_{12}$ is assigned to the index pair $(1, 2)$ as function value. The function value $a_{ij}$ is thus the entry in the $i$-th row and $j$-th column. The variables $m$ and $n$ correspond to the number of rows and columns, respectively. Not to be confused with this formal definition of a matrix as a function is the fact that matrices themselves describe linear maps.
The set of all $m \times n$ matrices over the set $K$ is also written in common mathematical notation as $K^{\{1,\ldots,m\} \times \{1,\ldots,n\}}$; for this, the short notation $K^{m \times n}$ has become established. The notations $K^{m,n}$ or $M(m \times n, K)$ are also used, though less often.
Addition and multiplication
Elementary arithmetic operations are defined on the space of matrices.
Matrix addition
Two matrices can be added if they are of the same type, that is, if they have the same number of rows and the same number of columns. The sum of two $m \times n$ matrices $A = (a_{ij})$ and $B = (b_{ij})$ is defined componentwise:
- $A + B := (a_{ij} + b_{ij})_{i=1,\ldots,m;\ j=1,\ldots,n}$
Sample calculation:
- $\begin{pmatrix} 1 & -3 & 2 \\ 1 & 2 & 7 \end{pmatrix} + \begin{pmatrix} 0 & 3 & 5 \\ 2 & 1 & -1 \end{pmatrix} = \begin{pmatrix} 1 & 0 & 7 \\ 3 & 3 & 6 \end{pmatrix}$
In linear algebra, the entries of the matrices are usually elements of a field, e.g. the real or complex numbers. In this case matrix addition is associative and commutative and has the zero matrix as neutral element. In general, matrix addition only has these properties if the entries are elements of an algebraic structure that has them.
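As a concrete illustration of the componentwise definition, here is a minimal sketch using the NumPy library (an illustrative assumption, not part of the article's own material):

    import numpy as np

    # Two 2x3 matrices of the same type (same numbers of rows and columns)
    A = np.array([[1, -3, 2],
                  [1,  2, 7]])
    B = np.array([[0, 3,  5],
                  [2, 1, -1]])

    # Componentwise sum: (A + B)[i, j] == A[i, j] + B[i, j]
    print(A + B)  # [[1 0 7]
                  #  [3 3 6]]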
Scalar multiplication
A matrix is multiplied by a scalar by multiplying each entry of the matrix by the scalar:
- $\lambda \cdot A = \lambda \cdot (a_{ij}) := (\lambda \cdot a_{ij})$
Sample calculation:
- $5 \cdot \begin{pmatrix} 1 & -3 & 2 \\ 1 & 2 & 7 \end{pmatrix} = \begin{pmatrix} 5 & -15 & 10 \\ 5 & 10 & 35 \end{pmatrix}$
Scalar multiplication must not be confused with the scalar product. To be able to perform the scalar multiplication, the scalar $\lambda$ (lambda) and the entries of the matrix must come from the same ring $(R, +, \cdot)$. The set of $m \times n$ matrices is in this case a (left) module over $R$.
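The same style of sketch for scalar multiplication (again NumPy, purely illustrative):

    import numpy as np

    A = np.array([[1, -3, 2],
                  [1,  2, 7]])
    lam = 5  # the scalar (lambda)

    # Every entry of A is multiplied by the scalar
    print(lam * A)  # [[ 5 -15  10]
                    #  [ 5  10  35]]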
Matrix multiplication
Two matrices can be multiplied if the number of columns of the left matrix equals the number of rows of the right matrix. The product $C = A \cdot B$ of an $l \times m$ matrix $A = (a_{ik})$ and an $m \times n$ matrix $B = (b_{kj})$ is an $l \times n$ matrix $C = (c_{ij})$ whose entries are computed by applying the sum-of-products formula, similar to the scalar product, to pairs consisting of a row vector of the first matrix and a column vector of the second matrix:
- $c_{ij} = \sum_{k=1}^{m} a_{ik} \cdot b_{kj}$
Matrix multiplication is not commutative; i.e., in general $A \cdot B \neq B \cdot A$. Matrix multiplication is, however, associative; i.e., it always holds that
- $(A \cdot B) \cdot C = A \cdot (B \cdot C).$
A chain of matrix multiplications can therefore be bracketed in different ways. The problem of finding a bracket that leads to a calculation with the minimum number of elementary arithmetic operations is an optimization problem . The matrix addition and matrix multiplication also satisfy the two distributive laws :
$(A + B) \cdot C = A \cdot C + B \cdot C$ for all $l \times m$ matrices $A, B$ and $m \times n$ matrices $C$, as well as
$A \cdot (B + C) = A \cdot B + A \cdot C$ for all $l \times m$ matrices $A$ and $m \times n$ matrices $B, C$.
Square matrices can be multiplied by themselves; analogously to powers of real numbers, the matrix powers $A^2$ or $A^3$ etc. are abbreviated in this way. It thus makes sense to substitute square matrices into polynomials. For more on this, see characteristic polynomial. The Jordan normal form can be used here for easier computation. Square matrices over $\mathbb{R}$ or $\mathbb{C}$ can even be substituted into power series, cf. matrix exponential. The square matrices over a ring $R$, that is, $R^{n \times n}$, play a special role with regard to matrix multiplication. Together with matrix addition and multiplication, they in turn form a ring, which is called the matrix ring.
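A short NumPy sketch (illustrative values) of the sum-of-products rule, the failure of commutativity, associativity, and a matrix power:

    import numpy as np

    A = np.array([[1, 2],
                  [3, 4]])
    B = np.array([[0, 1],
                  [1, 0]])
    C = np.array([[2, 0],
                  [0, 2]])

    # Entry (i, j) of A @ B is the sum of products of row i of A and column j of B
    print(A @ B)                                      # [[2 1], [4 3]]
    print(np.array_equal(A @ B, B @ A))               # False: not commutative
    print(np.array_equal((A @ B) @ C, A @ (B @ C)))   # True: associative

    # Matrix power A^3 = A @ A @ A
    print(np.linalg.matrix_power(A, 3))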
Further arithmetic operations
Transposed matrix
The transpose of an $m \times n$ matrix $A = (a_{ij})$ is the $n \times m$ matrix $A^T = (a_{ji})$; that is, for
- $A = \begin{pmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{m1} & \cdots & a_{mn} \end{pmatrix}$
the transpose is
- $A^T = \begin{pmatrix} a_{11} & \cdots & a_{m1} \\ \vdots & & \vdots \\ a_{1n} & \cdots & a_{mn} \end{pmatrix}.$
One thus writes the first row as the first column, the second row as the second column, and so on; the matrix is mirrored at its main diagonal. The following calculation rules apply:
- $(A + B)^T = A^T + B^T, \quad (\lambda A)^T = \lambda A^T, \quad (A \cdot B)^T = B^T \cdot A^T, \quad (A^T)^T = A.$
For matrices over $\mathbb{R}$, the adjoint matrix is exactly the transposed matrix.
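The transposition rules can be checked numerically, for example with this small NumPy sketch (illustrative values):

    import numpy as np

    A = np.array([[1, 2, 3],
                  [4, 5, 6]])   # 2x3
    B = np.array([[1, 0],
                  [0, 1],
                  [2, 2]])      # 3x2

    print(A.T)                                   # rows of A become columns of A.T
    print(np.array_equal((A @ B).T, B.T @ A.T))  # True: (A*B)^T = B^T * A^T
    print(np.array_equal(A.T.T, A))              # True: (A^T)^T = A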
Inverse matrix
If the determinant of a square $n \times n$ matrix $A$ over a field is not equal to zero, i.e. if $\det A \neq 0$, then the matrix $A^{-1}$ inverse to $A$ exists. For it,
- $A \cdot A^{-1} = A^{-1} \cdot A = I_n,$
where $I_n$ is the $n \times n$ identity matrix. Matrices that possess an inverse matrix are called invertible or regular matrices; these have full rank. Conversely, non-invertible matrices are called singular matrices. A generalization of the inverse to singular matrices is given by the so-called pseudoinverse matrices.
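A minimal sketch of computing an inverse and verifying the defining equation (NumPy, illustrative values):

    import numpy as np

    A = np.array([[2.0, 1.0],
                  [1.0, 1.0]])

    if np.linalg.det(A) != 0:                       # A is regular (invertible)
        A_inv = np.linalg.inv(A)
        print(A_inv)                                # [[ 1. -1.], [-1.  2.]]
        print(np.allclose(A @ A_inv, np.eye(2)))    # True: A * A^{-1} = I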
Vector-vector products
The matrix product $v \cdot w$ of two $n \times 1$ vectors $v$ and $w$ is not defined, since the number of columns of $v$ is in general not equal to the number of rows of $w$. The two products $v^T \cdot w$ and $v \cdot w^T$ do exist, however.
The first product, $v^T \cdot w$, is a $1 \times 1$ matrix that is interpreted as a number; it is called the standard scalar product of $v$ and $w$ and is denoted by $\langle v, w \rangle$ or $v \cdot w$. Geometrically, in a Cartesian coordinate system this scalar product corresponds to the product
- $|v| \cdot |w| \cdot \cos \varphi$
of the magnitudes of the two vectors and the cosine of the angle $\varphi$ enclosed by them. For example,
- $\begin{pmatrix} 1 & 2 & 3 \end{pmatrix} \cdot \begin{pmatrix} 4 \\ 5 \\ 6 \end{pmatrix} = 1 \cdot 4 + 2 \cdot 5 + 3 \cdot 6 = 32.$
The second product, $v \cdot w^T$, is an $n \times n$ matrix and is called the dyadic product or tensor product of $v$ and $w$ (written $v \otimes w$). Its columns are scalar multiples of $v$, its rows scalar multiples of $w^T$. For example,
- $\begin{pmatrix} 1 \\ 2 \\ 3 \end{pmatrix} \cdot \begin{pmatrix} 4 & 5 & 6 \end{pmatrix} = \begin{pmatrix} 4 & 5 & 6 \\ 8 & 10 & 12 \\ 12 & 15 & 18 \end{pmatrix}.$
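Both vector-vector products in one compact NumPy sketch (illustrative values, matching the example above):

    import numpy as np

    v = np.array([1, 2, 3])
    w = np.array([4, 5, 6])

    # Standard scalar product v^T w, read as a number
    print(v @ w)           # 32 = 1*4 + 2*5 + 3*6

    # Dyadic (outer) product v w^T: columns are multiples of v, rows multiples of w^T
    print(np.outer(v, w))  # [[ 4  5  6], [ 8 10 12], [12 15 18]]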
Vector spaces of matrices
The set of $m \times n$ matrices over a field $K$, together with matrix addition and scalar multiplication, forms a $K$-vector space. This vector space $K^{m \times n}$ has dimension $m \cdot n$. A basis of $K^{m \times n}$ is given by the set of standard matrices $E_{ij}$ with $1 \le i \le m$, $1 \le j \le n$, in which the entry at position $(i, j)$ is $1$ and all other entries are $0$. This basis is sometimes called the standard basis of $K^{m \times n}$.
The trace of the matrix product
- $\langle A, B \rangle := \operatorname{tr}(A^T B)$
is then, in the special case $K = \mathbb{R}$, a real scalar product. In this Euclidean vector space, the symmetric matrices and the skew-symmetric matrices are perpendicular to one another: if $A$ is symmetric and $B$ skew-symmetric, then $\langle A, B \rangle = 0$.
In the special case $K = \mathbb{C}$, the trace of the matrix product
- $\langle A, B \rangle := \operatorname{tr}(A^H B)$
(where $A^H$ denotes the conjugate transpose of $A$) is a complex scalar product, and the matrix space $\mathbb{C}^{m \times n}$ becomes a unitary vector space. This scalar product is called the Frobenius scalar product. The norm induced by the Frobenius scalar product is called the Frobenius norm, and with it the matrix space becomes a Banach space.
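The Frobenius scalar product and the induced norm as a short NumPy sketch (illustrative values):

    import numpy as np

    A = np.array([[1.0, 2.0],
                  [3.0, 4.0]])
    B = np.array([[0.0, 1.0],
                  [1.0, 0.0]])

    # Frobenius scalar product <A, B> = trace(A^T B)
    print(np.trace(A.T @ B))              # 5.0

    # Frobenius norm ||A|| = sqrt(<A, A>), the same as np.linalg.norm(A, 'fro')
    print(np.sqrt(np.trace(A.T @ A)))
    print(np.linalg.norm(A, 'fro'))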
Applications
Relationship with linear maps
What is special about matrices over a ring $R$ is their connection with linear maps. For every matrix $A \in R^{m \times n}$, a linear map $f \colon R^n \to R^m$ with domain $R^n$ (set of column vectors) and codomain $R^m$ can be defined by mapping each column vector $x \in R^n$ to $f(x) = A \cdot x$. Conversely, every linear map $f \colon R^n \to R^m$ corresponds in this way to exactly one $m \times n$ matrix $A$; its columns are the images under $f$ of the standard basis vectors of $R^n$. This relationship between linear maps and matrices is also known as the (canonical) isomorphism
- $\operatorname{Hom}_R(R^n, R^m) \cong R^{m \times n}.$
For given $m$ and $n$, it represents a bijection between the set of $m \times n$ matrices and the set of linear maps $R^n \to R^m$. Under it, the matrix product corresponds to the composition (sequential execution) of linear maps. Because brackets play no role when composing three linear maps one after another, this also holds for matrix multiplication, which is therefore associative.
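This correspondence can be made concrete in a small NumPy sketch (illustrative values): the columns of a matrix are exactly the images of the standard basis vectors under the map x ↦ A·x.

    import numpy as np

    A = np.array([[1, 2],
                  [3, 4],
                  [5, 6]])     # a 3x2 matrix: a linear map from R^2 to R^3

    e1 = np.array([1, 0])
    e2 = np.array([0, 1])

    print(A @ e1)  # [1 3 5] = first column of A
    print(A @ e2)  # [2 4 6] = second column of A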
If $K$ is even a field, one can consider, instead of the column vector spaces, arbitrary finite-dimensional $K$-vector spaces $V$ and $W$ (of dimension $n$ and $m$, respectively). (If $K$ is a commutative ring with 1, one can analogously consider free $K$-modules.) After a choice of bases $v_1, \ldots, v_n$ of $V$ and $w_1, \ldots, w_m$ of $W$, these are isomorphic to the coordinate spaces $K^n$ and $K^m$, respectively, because for any given vector $u \in V$ a unique decomposition into basis vectors
- $u = \sum_{j=1}^{n} u_j v_j$
exists, and the field elements $u_j$ occurring in it form the coordinate vector
- $(u_1, \ldots, u_n)^T \in K^n.$
However, the coordinate vector depends on the basis $v_1, \ldots, v_n$ used, which is therefore also part of the notation.
The situation in the vector space $W$ is analogous. If a linear map $f \colon V \to W$ is given, the images of the basis vectors of $V$ can be decomposed uniquely into the basis vectors of $W$ in the form
- $f(v_j) = \sum_{i=1}^{m} a_{ij} w_i$
with coordinate vector
- $(a_{1j}, \ldots, a_{mj})^T \in K^m.$
The map $f$ is then completely determined by the so-called mapping matrix
- $A = (a_{ij}) \in K^{m \times n},$
because for the image of the above vector $u$ one has
- $f(u) = \sum_{j=1}^{n} u_j f(v_j) = \sum_{j=1}^{n} u_j \sum_{i=1}^{m} a_{ij} w_i = \sum_{i=1}^{m} \Bigl( \sum_{j=1}^{n} a_{ij} u_j \Bigr) w_i,$
thus "coordinate vector of $f(u)$ = matrix times coordinate vector of $u$". (The matrix $A$ depends on the bases used in $V$ and $W$; in the multiplication, the basis that stands to the left and right of the multiplication dot is "cancelled", and the "outer" basis remains.)
The composition of two linear maps $f \colon V \to W$ and $g \colon W \to X$ (with respect to chosen bases of $V$, $W$ and $X$) corresponds to matrix multiplication: if $A_f$ and $A_g$ are the mapping matrices of $f$ and $g$, then the mapping matrix of $g \circ f$ is
- $A_{g \circ f} = A_g \cdot A_f.$
(Here, too, the basis of $W$ is "cancelled".)
Thus the set of linear maps from $V$ to $W$ is again isomorphic to $K^{m \times n}$. This isomorphism, however, depends on the chosen bases of $V$ and $W$ and is therefore not canonical: if a different basis is chosen for $V$ or for $W$, the same linear map is assigned a different matrix, which arises from the old one by multiplication from the right or from the left with an invertible $n \times n$ or $m \times m$ matrix (a so-called change-of-basis matrix) that depends only on the bases involved. This follows by applying the multiplication rule of the previous paragraph twice, namely to $\operatorname{id}_W \circ f \circ \operatorname{id}_V$ ("new matrix = change-of-basis matrix times old matrix times change-of-basis matrix"). Here the identity maps $\operatorname{id}_V$ and $\operatorname{id}_W$ map every vector of $V$ and of $W$, respectively, to itself.
If a property of matrices remains unaffected by such basis changes, it makes sense to assign this property to the corresponding linear map independently of the basis.
Terms often used in connection with matrices are the rank and the determinant of a matrix. The rank is (if $K$ is a field) independent of the basis in the sense described, so one can also speak of the rank of a linear map. The determinant is defined only for square matrices, which correspond to the case $V = W$; it remains unchanged if the same basis change is carried out in domain and codomain, the two change-of-basis matrices then being inverse to one another:
- $\det(T^{-1} A\, T) = \det(T^{-1}) \det(A) \det(T) = \det(A).$
In this sense, the determinant too is independent of the basis.
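A numerical check that the determinant is unchanged under a common basis change in domain and codomain (NumPy sketch; the matrices are arbitrary illustrative choices):

    import numpy as np

    A = np.array([[2.0, 1.0],
                  [0.0, 3.0]])     # mapping matrix with respect to the old basis
    T = np.array([[1.0, 1.0],
                  [0.0, 1.0]])     # invertible change-of-basis matrix

    A_new = np.linalg.inv(T) @ A @ T   # the same linear map in the new basis
    print(np.linalg.det(A), np.linalg.det(A_new))   # both 6.0 (up to rounding)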
Transformation of matrix equations
Especially in multivariate methods, proofs and derivations are often carried out in matrix calculus.
In principle, equations are transformed like algebraic equations, although the non-commutativity of matrix multiplication and the existence of zero divisors must be taken into account.
Example: Linear system of equations as a simple transformation
The solution vector $x$ of a linear system of equations
- $A \cdot x = b$
is sought, with $A$ as the $n \times n$ coefficient matrix. If the inverse matrix $A^{-1}$ exists, one can multiply by it from the left:
- $A^{-1} \cdot A \cdot x = A^{-1} \cdot b \;\Longleftrightarrow\; I_n \cdot x = A^{-1} \cdot b$
and obtains the solution
- $x = A^{-1} \cdot b.$
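As a concrete sketch (NumPy, illustrative values; in numerical practice np.linalg.solve is preferred over forming the inverse explicitly):

    import numpy as np

    A = np.array([[2.0, 1.0],
                  [1.0, 3.0]])    # coefficient matrix
    b = np.array([3.0, 5.0])      # right-hand side

    x = np.linalg.solve(A, b)     # solves A x = b
    print(x)                      # [0.8 1.4]
    print(np.allclose(A @ x, b))  # True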
Special matrices
Properties of endomorphisms
The following properties of square matrices correspond to properties of the endomorphisms they represent.
- Orthogonal matrices
- A real matrix $A$ is orthogonal if the associated linear map preserves the standard scalar product, that is, if
- $\langle A x, A y \rangle = \langle x, y \rangle$
- holds for all $x, y$. This condition is equivalent to $A$ satisfying the equation
- $A^T A = I$
- or, equivalently,
- $A^{-1} = A^T.$
- These matrices represent reflections, rotations, and rotary reflections.
- Unitary matrices
- They are the complex counterpart of the orthogonal matrices. A complex matrix $A$ is unitary if the associated map preserves the standard (Hermitian) scalar product, that is, if
- $\langle A x, A y \rangle = \langle x, y \rangle$
- holds. This condition is equivalent to $A$ satisfying the equation
- $A^H A = I$ or $A^{-1} = A^H$;
- here $A^H$ denotes the conjugate transpose of $A$.
- If one regards the $n$-dimensional complex vector space as a $2n$-dimensional real vector space, then the unitary matrices correspond exactly to those orthogonal matrices that commute with multiplication by $i$.
- Projection matrices
- A matrix $P$ is a projection matrix if
- $P \cdot P = P$
- holds, i.e. if it is idempotent; that is, repeated application of a projection matrix to a vector leaves the result unchanged. An idempotent matrix does not have full rank unless it is the identity matrix. Geometrically, projection matrices correspond to the parallel projection along the null space of the matrix. If the null space is perpendicular to the image space, an orthogonal projection is obtained.
Example: Let $A$ be an $m \times n$ matrix with $m \neq n$ (and therefore not itself invertible). If the rank of $A$ equals $n$, then $A^T A$ is invertible, and the matrix
- $P = A \, (A^T A)^{-1} A^T$
- is idempotent. This matrix is used, for example, in the method of least squares (see also the numerical sketch after this list).
- Nilpotent matrices
- A matrix $N$ is called nilpotent if some power $N^k$ (and hence every higher power) equals the zero matrix.
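Some of these properties checked numerically (NumPy sketch; the matrices are arbitrary illustrative choices):

    import numpy as np

    # Orthogonal matrix: a rotation by 90 degrees, Q^T Q = I
    Q = np.array([[0.0, -1.0],
                  [1.0,  0.0]])
    print(np.allclose(Q.T @ Q, np.eye(2)))       # True

    # Projection matrix P = A (A^T A)^{-1} A^T for A with full column rank: P P = P
    A = np.array([[1.0, 0.0],
                  [1.0, 1.0],
                  [0.0, 1.0]])
    P = A @ np.linalg.inv(A.T @ A) @ A.T
    print(np.allclose(P @ P, P))                 # True: P is idempotent

    # Nilpotent matrix: N^2 is the zero matrix
    N = np.array([[0.0, 1.0],
                  [0.0, 0.0]])
    print(np.allclose(N @ N, np.zeros((2, 2))))  # True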
Properties of bilinear forms
Listed below are properties of matrices that correspond to properties of the associated bilinear form
- $(x, y) \mapsto x^T A y.$
Nevertheless, these properties can also have an independent meaning for the endomorphisms represented.
- Symmetric matrices
- A matrix $A$ is called symmetric if it is equal to its transposed matrix:
- $A = A^T$
- In plain terms, the entries of a symmetric matrix are symmetric with respect to the main diagonal.
- Example:
- $\begin{pmatrix} 1 & 2 & 3 \\ 2 & 4 & 5 \\ 3 & 5 & 6 \end{pmatrix}$
- On the one hand, symmetric matrices correspond to symmetric bilinear forms:
- $x^T A y = y^T A x,$
- on the other hand to the self-adjoint linear maps:
- $\langle A x, y \rangle = \langle x, A y \rangle.$
- Hermitian matrices
- Hermitian matrices are the complex analog of symmetric matrices. They correspond to the Hermitian sesquilinear forms and the self-adjoint endomorphisms .
- A matrix $A$ is Hermitian or self-adjoint if:
- $A = A^H$
- Skew-symmetric matrices
- A matrix $A$ is called skew-symmetric or antisymmetric if:
- $A = -A^T$
- To meet this requirement, all entries on the main diagonal must be zero; the remaining values are mirrored at the main diagonal and multiplied by $-1$.
- Example:
- $\begin{pmatrix} 0 & 1 & 2 \\ -1 & 0 & 3 \\ -2 & -3 & 0 \end{pmatrix}$
- Skew-symmetric matrices correspond to antisymmetric bilinear forms:
- $x^T A y = -\,y^T A x,$
- and to anti-self-adjoint endomorphisms:
- $\langle A x, y \rangle = -\,\langle x, A y \rangle.$
- Positive definite matrices
- A real matrix $A$ is positive definite if the associated bilinear form is positive definite, that is, if for all vectors $x \neq 0$:
- $x^T A x > 0.$
- Positive definite matrices define generalized scalar products. If the bilinear form takes no negative values, the matrix is called positive semidefinite. Analogously, a matrix is called negative definite or negative semidefinite if the above bilinear form takes only negative values, or no positive values, respectively. Matrices that satisfy none of these properties are called indefinite (a small numerical check is sketched after this list).
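Positive definiteness can be checked numerically, for instance via the eigenvalues of a symmetric matrix (NumPy sketch; the matrix is an arbitrary illustrative choice):

    import numpy as np

    A = np.array([[2.0, 1.0],
                  [1.0, 2.0]])               # symmetric: A == A.T

    # For a real symmetric matrix: positive definite <=> all eigenvalues > 0
    print(np.all(np.linalg.eigvalsh(A) > 0))  # True (eigenvalues 1 and 3)

    # Equivalently, x^T A x > 0 for every nonzero x; one sample vector:
    x = np.array([1.0, -1.0])
    print(x @ A @ x)                          # 2.0 > 0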
Other constructions
If a matrix contains complex numbers, the conjugate matrix $\bar{A}$ is obtained by replacing its entries with their complex conjugates. The adjoint matrix (also Hermitian conjugate matrix) $A^H$ of a matrix $A$ is the transposed matrix in which, in addition, all entries are complex conjugated.
- Adjugate or complementary matrix
The complementary matrix $\operatorname{adj}(A)$ of a square matrix $A$ is made up of its subdeterminants, where a subdeterminant is also called a minor. To determine the subdeterminant $\det A_{ij}$, the $i$-th row and the $j$-th column of $A$ are deleted, and the determinant of the resulting $(n-1) \times (n-1)$ matrix is computed. The complementary matrix then has the entries $(-1)^{i+j} \det A_{ji}$; it is the transpose of the matrix of cofactors and is sometimes itself referred to as the matrix of cofactors.
- The complementary matrix is used, for example, to calculate the inverse of a matrix $A$, because according to Laplace's expansion theorem:
- $A \cdot \operatorname{adj}(A) = \operatorname{adj}(A) \cdot A = \det(A) \cdot I_n$
- So the inverse is $A^{-1} = \dfrac{1}{\det A} \operatorname{adj}(A)$ if $\det A \neq 0$.
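A small sketch of the cofactor formula for the inverse of a 2×2 matrix, compared with np.linalg.inv (NumPy, illustrative values):

    import numpy as np

    A = np.array([[2.0, 1.0],
                  [1.0, 1.0]])

    # For a 2x2 matrix the complementary (adjugate) matrix is
    # adj(A) = [[ a22, -a12],
    #           [-a21,  a11]]
    adj = np.array([[ A[1, 1], -A[0, 1]],
                    [-A[1, 0],  A[0, 0]]])

    print(np.allclose(A @ adj, np.linalg.det(A) * np.eye(2)))     # A adj(A) = det(A) I
    print(np.allclose(adj / np.linalg.det(A), np.linalg.inv(A)))  # inverse via adj/det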
A transition matrix or stochastic matrix is a matrix whose entries all lie between 0 and 1 and whose row sums or column sums equal 1. In stochastics, they serve to characterize discrete-time Markov chains with finite state space. The doubly stochastic matrices are a special case of these.
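A minimal Markov chain sketch with a row-stochastic transition matrix (NumPy; the values are arbitrary illustrative choices):

    import numpy as np

    # Row-stochastic transition matrix: entries in [0, 1], each row sums to 1
    P = np.array([[0.9, 0.1],
                  [0.4, 0.6]])
    print(np.allclose(P.sum(axis=1), 1.0))  # True

    # Distribution after one step of the Markov chain (row-vector convention)
    pi0 = np.array([1.0, 0.0])              # start in state 0
    print(pi0 @ P)                          # [0.9 0.1]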
Infinite-dimensional spaces
For infinite-dimensional vector spaces (even over skew fields), too, every linear map $f \colon V \to W$ is uniquely determined by the images of the elements of a basis, and these images can be chosen arbitrarily and extended to a linear map on all of $V$. If now $B$ is a basis of $V$ and $B'$ a basis of $W$, then $f(b)$ for $b \in B$ can be written uniquely as a (finite) linear combination of basis vectors; that is, there are unique coefficients $a_{b',b}$ for $b' \in B'$, of which only finitely many are different from zero, such that $f(b) = \sum_{b' \in B'} a_{b',b}\, b'$. Accordingly, every linear map can be understood as a possibly infinite matrix in which, however, every column (the columns "numbered" by $B$, with the column for $b$ consisting of the coordinates of $f(b)$ numbered by $B'$) contains only finitely many nonzero entries, and vice versa. The correspondingly defined matrix multiplication again corresponds to the composition of linear maps.
In functional analysis one considers topological vector spaces, i.e. vector spaces on which one can speak of convergence and accordingly form infinite sums. On such spaces, matrices with infinitely many nonzero entries in a column can, under certain circumstances, be understood as linear maps; other basic notions are then also involved.
Hilbert spaces are a special case. So let $H$ and $H'$ be Hilbert spaces and $(e_i)$ and $(f_j)$ orthonormal bases of $H$ and $H'$, respectively. Then a matrix representation of a linear operator $T \colon H \to H'$ is obtained (for merely densely defined operators it works just as well, provided the domain has an orthonormal basis, which is always the case in the countable-dimensional situation) by defining the matrix elements $T_{ji} := \langle f_j, T e_i \rangle$; here $\langle \cdot, \cdot \rangle$ is the scalar product in the Hilbert space under consideration (in the complex case semilinear in the first argument).
This so-called Hilbert-Schmidt scalar product can be defined in the infinite-dimensional case only for a certain subclass of linear operators, the so-called Hilbert-Schmidt operators, for which the series defining this scalar product always converges.
Literature
- Gerd Fischer : Linear Algebra. (An introduction for first-year students). 13th revised edition. Vieweg, Braunschweig et al. 2002, ISBN 3-528-97217-3 .
- Günter Gramlich: Linear Algebra. Fachbuchverlag Leipzig in Carl Hanser Verlag, Munich et al. 2003, ISBN 3-446-22122-0 .
- Klaus Jänich : Linear Algebra. 11th edition. Springer, Berlin et al. 2008, ISBN 978-3-540-75501-2 .
- Karsten Schmidt, Götz Trenkler: Introduction to Modern Matrix Algebra. With applications in statistics. 2nd, completely revised edition. Springer, Berlin et al. 2006, ISBN 3-540-33007-0 .
- Gilbert Strang : Linear Algebra. Springer, Berlin et al. 2003, ISBN 3-540-43949-8 .
Web links
- Matrix calculator - calculator that performs arithmetic operations for matrices with concrete numerical values, but also with variables.
- The Matrix Cookbook - An extensive matrix formulary collection in English (PDF; 522 kB).
References
- ↑ Eric W. Weisstein: Hypermatrix. In: MathWorld (English).