Principal axis transformation

from Wikipedia, the free encyclopedia

The principal axis transformation (HAT) is a process in Euclidean geometry with which the equations of quadrics ( ellipse , hyperbola , ...; ellipsoid , hyperboloid , ...) are brought to the respective normal form by means of a suitable coordinate transformation and thus their type and geometric properties (Center, vertex, semi-axes) can determine. Orthogonal coordinate transformations (rotations, reflections, shifts) must be used (see below) so that lengths and angles are not changed during the transformation . The essential aid of this procedure is the diagonalization of a symmetrical matrix with the help of an orthogonal matrix.

Major axis transformation of an ellipse with the help of a rotation of the coordinate system

In addition to the purely mathematical-geometric meaning of the main axis transformation for determining the type of quadrics, it is used in numerous disciplines of theoretical physics as well as in computer science and the geosciences (see section Application ).

Simple examples and motivation of the procedure

example 1

That the equation

Describing the circle with its center and radius can be recognized by adding the square to the form .

Ellipse with main axes parallel to the axes

Example 2

The equation too

can be brought to the shape by adding a square and you can see that it is an ellipse with a center and the semi-axes .

Example 3

It is much more difficult to find the equation

to see that it is an ellipse with the semiaxes . The problem arises from the "mixed" term . It is a sign that the mutually perpendicular main axes in this case are not parallel to the coordinate axes. This can be changed by using a suitable rotation of the coordinate system (around the zero point). The angle of rotation results from the coefficients at (see conic sections ). However, this descriptive procedure becomes very confusing when examining quadrics in Euclidean space. Linear algebra provides a method that can be used in any dimension: the diagonalization of symmetric matrices. To do this, write the equation of the quadric with the help of a symmetrical matrix: the coefficients of are on the diagonal of the matrix, and half of the coefficients of are on the secondary diagonal .

Now the matrix is ​​diagonalized by applying an orthogonal coordinate transformation (rotation or rotation mirroring im ). In the case of an orthogonal coordinate transformation, lengths are not changed, so that after the transformation and any necessary quadratic addition (see above), the lengths of the semi-axes and positions of the center and vertex can be read off.

Diagonalization of a symmetric matrix (principal axis theorem)

A symmetric matrix always has an orthogonal matrix , so

is a diagonal matrix. The main diagonal of the diagonal matrix consists of the eigenvalues ​​of the matrix . A symmetric matrix always has real eigenvalues ​​taking into account the respective multiplicity. For the matrix , orthonormal eigenvectors of the matrix are chosen as column vectors . (Eigenvectors of a symmetrical matrix with different eigenvalues ​​are always orthogonal. If a eigenspace has a dimension larger than 1, one has to determine an orthonormal basis of the eigenspace with the help of the Gram-Schmidt orthogonalization method.) The determinant of is . So that there is a rotation matrix in the plane case, one must choose the orientations of the eigenvectors used so that is.

If the matrix is ​​interpreted as a linear mapping , the matrix can be understood as a transformation on the new basis . The relationship exists between the old and new coordinates . The effect of the matrix in the new coordinate system is taken over by the diagonal matrix . An important property of the (orthogonal!) Matrix is . So can also be easily coordinate old into new convert: .

The equation of a quadric

has a square part that can be described by a ( oBdA symmetrical) matrix . With the main axis theorem, this quadratic part is transformed into the "diagonal shape " . After that, there are no more mixed terms with .

Major axis transformation of a conic section

Description of the method

A conic section suffices for one equation

.

This equation can be written in matrix form as follows:

Step 1
Set .
2nd step
Find the eigenvalues ​​of the matrix as solutions to the eigenvalue equation
. The eigenvalues ​​are .
3rd step
Determine normalized eigenvectors from the 2 systems of equations:
4th step
Set and replace by using .
5th step
The result is the equation of the conic section in the new coordinates:
Since the quadratic part in this equation is fixed by the eigenvalues ​​as coefficients and the disappearance of the mixed part, the old coordinates x and y only need to be replaced in the linear part .
6th step
By adding a square, you get the center or apex shape of the conic section and can read off the center (for ellipse, hyperbola, ...) or apex (for parabola) and possibly semiaxes.
7th step
With the help of the relationship , the - -coordinates of the center point and vertex can finally be calculated.

Example 3 (continued)

Major axis transformation: Ellipse with major axes NOT parallel to the axes
Step 1
2nd step
3rd step
A normalized eigenvector zu results from the linear system of equations
to .
A normalized eigenvector zu results from the linear system of equations
to .
4th step
and this results in:
Because of this , the transformation is a rotation around the angle . The latter follows from (see rotation matrix ).
5th step
6th step
Since there is neither linear nor linear in the last equation , no quadratic addition is necessary.
Result: The conic section is an ellipse with the center at the zero point and the semi-axes . The vertices are in - coordinates:
7th step
The - coordinates of the vertices are (see 4th step):

Note: The new coordinate system and the matrix S are not clearly defined. Both depend on the order of the eigenvalues ​​and the orientation of the eigenvectors chosen. The position of the conic section (center point, vertex) in the xy coordinate system is clearly determined by the given conic section equation.

Example 4: hyperbola

Major axis transformation of a hyperbola

The conic section has the equation:

Step 1
2nd step
3rd step
4th step
5th step
6th step
The conic section is a hyperbola with the center point and the semiaxes .
7th step

The xy coordinates of the center are (see 4th step).

Example 5: parabola

Major axis transformation of a parabola

The conic section has the equation:

It is

.

The associated eigenvalues ​​are and the transformation matrix is . In - -coordinates, the conic section satisfies the equation:

So the conic section is a parabola with the vertex or in xy coordinates . The matrix describes a rotation around the angle .

Principal axis transformation of surfaces

The principal axis transformation for quadrics in space proceeds according to the same method as in the plane case for conic sections. However, the bills are much more extensive.

Example: hyperboloid

The principal axis transformation is used to determine which area is described by the following equation:

Step 1
2nd step
The eigenvalues ​​are:
3rd step

Determination of the eigenvectors:

to
to
and to
Major axis transformation: single-shell hyperboloid in - - coordinates with 2 vertices and a secondary vertex
4th step
5th step
6th step

The quadric is a single-shell hyperboloid (see list of quadrics ) with the center in the zero point, the semiaxes , the vertices and the minor vertices .

7th step

With the relationships in step 4 you get the vertices or minor vertices in xyz coordinates: (the center is the zero point).

Example: cone

The equation is given

This equation is to be converted into a normal form by means of a major axis transformation and the type of quadric represented by the equation is to be determined.

Step 1
2nd step
The eigenvalues ​​of the matrix are .
3rd step

The eigenspace zu is the solution of the (one!) Equation . Two mutually orthogonal solution vectors have to be determined. One solution is . A solution vector orthogonal to this must also satisfy the equation . Obviously the vector satisfies both equations. Now both vectors have to be normalized:

A normalized eigenvector zu is

4th step
Major axis transformation of a cone (the apex is the point (1,1,1), the center of the represented base circle is the zero point)
5th step
6th step

Square complement supplies

.
The quadric is a vertical circular cone with the tip at the point and the axis as the axis of rotation.
7th step

The point is the point in xyz coordinates . The cone axis has the direction .

Major axis transformation in any dimension

A quadric im is (analogous to n = 2) the solution set of a general quadratic equation (see quadric ):

where a symmetric matrix and and are column vectors .

The main axis transformation in this general case proceeds according to the same scheme as for conic sections (see above). After the diagonalization, however, the zero point is often shifted to the center or vertex of the quadric, so that the normal shape of the quadric arises, from which the type and properties of the quadric can be read.

application

In theoretical physics , the main axis transformation is used in classical mechanics to describe the kinematics of rigid bodies : Here, using a main axis transformation of the inertia tensor , which specifies the inertia of the body with respect to rotations around different axes, any moments of deviation that may be present - for example in the case of a top  - for Disappear.

A moment of deviation is a measure of the tendency of a rigid body to change its axis of rotation. Moments of deviation are combined with the moments of inertia in inertia tensors, with the moments of inertia being on the main diagonal of the tensor and the moments of deviation on the secondary diagonals . As shown above, the symmetric inertia tensor can be made into a diagonal shape. The axes of the new, adjusted coordinate system determined by the main axis transformation are called the main axes of inertia , the new coordinate system is called the main axis system. The diagonal elements of the transformed tensor are consequently called the main moments of inertia .

The main axis transformation is also used in other sub-areas of classical mechanics, for example in strength theory to calculate the main stresses that act on a body. Major axis transformations are still frequently used in relativistic mechanics for the basic representation of spacetime in four-dimensional Minkowski space or, for example, in electrostatics for the quadrupole moment and other higher multipole moments.

In addition, the principal axis transformation in multivariate statistics is part of the principal component analysis , which is also known as the Karhunen-Loève transformation , especially in image processing . Sometimes the terms are used synonymously, but the two transformations are not identical.

In practice, the principal axis transformation is used as part of principal component analysis to reduce the size of large data sets without significant data loss. Existing relationships between individual statistical variables are reduced as much as possible by transferring them to a new, linearly independent, problem-adapted coordinate system. For example, the number of required signal channels can be reduced by arranging them according to variance and removing the channels with the lowest variance from the data set without any relevant loss of data. This can improve the efficiency and result of a later analysis of the data.

In electronic image processing, the reduction of the data set size through principal component analyzes is used, especially in remote sensing through satellite images and the associated scientific disciplines of geodesy , geography , cartography and climatology . Here, the quality of the satellite images can be significantly improved by suppressing the noise using principal component analysis.

In computer science, the main axis transformation is mainly used in pattern recognition , to create artificial neural networks , a sub-area of artificial intelligence , for data reduction (see main component analysis ).

literature

  • Burg & Haf & Wille: Higher Mathematics for Engineers. Volume II (Linear Algebra), Teubner-Verlag, Stuttgart 1992, ISBN 3 519 22956 0 , pp. 214, 335.
  • Meyberg & Vachenauer: Höhere Mathematik 1. Springer-Verlag, 1995, ISBN 3 540 59188 5 , p. 341.
  • W. Nolting: Basic course Theoretical Physics 1: Classical Mechanics. 7th edition. Springer, Berlin 2004, ISBN 3-540-21474-7 .
  • T. Fließbach: Mechanics. 2nd Edition. Spectrum, Heidelberg 1996, ISBN 3-86025-686-6 .
  • W. Greiner: Theoretical Physics. Volume 2. Mechanics Part 2. 5th edition. Harri Deutsch, Thun / Frankfurt am Main, 1989, ISBN 3-8171-1136-3 .

Individual evidence

  1. See pattern recognition script . Cape. 5.2: Karhunen-Loeve transformation. (PDF) Laboratory for Message Processing, University of Hanover.
  2. ^ Society for data analysis and remote sensing Hanover. ( Memento of July 10, 2009 in the Internet Archive ).
  3. a b Probabilistic major axis transformation for generic object recognition. ( Memento of July 18, 2007 in the Internet Archive ) ( Postscript ). Diploma thesis in computer science, Friedrich-Alexander-Universitat Erlangen-Nürnberg.