The principal axis transformation (HAT) is a process in Euclidean geometry with which the equations of quadrics ( ellipse , hyperbola , ...; ellipsoid , hyperboloid , ...) are brought to the respective normal form by means of a suitable coordinate transformation and thus their type and geometric properties (Center, vertex, semi-axes) can determine. Orthogonal coordinate transformations (rotations, reflections, shifts) must be used (see below) so that lengths and angles are not changed during the transformation . The essential aid of this procedure is the diagonalization of a symmetrical matrix with the help of an orthogonal matrix.
Major axis transformation of an ellipse with the help of a rotation of the coordinate system
In addition to the purely mathematical-geometric meaning of the main axis transformation for determining the type of quadrics, it is used in numerous disciplines of theoretical physics as well as in computer science and the geosciences (see section Application ).
Simple examples and motivation of the procedure
example 1
That the equation
Describing the circle with its center and radius can be recognized by adding the square to the form .
Ellipse with main axes parallel to the axes
Example 2
The equation too
can be brought to the shape by adding a square
and you can see that it is an ellipse with a center and the semi-axes .
Example 3
It is much more difficult to find the equation
to see that it is an ellipse with the semiaxes . The problem arises from the "mixed" term . It is a sign that the mutually perpendicular main axes in this case are not parallel to the coordinate axes. This can be changed by using a suitable rotation of the coordinate system (around the zero point). The angle of rotation results from the coefficients at (see conic sections ). However, this descriptive procedure becomes very confusing when examining quadrics in Euclidean space. Linear algebra provides a method that can be used in any dimension: the diagonalization of symmetric matrices. To do this, write the equation of the quadric with the help of a symmetrical matrix: the coefficients of are on the diagonal of the matrix, and half of the coefficients of are on the secondary diagonal .
Now the matrix is diagonalized by applying an orthogonal coordinate transformation (rotation or rotation mirroring im ). In the case of an orthogonal coordinate transformation, lengths are not changed, so that after the transformation and any necessary quadratic addition (see above), the lengths of the semi-axes and positions of the center and vertex can be read off.
Diagonalization of a symmetric matrix (principal axis theorem)
A symmetric matrix always has an orthogonal matrix , so
is a diagonal matrix. The main diagonal of the diagonal matrix consists of the eigenvalues of the matrix . A symmetric matrix always has real eigenvalues taking into account the respective multiplicity. For the matrix , orthonormal eigenvectors of the matrix are chosen as column vectors . (Eigenvectors of a symmetrical matrix with different eigenvalues are always orthogonal. If a eigenspace has a dimension larger than 1, one has to determine an orthonormal basis of the eigenspace with the help of the Gram-Schmidt orthogonalization method.) The determinant of is . So that there is a rotation matrix in the plane case, one must choose the orientations of the eigenvectors used so that is.
If the matrix is interpreted as a linear mapping , the matrix can be understood as a transformation on the new basis . The relationship exists between the old and new coordinates . The effect of the matrix in the new coordinate system is taken over by the diagonal matrix . An important property of the (orthogonal!) Matrix is . So can also be easily coordinate old into new convert: .
The equation of a quadric
has a square part that can be described by a ( oBdA symmetrical) matrix . With the main axis theorem, this quadratic part is transformed into the "diagonal shape " . After that, there are no more mixed terms with .
Major axis transformation of a conic section
Description of the method
A conic section suffices for one equation
-
.
This equation can be written in matrix form as follows:
- Step 1
- Set .
- 2nd step
- Find the eigenvalues of the matrix as solutions to the eigenvalue equation
-
. The eigenvalues are .
- 3rd step
- Determine normalized eigenvectors from the 2 systems of equations:
- 4th step
- Set and replace by using .
- 5th step
- The result is the equation of the conic section in the new coordinates:
- Since the quadratic part in this equation is fixed by the eigenvalues as coefficients and the disappearance of the mixed part, the old coordinates x and y only need to be replaced in the linear part .
- 6th step
- By adding a square, you get the center or apex shape of the conic section and can read off the center (for ellipse, hyperbola, ...) or apex (for parabola) and possibly semiaxes.
- 7th step
- With the help of the relationship , the - -coordinates of the center point and vertex can finally be calculated.
Example 3 (continued)
Major axis transformation: Ellipse with major axes NOT parallel to the axes
- Step 1
- 2nd step
- 3rd step
- A normalized eigenvector zu results from the linear system of equations
-
to .
- A normalized eigenvector zu results from the linear system of equations
-
to .
- 4th step
-
and this results in:
- Because of this , the transformation is a rotation around the angle . The latter follows from (see rotation matrix ).
- 5th step
- 6th step
- Since there is neither linear nor linear in the last equation , no quadratic addition is necessary.
- Result: The conic section is an ellipse with the center at the zero point and the semi-axes . The vertices are in - coordinates:
- 7th step
- The - coordinates of the vertices are (see 4th step):
Note: The new coordinate system and the matrix S are not clearly defined. Both depend on the order of the eigenvalues and the orientation of the eigenvectors chosen. The position of the conic section (center point, vertex) in the xy coordinate system is clearly determined by the given conic section equation.
Example 4: hyperbola
Major axis transformation of a hyperbola
The conic section has the equation:
- Step 1
- 2nd step
- 3rd step
- 4th step
- 5th step
- 6th step
- The conic section is a hyperbola with the center point and the semiaxes .
- 7th step
The xy coordinates of the center are (see 4th step).
Example 5: parabola
Major axis transformation of a parabola
The conic section has the equation:
It is
-
.
The associated eigenvalues are and the transformation matrix is . In - -coordinates, the conic section satisfies the equation:
So the conic section is a parabola with the vertex or in xy coordinates . The matrix describes a rotation around the angle .
Principal axis transformation of surfaces
The principal axis transformation for quadrics in space proceeds according to the same method as in the plane case for conic sections. However, the bills are much more extensive.
Example: hyperboloid
The principal axis transformation is used to determine which area is described by the following equation:
- Step 1
- 2nd step
- The eigenvalues are:
- 3rd step
Determination of the eigenvectors:
- to
- to
- and to
- 4th step
- 5th step
- 6th step
The quadric is a single-shell hyperboloid (see list of quadrics ) with the center in the zero point, the semiaxes , the vertices and the minor vertices .
- 7th step
With the relationships in step 4 you get the vertices or minor vertices in xyz coordinates: (the center is the zero point).
Example: cone
The equation is given
This equation is to be converted into a normal form by means of a major axis transformation and the type of quadric represented by the equation is to be determined.
- Step 1
- 2nd step
- The eigenvalues of the matrix are .
- 3rd step
The eigenspace zu is the solution of the (one!) Equation . Two mutually orthogonal solution vectors have to be determined. One solution is . A solution vector orthogonal to this must also satisfy the equation . Obviously the vector satisfies both equations. Now both vectors have to be normalized:
A normalized eigenvector zu is
- 4th step
Major axis transformation of a cone (the apex is the point (1,1,1), the center of the represented base circle is the zero point)
- 5th step
- 6th step
Square complement supplies
-
.
- The quadric is a vertical circular cone with the tip at the point and the axis as the axis of rotation.
- 7th step
The point is the point in xyz coordinates . The cone axis has the direction .
Major axis transformation in any dimension
A quadric im is (analogous to n = 2) the solution set of a general quadratic equation (see quadric ):
where a symmetric matrix and and are column vectors .
The main axis transformation in this general case proceeds according to the same scheme as for conic sections (see above). After the diagonalization, however, the zero point is often shifted to the center or vertex of the quadric, so that the normal shape of the quadric arises, from which the type and properties of the quadric can be read.
application
In theoretical physics , the main axis transformation is used in classical mechanics to describe the kinematics of rigid bodies : Here, using a main axis transformation of the inertia tensor , which specifies the inertia of the body with respect to rotations around different axes, any moments of deviation that may be present - for example in the case of a top - for Disappear.
A moment of deviation is a measure of the tendency of a rigid body to change its axis of rotation. Moments of deviation are combined with the moments of inertia in inertia tensors, with the moments of inertia being on the main diagonal of the tensor and the moments of deviation on the secondary diagonals . As shown above, the symmetric inertia tensor can be made into a diagonal shape. The axes of the new, adjusted coordinate system determined by the main axis transformation are called the main axes of inertia , the new coordinate system is called the main axis system. The diagonal elements of the transformed tensor are consequently called the main moments of inertia .
The main axis transformation is also used in other sub-areas of classical mechanics, for example in strength theory to calculate the main stresses that act on a body. Major axis transformations are still frequently used in relativistic mechanics for the basic representation of spacetime in four-dimensional Minkowski space or, for example, in electrostatics for the quadrupole moment and other higher multipole moments.
In addition, the principal axis transformation in multivariate statistics is part of the principal component analysis , which is also known as the Karhunen-Loève transformation , especially in image processing . Sometimes the terms are used synonymously, but the two transformations are not identical.
In practice, the principal axis transformation is used as part of principal component analysis to reduce the size of large data sets without significant data loss. Existing relationships between individual statistical variables are reduced as much as possible by transferring them to a new, linearly independent, problem-adapted coordinate system. For example, the number of required signal channels can be reduced by arranging them according to variance and removing the channels with the lowest variance from the data set without any relevant loss of data. This can improve the efficiency and result of a later analysis of the data.
In electronic image processing, the reduction of the data set size through principal component analyzes is used, especially in remote sensing through satellite images and the associated scientific disciplines of geodesy , geography , cartography and climatology . Here, the quality of the satellite images can be significantly improved by suppressing the noise using principal component analysis.
In computer science, the main axis transformation is mainly used in pattern recognition , to create artificial neural networks , a sub-area of artificial intelligence , for data reduction (see main component analysis ).
literature
- Burg & Haf & Wille: Higher Mathematics for Engineers. Volume II (Linear Algebra), Teubner-Verlag, Stuttgart 1992, ISBN 3 519 22956 0 , pp. 214, 335.
- Meyberg & Vachenauer: Höhere Mathematik 1. Springer-Verlag, 1995, ISBN 3 540 59188 5 , p. 341.
- W. Nolting: Basic course Theoretical Physics 1: Classical Mechanics. 7th edition. Springer, Berlin 2004, ISBN 3-540-21474-7 .
- T. Fließbach: Mechanics. 2nd Edition. Spectrum, Heidelberg 1996, ISBN 3-86025-686-6 .
- W. Greiner: Theoretical Physics. Volume 2. Mechanics Part 2. 5th edition. Harri Deutsch, Thun / Frankfurt am Main, 1989, ISBN 3-8171-1136-3 .
Individual evidence
-
↑ See pattern recognition script . Cape. 5.2: Karhunen-Loeve transformation. (PDF) Laboratory for Message Processing, University of Hanover.
-
^ Society for data analysis and remote sensing Hanover. ( Memento of July 10, 2009 in the Internet Archive ).
-
↑ a b Probabilistic major axis transformation for generic object recognition. ( Memento of July 18, 2007 in the Internet Archive ) ( Postscript ). Diploma thesis in computer science, Friedrich-Alexander-Universitat Erlangen-Nürnberg.