{{Short description|Real square matrix whose columns and rows are orthogonal unit vectors}} {{for|matrices with orthogonality over the complex number field|unitary matrix}} {{More footnotes needed|date=May 2023}} In linear algebra, an '''orthogonal matrix''' or '''orthonormal matrix {{mvar|Q}}''', is a real-valued square matrix whose columns and rows are orthonormal vectors.
One way to express this is <math display="block">Q^\mathrm{T} Q = Q Q^\mathrm{T} = I,</math> where {{math|''Q''<sup>T</sup>}} is the transpose of {{mvar|Q}} and {{mvar|I}} is the identity matrix.
This leads to the equivalent characterization: a matrix {{mvar|Q}} is orthogonal if its transpose is equal to its inverse: <math display="block">Q^\mathrm{T}=Q^{-1},</math> where {{math|''Q''<sup>−1</sup>}} is the inverse of {{mvar|Q}}.
An orthogonal matrix {{mvar|Q}} is necessarily invertible (with inverse {{math|1=''Q''<sup>−1</sup> = ''Q''<sup>T</sup>}}), unitary ({{math|1=''Q''<sup>−1</sup> = ''Q''<sup>∗</sup>}}), where {{math|1=''Q''<sup>∗</sup>}} is the Hermitian adjoint (conjugate transpose) of {{mvar|Q}}, and therefore normal ({{math|1=''Q''<sup>∗</sup>''Q'' = ''QQ''<sup>∗</sup>}}) over the real numbers. The determinant of any orthogonal matrix is either +1 or −1. As a linear transformation, an orthogonal matrix preserves the inner product of vectors, and therefore acts as an isometry of Euclidean space, such as a rotation, reflection or rotoreflection. In other words, it is a unitary transformation.
The set of {{math|''n'' × ''n''}} orthogonal matrices, under multiplication, forms the group {{math|O(''n'')}}, known as the orthogonal group. The subgroup {{math|SO(''n'')}} consisting of orthogonal matrices with determinant +1 is called the special orthogonal group, and each of its elements is a special orthogonal matrix. As a linear transformation, every special orthogonal matrix acts as a rotation.
==Overview== thumb|275px|Visual understanding of multiplication by the transpose of a matrix. If A is an orthogonal matrix and B is its transpose, the ij-th element of the product AA<sup>T</sup> will vanish if i≠j, because the i-th row of A is orthogonal to the j-th row of A.
An orthogonal matrix is the real specialization of a unitary matrix, and thus always a normal matrix. Although we consider only real matrices here, the definition can be used for matrices with entries from any field. However, orthogonal matrices arise naturally from dot products, and for matrices of complex numbers that leads instead to the unitary requirement. Orthogonal matrices preserve the dot product,<ref>{{Harvp|Dawkins|2005|loc=p. [https://www.cs.cornell.edu/courses/cs485/2006sp/LinAlg_Complete.pdf#page=298 298], theorem [https://web.archive.org/web/20121218233605/https://tutorial.math.lamar.edu/Classes/LinAlg/OrthogonalMatrix.aspx#VS_OrthogMatrix_Thm3 3], (a) and (c)}}</ref> so, for vectors {{math|'''u'''}} and {{math|'''v'''}} in an {{mvar|n}}-dimensional real Euclidean space <math display="block">{\mathbf u} \cdot {\mathbf v} = \left(Q {\mathbf u}\right) \cdot \left(Q {\mathbf v}\right) </math> where {{mvar|Q}} is an orthogonal matrix. To see the inner product connection, consider a vector {{math|'''v'''}} in an {{mvar|n}}-dimensional real Euclidean space. Written with respect to an orthonormal basis, the squared length of {{math|'''v'''}} is {{math|'''v'''<sup>T</sup>'''v'''}}. If a linear transformation, in matrix form {{math|''Q'''''v'''}}, preserves vector lengths, then <math display="block">{\mathbf v}^\mathrm{T}{\mathbf v} = (Q{\mathbf v})^\mathrm{T}(Q{\mathbf v}) = {\mathbf v}^\mathrm{T} Q^\mathrm{T} Q {\mathbf v} .</math>
Thus finite-dimensional linear isometries—rotations, reflections, and their combinations—produce orthogonal matrices. The converse is also true: orthogonal matrices imply orthogonal transformations. However, linear algebra includes orthogonal transformations between spaces which may be neither finite-dimensional nor of the same dimension, and these have no orthogonal matrix equivalent.
Orthogonal matrices are important for a number of reasons, both theoretical and practical. The {{math|''n'' × ''n''}} orthogonal matrices form a group under matrix multiplication, the orthogonal group denoted by {{math|O(''n'')}}, which—with its subgroups—is widely used in mathematics and the physical sciences. For example, the point group of a molecule is a subgroup of O(3). Because floating point versions of orthogonal matrices have advantageous properties, they are key to many algorithms in numerical linear algebra, such as {{mvar|QR}} decomposition. As another example, with appropriate normalization the discrete cosine transform (used in MP3 compression) is represented by an orthogonal matrix.
==Examples== Below are a few examples of small orthogonal matrices and possible interpretations. *<math> \begin{bmatrix} 1 & 0 \\ 0 & 1 \\ \end{bmatrix}</math>    (identity transformation) *<math> \begin{bmatrix} \cos \theta & -\sin \theta \\ \sin \theta & \cos \theta \\ \end{bmatrix}</math>    (rotation about the origin) *<math> \begin{bmatrix} 1 & 0 \\ 0 & -1 \\ \end{bmatrix}</math>    (reflection across ''x''-axis) *<math> \begin{bmatrix} 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \\ 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \end{bmatrix}</math>    (permutation of coordinate axes)
==Elementary constructions==
===Lower dimensions=== The simplest orthogonal matrices are the {{nowrap|1 × 1}} matrices [1] and [−1], which we can interpret as the identity and a reflection of the real line across the origin.
The {{nowrap|2 × 2}} matrices have the form <math display="block">\begin{bmatrix} p & t\\ q & u \end{bmatrix},</math> which orthogonality demands satisfy the three equations <math display="block">\begin{align} 1 & = p^2+t^2, \\ 1 & = q^2+u^2, \\ 0 & = pq+tu. \end{align}</math>
In consideration of the first equation, without loss of generality let {{math|1=''p'' = cos ''θ''}}, {{math|1=''q'' = sin ''θ''}}; then either {{math|1=''t'' = −''q''}}, {{math|1=''u'' = ''p''}} or {{math|1=''t'' = ''q''}}, {{math|1=''u'' = −''p''}}. We can interpret the first case as a rotation by {{mvar|θ}} (where {{math|1=''θ'' = 0}} is the identity), and the second as a reflection across a line at an angle of {{math|{{sfrac|''θ''|2}}}}.
<math display="block"> \begin{bmatrix} \cos \theta & -\sin \theta \\ \sin \theta & \cos \theta \\ \end{bmatrix}\text{ (rotation), }\qquad \begin{bmatrix} \cos \theta & \sin \theta \\ \sin \theta & -\cos \theta \\ \end{bmatrix}\text{ (reflection)} </math>
The special case of the reflection matrix with {{math|1=''θ'' = 90°}} generates a reflection about the line at 45° given by {{math|1=''y'' = ''x''}} and therefore exchanges {{mvar|x}} and {{mvar|y}}; it is a permutation matrix, with a single 1 in each column and row (and otherwise 0): <math display="block">\begin{bmatrix} 0 & 1\\ 1 & 0 \end{bmatrix}.</math>
The identity is also a permutation matrix.
A reflection is its own inverse, which implies that a reflection matrix is symmetric (equal to its transpose) as well as orthogonal. The product of two rotation matrices is a rotation matrix, and the product of two reflection matrices is also a rotation matrix.
===Higher dimensions=== Regardless of the dimension, it is always possible to classify orthogonal matrices as purely rotational or not, but for {{nowrap|3 × 3}} matrices and larger the non-rotational matrices can be more complicated than reflections. For example, <math display="block"> \begin{bmatrix} -1 & 0 & 0\\ 0 & -1 & 0\\ 0 & 0 & -1 \end{bmatrix}\text{ and } \begin{bmatrix} 0 & -1 & 0\\ 1 & 0 & 0\\ 0 & 0 & -1 \end{bmatrix}</math>
represent an ''inversion'' through the origin and a ''rotoinversion'', respectively, about the {{math|z}}-axis.
Rotations become more complicated in higher dimensions; they can no longer be completely characterized by one angle, and may affect more than one planar subspace. It is common to describe a {{nowrap|3 × 3}} rotation matrix in terms of an axis and angle, but this only works in three dimensions. Above three dimensions two or more angles are needed, each associated with a plane of rotation.
However, we have elementary building blocks for permutations, reflections, and rotations that apply in general.
===Primitives=== The most elementary permutation is a transposition, obtained from the identity matrix by exchanging two rows. Any {{math|''n'' × ''n''}} permutation matrix can be constructed as a product of no more than {{math|''n'' − 1}} transpositions.
A Householder reflection is constructed from a non-null vector {{math|'''v'''}} as <math display="block">Q = I - 2 \frac{{\mathbf v}{\mathbf v}^\mathrm{T}}{{\mathbf v}^\mathrm{T}{\mathbf v}} .</math>
Here the numerator is a symmetric matrix while the denominator is a number, the squared magnitude of {{math|'''v'''}}. This is a reflection in the hyperplane perpendicular to {{math|'''v'''}} (negating any vector component parallel to {{math|'''v'''}}). If {{math|'''v'''}} is a unit vector, then {{math|1=''Q'' = ''I'' − 2'''vv'''<sup>T</sup>}} suffices. A Householder reflection is typically used to simultaneously zero the lower part of a column. Any orthogonal matrix of size {{nowrap|''n'' × ''n''}} can be constructed as a product of at most {{mvar|n}} such reflections.
A Givens rotation acts on a two-dimensional (planar) subspace spanned by two coordinate axes, rotating by a chosen angle. It is typically used to zero a single subdiagonal entry. Any rotation matrix of size {{math|''n'' × ''n''}} can be constructed as a product of at most {{math|{{sfrac|''n''(''n'' − 1)|2}}}} such rotations. In the case of {{nowrap|3 × 3}} matrices, three such rotations suffice; and by fixing the sequence we can thus describe all {{nowrap|3 × 3}} rotation matrices (though not uniquely) in terms of the three angles used, often called Euler angles.
A Jacobi rotation has the same form as a Givens rotation, but is used to zero both off-diagonal entries of a {{nowrap|2 × 2}} symmetric submatrix.
==Properties==
===Matrix properties=== A real square matrix is orthogonal if and only if its columns form an orthonormal basis of the Euclidean space {{math|'''R'''<sup>''n''</sup>}} with the ordinary Euclidean dot product, which is the case if and only if its rows form an orthonormal basis of {{math|'''R'''<sup>''n''</sup>}}. It might be tempting to suppose a matrix with orthogonal (not orthonormal) columns would be called an orthogonal matrix, but such matrices have no special interest and no special name; they only satisfy {{math|1=''M''<sup>T</sup>''M'' = ''D''}}, with {{mvar|D}} a diagonal matrix.
The determinant of any orthogonal matrix is +1 or −1. This follows from basic facts about determinants, as follows: <math display="block">1=\det(I)=\det\left(Q^\mathrm{T}Q\right)=\det\left(Q^\mathrm{T}\right)\det(Q)=\bigl(\det(Q)\bigr)^2 .</math>
The converse is not true; having a determinant of ±1 is no guarantee of orthogonality, even with orthogonal columns, as shown by the following counterexample. <math display="block">\begin{bmatrix} 2 & 0 \\ 0 & \frac{1}{2} \end{bmatrix}</math>
With permutation matrices the determinant matches the signature, being +1 or −1 as the parity of the permutation is even or odd, for the determinant is an alternating function of the rows.
Stronger than the determinant restriction is the fact that an orthogonal matrix can always be diagonalized over the complex numbers to exhibit a full set of eigenvalues, all of which must have (complex) modulus 1.
===Group properties=== The inverse of every orthogonal matrix is again orthogonal, as is the matrix product of two orthogonal matrices. In fact, the set of all {{math|''n'' × ''n''}} orthogonal matrices satisfies all the axioms of a group. It is a compact Lie group of dimension {{math|{{sfrac|''n''(''n'' − 1)|2}}}}, called the orthogonal group and denoted by {{math|O(''n'')}}.
The orthogonal matrices whose determinant is +1 form a path-connected normal subgroup of {{math|O(''n'')}} of index 2, the special orthogonal group {{math|SO(''n'')}} of rotations. The quotient group {{math|O(''n'')/SO(''n'')}} is isomorphic to {{math|O(1)}}, with the projection map choosing [+1] or [−1] according to the determinant. Orthogonal matrices with determinant −1 do not include the identity, and so do not form a subgroup but only a coset; it is also (separately) connected. Thus each orthogonal group falls into two pieces; and because the projection map splits, {{math|O(''n'')}} is a semidirect product of {{math|SO(''n'')}} by {{math|O(1)}}. In practical terms, a comparable statement is that any orthogonal matrix can be produced by taking a rotation matrix and possibly negating one of its columns, as we saw with {{nowrap|2 × 2}} matrices. If {{mvar|n}} is odd, then the semidirect product is in fact a direct product, and any orthogonal matrix can be produced by taking a rotation matrix and possibly negating all of its columns. This follows from the property of determinants that negating a column negates the determinant, and thus negating an odd (but not even) number of columns negates the determinant.
Now consider {{math|(''n'' + 1) × (''n'' + 1)}} orthogonal matrices with bottom right entry equal to 1. The remainder of the last column (and last row) must be zeros, and the product of any two such matrices has the same form. The rest of the matrix is an {{math|''n'' × ''n''}} orthogonal matrix; thus {{math|O(''n'')}} is a subgroup of {{math|O(''n'' + 1)}} (and of all higher groups).
<math display="block">\begin{bmatrix} & & & 0\\ & \mathrm{O}(n) & & \vdots\\ & & & 0\\ 0 & \cdots & 0 & 1 \end{bmatrix}</math>
Since an elementary reflection in the form of a Householder matrix can reduce any orthogonal matrix to this constrained form, a series of such reflections can bring any orthogonal matrix to the identity; thus an orthogonal group is a reflection group. The last column can be fixed to any unit vector, and each choice gives a different copy of {{math|O(''n'')}} in {{math|O(''n'' + 1)}}; in this way {{math|O(''n'' + 1)}} is a bundle over the unit sphere {{math|''S''<sup>''n''</sup>}} with fiber {{math|O(''n'')}}.
Similarly, {{math|SO(''n'')}} is a subgroup of {{math|SO(''n'' + 1)}}; and any special orthogonal matrix can be generated by Givens plane rotations using an analogous procedure. The bundle structure persists: <math>\mathrm{SO}(n) \hookrightarrow \mathrm{SO}(n+1) \to S^n</math>. A single rotation can produce a zero in the first row of the last column, and series of {{math|''n'' − 1}} rotations will zero all but the last row of the last column of an {{math|''n'' × ''n''}} rotation matrix. Since the planes are fixed, each rotation has only one degree of freedom, its angle. By induction, {{math|SO(''n'')}} therefore has <math display="block">(n-1) + (n-2) + \cdots + 1 = \frac{n(n-1)}{2}</math> degrees of freedom, and so does {{math|O(''n'')}}.
Permutation matrices are simpler still; they form, not a Lie group, but only a finite group, the order {{math|''n''!}} symmetric group {{math|S<sub>''n''</sub>}}. By the same kind of argument, {{math|S<sub>''n''</sub>}} is a subgroup of {{math|S<sub>''n'' + 1</sub>}}. The even permutations produce the subgroup of permutation matrices of determinant +1, the order {{math|{{sfrac|''n''!|2}}}} alternating group.
===Canonical form=== More broadly, the effect of any orthogonal matrix separates into independent actions on orthogonal two-dimensional subspaces. That is, if {{mvar|Q}} is special orthogonal then one can always find an orthogonal matrix {{mvar|P}}, a (rotational) change of basis, that brings {{mvar|Q}} into block diagonal form:
<math display="block">P^\mathrm{T}QP = \begin{bmatrix} R_1 & & \\ & \ddots & \\ & & R_k \end{bmatrix}\ (n\text{ even}), \ P^\mathrm{T}QP = \begin{bmatrix} R_1 & & & \\ & \ddots & & \\ & & R_k & \\ & & & 1 \end{bmatrix}\ (n\text{ odd}).</math>
where the matrices {{math|''R''<sub>1</sub>, ..., ''R''<sub>''k''</sub>}} are {{nowrap|2 × 2}} rotation matrices, and with the remaining entries zero. Exceptionally, a rotation block may be diagonal, {{math|±''I''}}. Thus, negating one column if necessary, and noting that a {{nowrap|2 × 2}} reflection diagonalizes to a +1 and −1, any orthogonal matrix can be brought to the form <ref>{{Harvp|Gantmacher|1959|loc=p. [https://archive.org/details/gantmacher-the-theory-of-matrices-vol-1-1959/page/285 285], eq. 113}} / {{Harvp|Гантмахер|n.d.|loc=p. [https://lib.brsu.by/sites/default/files/books/%D0%93%D0%B0%D0%BD%D1%82%D0%BC%D0%B0%D1%85%D0%B5%D1%80%20%D0%A4.%D0%A0.%20-%20%D0%A2%D0%B5%D0%BE%D1%80%D0%B8%D1%8F%20%D0%BC%D0%B0%D1%82%D1%80%D0%B8%D1%86.pdf#page=263 258], eq. 123}}</ref><ref>{{Harvp|Lang|1987|loc=§ 6 "UNITARY OPERATORS", p. 230}}</ref> <math display="block">P^\mathrm{T}QP = \begin{bmatrix} \begin{matrix}R_1 & & \\ & \ddots & \\ & & R_k\end{matrix} & 0 \\ 0 & \begin{matrix}\pm 1 & & \\ & \ddots & \\ & & \pm 1\end{matrix} \\ \end{bmatrix},</math>
The matrices {{math|''R''<sub>1</sub>, ..., ''R''<sub>''k''</sub>}} give conjugate pairs of eigenvalues lying on the unit circle in the complex plane; so this decomposition confirms that all eigenvalues have absolute value 1. If {{mvar|n}} is odd, there is at least one real eigenvalue, +1 or −1; for a {{nowrap|3 × 3}} rotation, the eigenvector associated with +1 is the rotation axis.
===Lie algebra=== Suppose the entries of {{mvar|Q}} are differentiable functions of {{mvar|t}}, and that {{math|1=''t'' = 0}} gives {{math|1=''Q'' = ''I''}}. Differentiating the orthogonality condition <math display="block">Q^\mathrm{T} Q = I </math> yields <math display="block">\dot{Q}^\mathrm{T} Q + Q^\mathrm{T} \dot{Q} = 0</math>
Evaluation at {{math|1=''t'' = 0}} ({{math|1=''Q'' = ''I''}}) then implies <math display="block">\dot{Q}^\mathrm{T} = -\dot{Q} .</math>
In Lie group terms, this means that the Lie algebra of an orthogonal matrix group consists of skew-symmetric matrices. Going the other direction, the matrix exponential of any skew-symmetric matrix is an orthogonal matrix (in fact, special orthogonal).
For example, the three-dimensional object physics calls angular velocity is a differential rotation, thus a vector in the Lie algebra <math>\mathfrak{so}(3)</math> tangent to {{math|SO(3)}}. Given {{math|1='''ω''' = (''xθ'', ''yθ'', ''zθ'')}}, with {{math|1='''v''' = (''x'', ''y'', ''z'')}} being a unit vector, the correct skew-symmetric matrix form of {{mvar|'''ω'''}} is <math display="block"> \Omega = \begin{bmatrix} 0 & -z\theta & y\theta \\ z\theta & 0 & -x\theta \\ -y\theta & x\theta & 0 \end{bmatrix} .</math>
The exponential of this is the orthogonal matrix for rotation around axis {{math|'''v'''}} by angle {{mvar|θ}}; setting {{math|1=''c'' = cos {{sfrac|''θ''|2}}}}, {{math|1=''s'' = sin {{sfrac|''θ''|2}}}}, <math display="block">\exp(\Omega) = \begin{bmatrix} 1 - 2s^2 + 2x^2 s^2 & 2xy s^2 - 2z sc & 2xz s^2 + 2y sc\\ 2xy s^2 + 2z sc & 1 - 2s^2 + 2y^2 s^2 & 2yz s^2 - 2x sc\\ 2xz s^2 - 2y sc & 2yz s^2 + 2x sc & 1 - 2s^2 + 2z^2 s^2 \end{bmatrix}.</math>
==Numerical linear algebra==
===Benefits=== Numerical analysis takes advantage of many of the properties of orthogonal matrices for numerical linear algebra, and they arise naturally. For example, it is often desirable to compute an orthonormal basis for a space, or an orthogonal change of bases; both take the form of orthogonal matrices. Having determinant ±1 and all eigenvalues of magnitude 1 is of great benefit for numerical stability. One implication is that the condition number is 1 (which is the minimum), so errors are not magnified when multiplying with an orthogonal matrix. Many algorithms use orthogonal matrices like Householder reflections and Givens rotations for this reason. It is also helpful that, not only is an orthogonal matrix invertible, but its inverse is available essentially free, by exchanging indices.
Permutations are essential to the success of many algorithms, including the workhorse Gaussian elimination with partial pivoting (where permutations do the pivoting). However, they rarely appear explicitly as matrices; their special form allows more efficient representation, such as a list of {{mvar|n}} indices.
Likewise, algorithms using Householder and Givens matrices typically use specialized methods of multiplication and storage. For example, a Givens rotation affects only two rows of a matrix it multiplies, changing a full multiplication of order {{math|''n''<sup>3</sup>}} to a much more efficient order {{mvar|n}}. When uses of these reflections and rotations introduce zeros in a matrix, the space vacated is enough to store sufficient data to reproduce the transform, and to do so robustly. (Following G. W. Stewart of UMD,<ref>{{Cite journal |last=Stewart |first=G. W. |date=June 1976 |orig-date=received February 3, 1975 |title=The Economical Storage of Plane Rotations |url=https://link.springer.com/article/10.1007/BF01462266 |journal=Numerische Mathematik |language=en |volume=25 |issue=2 |pages=137–138 |doi=10.1007/BF01462266 |issn=0029-599X |eissn=0945-3245 |s2cid=120372682|url-access=subscription }}</ref> we do ''not'' store a rotation angle, which is both expensive and badly behaved.)
===Decompositions=== A number of important matrix decompositions involve orthogonal matrices, including especially:
;{{mvar|QR}} decomposition : {{math|1=''M'' = ''QR''}}, {{mvar|Q}} orthogonal, {{mvar|R}} upper triangular<ref>{{Harvp|Golub|Van Loan|1996|pp=[https://www.google.com/books/edition/Matrix_Computations/mlOa7wPX6OYC?gbpv=1&pg=PA206&dq=QR%20orthogonal 206]–208, [https://www.google.com/books/edition/Matrix_Computations/mlOa7wPX6OYC?gbpv=1&pg=PA223&dq=QR%20orthogonal 223]–236, ...|p=}}</ref> ;Singular value decomposition : {{math|1=''M'' = ''U''Σ''V''<sup>T</sup>}}, {{mvar|U}} and {{mvar|V}} orthogonal, {{math|Σ}} diagonal matrix<ref>{{Harvp|Golub|Van Loan|1996|pp=[https://www.google.com/books/edition/Matrix_Computations/mlOa7wPX6OYC?gbpv=1&pg=PA69&dq=SVD%20orthogonal 69]–75|p=}}</ref> ;Eigendecomposition of a symmetric matrix (decomposition according to the spectral theorem) : {{math|1=''S'' = ''Q''Λ''Q''<sup>T</sup>}}, {{mvar|S}} symmetric, {{mvar|Q}} orthogonal, {{math|Λ}} diagonal ;Polar decomposition : {{math|1=''M'' = ''QS''}}, {{mvar|Q}} orthogonal{{Citation needed|date=April 2026|reason=Golub & Van Loan (1996) says "has orthonormal columns"}}, {{mvar|S}} symmetric positive-semidefinite<ref>{{Harvp|Golub|Van Loan|1996|pp=|p=[https://www.google.com/books/edition/Matrix_Computations/mlOa7wPX6OYC?gbpv=1&pg=PA149&dq=polar 149]}}</ref>
====Examples==== Consider an overdetermined system of linear equations, as might occur with repeated measurements of a physical phenomenon to compensate for experimental errors. Write {{math|1=''A'''''x''' = '''b'''}}, where {{mvar|A}} is {{math|''m'' × ''n''}}, {{math|''m'' > ''n''}}. A {{mvar|QR}} decomposition reduces {{mvar|A}} to upper triangular {{mvar|R}}. For example, if {{mvar|A}} is {{nowrap|5 × 3}} then {{mvar|R}} has the form <math display="block">R = \begin{bmatrix} \cdot & \cdot & \cdot \\ 0 & \cdot & \cdot \\ 0 & 0 & \cdot \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{bmatrix}.</math>
The linear least squares problem is to find the {{math|'''x'''}} that minimizes {{math|{{norm|''A'''''x''' − '''b'''}}}}, which is equivalent to projecting {{math|'''b'''}} to the subspace spanned by the columns of {{mvar|A}}. Assuming the columns of {{mvar|A}} (and hence {{mvar|R}}) are independent, the projection solution is found from {{math|1=''A''<sup>T</sup>''A'''''x''' = ''A''<sup>T</sup>'''b'''}}. Now {{math|''A''<sup>T</sup>''A''}} is square ({{math|''n'' × ''n''}}) and invertible, and also equal to {{math|''R''<sup>T</sup>''R''}}. But the lower rows of zeros in {{mvar|R}} are superfluous in the product, which is thus already in lower-triangular upper-triangular factored form, as in Gaussian elimination (Cholesky decomposition). Here orthogonality is important not only for reducing {{math|1=''A''<sup>T</sup>''A'' = (''R''<sup>T</sup>''Q''<sup>T</sup>)''QR''}} to {{math|''R''<sup>T</sup>''R''}}, but also for allowing solution without magnifying numerical problems.
In the case of a linear system which is underdetermined, or an otherwise non-invertible matrix, singular value decomposition (SVD) is equally useful. With {{mvar|A}} factored as {{math|''U''Σ''V''<sup>T</sup>}}, a satisfactory solution uses the Moore-Penrose pseudoinverse, {{math|''V''Σ<sup>+</sup>''U''<sup>T</sup>}}, where {{math|Σ<sup>+</sup>}} merely replaces each non-zero diagonal entry with its reciprocal. Set {{math|'''x'''}} to {{math|''V''Σ<sup>+</sup>''U''<sup>T</sup>'''b'''}}.
The case of a square invertible matrix also holds interest. Suppose, for example, that {{mvar|A}} is a {{nowrap|3 × 3}} rotation matrix which has been computed as the composition of numerous twists and turns. Floating point does not match the mathematical ideal of real numbers, so {{mvar|A}} has gradually lost its true orthogonality. A Gram–Schmidt process could orthogonalize the columns, but it is not the most reliable, nor the most efficient, nor the most invariant method. The polar decomposition factors a matrix into a pair, one of which is the unique ''closest'' orthogonal matrix to the given matrix, or one of the closest if the given matrix is singular. (Closeness can be measured by any matrix norm invariant under an orthogonal change of basis, such as the spectral norm or the Frobenius norm.) For a near-orthogonal matrix, rapid convergence to the orthogonal factor can be achieved by a "Newton's method" approach due to Nicholas Higham of Manchester<ref>{{Cite journal |last=Higham |first=Nicholas J. |date=October 1986 |orig-date=received March 13, 1985 |title=Computing the Polar Decomposition—with Applications |url=http://www.maths.man.ac.uk/~nareports/narep94.pdf <!-- linked from https://web.archive.org/web/20051016153437/http://www.ma.man.ac.uk/~higham/pap-mf.html --> |url-status=dead |journal=SIAM Journal on Scientific and Statistical Computing |language=en |volume=7 |issue=4 |pages=1160–1174 |doi=10.1137/0907079 |issn=0196-5204 |id=MIMS EPrint [https://eprints.maths.manchester.ac.uk/694/1/high86p.pdf 2007.9]. {{ProQuest|920895097}}. |archive-url=https://web.archive.org/web/20070418094944/http://www.maths.manchester.ac.uk/~nareports/narep94.pdf |archive-date=2007-04-18}}</ref><ref>{{Cite journal |last=Higham |first=Nicholas J. |last2=Schreiber |first2=Robert S. |date=July 1990 |title=Fast polar decomposition of an arbitrary matrix |url=http://www.maths.man.ac.uk/~higham/papers/hisc90.pdf |url-status=dead |journal=SIAM Journal on Scientific and Statistical Computing |language=en |volume=11 |issue=4 |pages=648–655 |citeseerx=10.1.1.230.4322 |doi=10.1137/0911038 |issn=0196-5204 |s2cid=14268409 |id={{ProQuest|921763908}}. |archive-url=https://web.archive.org/web/20230521163024/https://www.maths.manchester.ac.uk/~higham/papers/hisc90.pdf |archive-date=2023-05-21}}</ref>, repeatedly averaging the matrix with its inverse transpose. Augustin A. Dubrulle of ORNL has published an accelerated method with a convenient convergence test.<ref>{{Cite journal |last=Dubrulle |first=A. A. |date=1999 |title=An Optimum Iteration for the Matrix Polar Decomposition |url=https://etna.ricam.oeaw.ac.at/volumes/1993-2000/vol8/abstract.php?pages=21-25 <!-- PDF at https://etna.ricam.oeaw.ac.at/vol.8.1999/pp21-25.dir/pp21-25.pdf; old websites include etna.mcs.kent.edu and www.emis.de/journals/ETNA --> |journal=Electronic Transactions on Numerical Analysis |publisher=<!-- Kent State University --> |volume=8 |pages=21–25 |issn=1068-9613 |zbl=0939.65068 |id=EMIS [https://emis.de/ft/7655 7655]. EuDML [https://eudml.org/doc/120380 120380].}}</ref>
For example, consider a non-orthogonal matrix for which the simple averaging algorithm takes seven steps <math display="block">\begin{bmatrix}3 & 1\\7 & 5\end{bmatrix} \rightarrow \begin{bmatrix}1.8125 & 0.0625\\3.4375 & 2.6875\end{bmatrix} \rightarrow \cdots \rightarrow \begin{bmatrix}0.8 & -0.6\\0.6 & 0.8\end{bmatrix}</math> and which acceleration trims to two steps (with {{mvar|γ}} = 0.353553, 0.565685).
<math display="block">\begin{bmatrix}3 & 1\\7 & 5\end{bmatrix} \rightarrow \begin{bmatrix}1.41421 & -1.06066\\1.06066 & 1.41421\end{bmatrix} \rightarrow \begin{bmatrix}0.8 & -0.6\\0.6 & 0.8\end{bmatrix}</math>
Gram-Schmidt yields an inferior solution, shown by a Frobenius distance of 8.28659 instead of the minimum 8.12404.
<math display="block">\begin{bmatrix}3 & 1\\7 & 5\end{bmatrix} \rightarrow \begin{bmatrix}0.393919 & -0.919145\\0.919145 & 0.393919\end{bmatrix}</math>
===Randomization=== Some numerical applications, such as Monte Carlo methods and exploration of high-dimensional data spaces, require generation of uniformly distributed random orthogonal matrices. In this context, "uniform" is defined in terms of Haar measure, which essentially requires that the distribution not change if multiplied by any freely chosen orthogonal matrix. Orthogonalizing matrices with independent uniformly distributed random entries does not result in uniformly distributed orthogonal matrices{{Citation needed|date=June 2009}}, but the {{mvar|QR}} decomposition of independent normally distributed random entries does, as long as the diagonal of {{mvar|R}} contains only positive entries.<ref>{{Cite journal |last=Mezzadri |first=Francesco |date=May 2007 |orig-date=arXiv preprint first submitted 18 Sep 2006, last revised 27 Feb 2007 |title=How to Generate Random Matrices from the Classical Compact Groups |url=https://www.ams.org/journals/notices/200705/fea-mezzadri-web.pdf <!-- linked from https://www.ams.org/cgi-bin/notices/nxgnotices.pl?fm=main¤t=200705 --> |journal=Notices of the American Mathematical Society (AMS) |volume=54 |issue=5 |pages=592–604 |arxiv=math-ph/0609050 |bibcode=2006math.ph...9050M}}</ref> G. W. Stewart of UMD<ref>{{cite journal |last=Stewart |first=G. W. |author-link=G. W. Stewart |date=June 1980 |year= |title=The Efficient Generation of Random Orthogonal Matrices with an Application to Condition Estimators |journal=SIAM Journal on Numerical Analysis |volume=17 |issue=3 |pages=403–409, appendix of 16 microfiches |bibcode=1980SJNA...17..403S |doi=10.1137/0717034 |issn=0036-1429 |eissn=1095-7170}}</ref> replaced this with a more efficient idea that Persi Diaconis of Stanford and Mehrdad Shahshahani of JPL later generalized as the "subgroup algorithm" (in which form it works just as well for permutations and rotations). To generate an {{math|(''n'' + 1) × (''n'' + 1)}} orthogonal matrix, take an {{math|''n'' × ''n''}} one and a uniformly distributed unit vector of dimension {{nowrap|''n'' + 1}}. Construct a Householder reflection from the vector, then apply it to the smaller matrix (embedded in the larger size with a 1 at the bottom right corner).<ref>{{Harvp|Diaconis|Shahshahani|1987|pp=19, 22–23, 32}}</ref>
===Nearest orthogonal matrix===
The problem of finding the orthogonal matrix {{mvar|Q}} nearest a given matrix {{mvar|M}} is related to the Orthogonal Procrustes problem. There are several different ways to get the unique solution, the simplest of which is taking the singular value decomposition of {{mvar|M}} and replacing the singular values with ones. Another method expresses the {{mvar|R}} explicitly but requires the use of a matrix square root:<ref>{{Cite web |last=Horn |first=Berthold K.P. |author-link=Berthold K.P. Horn |date=21 November 2008 |title=Finding the Nearest Orthonormal Matrix |url=https://people.csail.mit.edu/bkph/articles/Nearest_Orthonormal_Matrix.pdf |publisher=Computer Science and Artificial Intelligence Laboratory (CSAIL), Massachusetts Institute of Technology}}</ref> <math display="block">Q = M \left(M^\mathrm{T} M\right)^{-\frac 1 2}</math>
This may be combined with the Babylonian method for extracting the square root of a matrix to give a recurrence which converges to an orthogonal matrix quadratically: <math display="block">Q_{n + 1} = 2 M \left(Q_n^{-1} M + M^\mathrm{T} Q_n\right)^{-1}</math> where {{math|1=''Q''<sub>0</sub> = ''M''}}.
These iterations are stable provided the condition number of {{mvar|M}} is less than three.<ref>{{Cite journal |last=Higham |first=Nicholas J. |date=April 1986 |orig-date=Received October 22, 1984; revised July 30, 1985 |title=Newton's Method for the Matrix Square Root |url=http://www.maths.manchester.ac.uk/~nareports/narep91.pdf |url-status=dead |journal=Mathematics of Computation |language=en |volume=46 |issue=174 |pages=537–549 |doi=10.1090/S0025-5718-1986-0829624-5 |issn=0025-5718 |eissn=1088-6842 |jstor=2007992 |archive-url=https://web.archive.org/web/20110929131330/http://www.maths.manchester.ac.uk/~nareports/narep91.pdf |archive-date=2011-09-29}}</ref>{{Verification needed|date=April 2026|reason=Where does this article give a bound of three? pp. 538, 543 give a bound of nine. Is there a discrepancy between the situation here and the preconditions there that results in an additional calculation being done to determine this number?}}
Using a first-order approximation of the inverse and the same initialization results in the modified iteration:
<math display="block">N_{n} = Q_n^\mathrm{T} Q_n</math> <math display="block">P_{n} = \frac 1 2 Q_n N_{n}</math> <math display="block">Q_{n + 1} = 2 Q_n + P_n N_n - 3 P_n</math>
==Spin and pin== A subtle technical problem afflicts some uses of orthogonal matrices. Not only are the group components with determinant +1 and −1 not connected to each other, even the +1 component, {{math|SO(''n'')}}, is not simply connected (except for SO(1), which is trivial). Thus it is sometimes advantageous, or even necessary, to work with a covering group of SO(''n''), the spin group, {{math|Spin(''n'')}}. Likewise, {{math|O(''n'')}} has covering groups, the pin groups, Pin(''n''). For {{math|''n'' > 2}}, {{math|Spin(''n'')}} is simply connected and thus the universal covering group for {{math|SO(''n'')}}. By far the most famous example of a spin group is {{math|Spin(3)}}, which is nothing but {{math|SU(2)}}, or the group of unit quaternions.
The Pin and Spin groups are found within Clifford algebras, which themselves can be built from orthogonal matrices.
==Rectangular matrices== {{Main|Semi-orthogonal matrix}}
If {{mvar|Q}} is not a square matrix, then the conditions {{math|1=''Q''<sup>T</sup>''Q'' = ''I''}} and {{math|1=''QQ''<sup>T</sup> = ''I''}} are not equivalent. The condition {{math|1=''Q''<sup>T</sup>''Q'' = ''I''}} says that the columns of {{mvar|Q}} are orthonormal. This can only happen if {{mvar|Q}} is an {{math|''m'' × ''n''}} matrix with {{math|''n'' ≤ ''m''}} (due to linear dependence). Similarly, {{math|1=''QQ''<sup>T</sup> = ''I''}} says that the rows of {{mvar|Q}} are orthonormal, which requires {{math|''n'' ≥ ''m''}}.
There is no standard terminology for these matrices. They are variously called "semi-orthogonal matrices", "orthonormal matrices", "orthogonal matrices", and sometimes simply "matrices with orthonormal rows/columns".
For the case {{math|''n'' ≤ ''m''}}, matrices with orthonormal columns may be referred to as orthogonal k-frames and they are elements of the Stiefel manifold.
==See also== * Biorthogonal system
==References== {{Reflist}}
=== Works cited<!-- Using WP:CITET --> ===
* {{Cite book |last=Dawkins |first=Paul |url=https://www.cs.cornell.edu/courses/cs485/2006sp/LinAlg_Complete.pdf |title=Linear Algebra |date=2005 |publisher=Lamar University |series=Paul's Online Math Notes |pages=295–300 |chapter=Orthogonal Matrices |chapter-url=https://tutorial.math.lamar.edu/Classes/LinAlg/OrthogonalMatrix.aspx |archive-url=https://web.archive.org/web/20121218233605/https://tutorial.math.lamar.edu/Classes/LinAlg/OrthogonalMatrix.aspx |archive-date=2012-12-18 |url-status=dead}} * {{cite journal |last1=Diaconis |first1=Persi |author-link1=Persi Diaconis |last2=Shahshahani |first2=Mehrdad |date=January 1987 |year= |title=The subgroup algorithm for generating uniform random variables |url=https://www.cambridge.org/core/journals/probability-in-the-engineering-and-informational-sciences/article/subgroup-algorithm-for-generating-uniform-random-variables/5F15097B60994F362F6926689FA8A465 |journal=Probability in the Engineering and Informational Sciences |volume=1 |issue=1 |pages=15–32 (see pp. 21–28, 31–32) |doi=10.1017/S0269964800000255 |issn=0269-9648 |s2cid=122752374|url-access=subscription }} * {{Cite book |last=Gantmacher |first=F. R. |url=https://bookstore.ams.org/chel-131/ |title=The Theory of Matrices |date=1959 |publisher=Chelsea Publishing Company |isbn=978-1-4704-8159-9 |volume=1 |pages=262–309 |trans-title= |chapter=IX. Linear Operators in a Unitary Space : § 7. Orthonormal Bases, § 13. Linear Operators in a Euclidean Space, § 14. Polar Decomposition of an Operator and the Cayley Formulas in a Euclidean Space, § 15. Commuting Normal Operators; X. Quadratic and Hermitian Forms : § 5. Reduction of a Quadratic Form to Principal Axes |id=Internet Archive [https://archive.org/details/gantmacher-the-theory-of-matrices-vol-1-1959 gantmacher-the-theory-of-matrices-vol-1-1959].}} (Translated from: {{Cite book |last=Гантмахер |first=Ф.Р. |url=https://lib.brsu.by/sites/default/files/books/%D0%93%D0%B0%D0%BD%D1%82%D0%BC%D0%B0%D1%85%D0%B5%D1%80%20%D0%A4.%D0%A0.%20-%20%D0%A2%D0%B5%D0%BE%D1%80%D0%B8%D1%8F%20%D0%BC%D0%B0%D1%82%D1%80%D0%B8%D1%86.pdf |title=Теория Матриц |volume=I: Основы Теории |pages=237–280 |chapter=IX. Линейные операторы в унитарном пространстве : § 7. Ортонормированный базис, § 13. Линейные операторы в евклидовом пространстве, § 14. Полярное разложение оператора и формулы Кэли в евклидовом пространстве, § 15. Коммутирующие нормальные операторы; X. Квадратичные и эрмитовы формы : § 5. Приведение квадратичной формы к главным осям}}) * {{cite book | last1 = Golub | first1 = Gene H. | author-link1 = Gene H. Golub | first2 = Charles F.|last2=Van Loan|author-link2=Charles F. Van Loan | title = Matrix Computations | edition = 3rd | publisher = Johns Hopkins University Press | year = 1996 | location = Baltimore | isbn = 978-0-8018-5414-9 <!-- this (or 0-...14-8) is for paperback; hardcover is 0-8018-5413-X --> |series=Johns Hopkins Series in the Mathematical Sciences|pages=69–80, 206–212, ..., 601–611|id=Google Books [https://www.google.com/books/edition/Matrix_Computations/mlOa7wPX6OYC mlOa7wPX6OYC].|orig-date=first ed. 1983}} * {{Cite book |last=Lang |first=Serge |title=Linear Algebra |date=1987 |work= |publisher=Springer New York |isbn=978-1-4419-3081-1 |edition=3rd |pages=194–230 |language=en |chapter=Chapter VIII: Eigenvectors and Eigenvalues |doi=10.1007/978-1-4757-1949-9_8 |chapter-url=https://link.springer.com/chapter/10.1007/978-1-4757-1949-9_8}}
==Further reading==
{{sister project|project=Wikiversity |text=Wikiversity introduces the '''orthogonal matrix'''.}}
* {{Cite web |date=7 June 2020 <!-- "This page was last edited on 7 June 2020, at 14:54." --> |title=Orthogonal matrix |url=http://encyclopediaofmath.org/index.php?title=Orthogonal_matrix&oldid;=49505 |url-status=dead |archive-url=https://web.archive.org/web/20241211114829/https://encyclopediaofmath.org/index.php?title=Orthogonal_matrix <!-- alternatively, https://web.archive.org/web/20140910124507/http://www.encyclopediaofmath.org/index.php?title=p/o070320 redirects to https://web.archive.org/web/20140913114857/http://www.encyclopediaofmath.org/index.php/Orthogonal_matrix --> |archive-date=2024-12-11 |website=Encyclopedia of Mathematics |language=en}} "This article was adapted from an original article by D.A. Suprunenko (originator), which appeared in Encyclopedia of Mathematics - {{ISBN|1402006098}}. [https://web.archive.org/web/20240716131538/https://encyclopediaofmath.org/index.php?title=Orthogonal_matrix&oldid=17418 See original article]"
== External links ==
* [http://people.revoledu.com/kardi/tutorial/LinearAlgebra/MatrixOrthogonal.html Tutorial and Interactive Program on Orthogonal Matrix by Kardi Teknomo]
{{Matrix classes}}
Category:Matrices (mathematics)