【线性代数】为什么正交矩阵的转置矩阵与逆矩阵相等？

命题

如果一个 <math xmlns="http://www.w3.org/1998/Math/MathML"> n × n n \times n </math>n×n 的实矩阵 <math xmlns="http://www.w3.org/1998/Math/MathML"> A A </math>A 满足norm preserving （即保持向量模长不变），那么 <math xmlns="http://www.w3.org/1998/Math/MathML"> A A </math>A 是正交矩阵，且 <math xmlns="http://www.w3.org/1998/Math/MathML"> A − 1 = A T A^{-1} = A^T </math>A−1=AT。

已知

一个实矩阵 <math xmlns="http://www.w3.org/1998/Math/MathML"> A ∈ R n × n A \in \mathbb{R}^{n \times n} </math>A∈Rn×n，对任意向量 <math xmlns="http://www.w3.org/1998/Math/MathML"> x ∈ R n x \in \mathbb{R}^n </math>x∈Rn，有：
<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"> ∣ A x ∣ = ∣ x ∣ | A x | = | x | </math>∣Ax∣=∣x∣

其中 <math xmlns="http://www.w3.org/1998/Math/MathML"> ∣ ⋅ ∣ | \cdot | </math>∣⋅∣ 表示欧几里得范数，也就是向量 <math xmlns="http://www.w3.org/1998/Math/MathML"> x x </math>x 的模长（ <math xmlns="http://www.w3.org/1998/Math/MathML"> ∣ x ∣ = x T x |x| = \sqrt{x^T x} </math>∣x∣=xTx ）。

证明过程

将等式两边平方：
<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"> ∣ A x ∣ 2 = ∣ x ∣ 2 |A x|^2 = |x|^2 </math>∣Ax∣2=∣x∣2

利用内积表示范数平方（ <math xmlns="http://www.w3.org/1998/Math/MathML"> ∣ v ∣ 2 = v T v |v|^2 = v^T v </math>∣v∣2=vTv）：
<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"> ( A x ) T ( A x ) = x T x (Ax)^T (Ax) = x^T x </math>(Ax)T(Ax)=xTx

展开左边：
<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"> x T A T A x = x T x x^T A^T A x = x^T x </math>xTATAx=xTx

移项得：
<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"> x T ( A T A − I ) x = 0 , ∀ x ∈ R n x^T (A^T A - I) x = 0, \ \forall x \in \mathbb{R}^n </math>xT(ATA−I)x=0, ∀x∈Rn

然后我们试图证明对称矩阵 <math xmlns="http://www.w3.org/1998/Math/MathML"> M = A T A − I M = A^T A - I </math>M=ATA−I 为零矩阵。

令 <math xmlns="http://www.w3.org/1998/Math/MathML"> M = A T A − I M = A^T A - I </math>M=ATA−I，则 <math xmlns="http://www.w3.org/1998/Math/MathML"> M M </math>M 是实对称矩阵（因为 <math xmlns="http://www.w3.org/1998/Math/MathML"> ( A T A ) T = A T A (A^T A)^T = A^T A </math>(ATA)T=ATA）。

已知对于所有 <math xmlns="http://www.w3.org/1998/Math/MathML"> x ∈ R n x \in \mathbb{R}^n </math>x∈Rn，有：
<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"> x T M x = 0 x^T M x = 0 </math>xTMx=0

我们证明这蕴含 <math xmlns="http://www.w3.org/1998/Math/MathML"> M = 0 M = 0 </math>M=0：

取 <math xmlns="http://www.w3.org/1998/Math/MathML"> x x </math>x 为标准基向量 <math xmlns="http://www.w3.org/1998/Math/MathML"> e i e_i </math>ei：

<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"> e i T M e i = M i , i = 0 e_i^T M e_i = M_{i,i} = 0 </math>eiTMei=Mi,i=0

所以 <math xmlns="http://www.w3.org/1998/Math/MathML"> M M </math>M 的所有对角线元素为 <math xmlns="http://www.w3.org/1998/Math/MathML"> 0 0 </math>0。
取 <math xmlns="http://www.w3.org/1998/Math/MathML"> x = e i + e j x = e_i + e_j </math>x=ei+ej（ <math xmlns="http://www.w3.org/1998/Math/MathML"> i ≠ j i \neq j </math>i=j）：

<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"> ( e i + e j ) T M ( e i + e j ) = M i , i + M j , j + M i , j + M j , i (e_i + e_j)^T M (e_i + e_j) = M_{i,i} + M_{j,j} + M_{i,j} + M_{j,i} </math>(ei+ej)TM(ei+ej)=Mi,i+Mj,j+Mi,j+Mj,i

因为 <math xmlns="http://www.w3.org/1998/Math/MathML"> M M </math>M 对称， <math xmlns="http://www.w3.org/1998/Math/MathML"> M i , j = M j , i M_{i,j} = M_{j,i} </math>Mi,j=Mj,i，且 <math xmlns="http://www.w3.org/1998/Math/MathML"> M i , i = M j , j = 0 M_{i,i} = M_{j,j} = 0 </math>Mi,i=Mj,j=0，所以：
<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"> 0 + 0 + 2 M i j = 0 ⇒ M i j = 0 0 + 0 + 2M_{ij} = 0 \quad \Rightarrow \quad M_{ij} = 0 </math>0+0+2Mij=0⇒Mij=0

因此 <math xmlns="http://www.w3.org/1998/Math/MathML"> M M </math>M 的所有元素为 <math xmlns="http://www.w3.org/1998/Math/MathML"> 0 0 </math>0，即 <math xmlns="http://www.w3.org/1998/Math/MathML"> M = 0 M = 0 </math>M=0。

由 <math xmlns="http://www.w3.org/1998/Math/MathML"> M = 0 M = 0 </math>M=0 得：
<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"> A T A − I = 0 ⇒ A T A = I A^T A - I = 0 \quad \Rightarrow \quad A^T A = I </math>ATA−I=0⇒ATA=I

这意味着 <math xmlns="http://www.w3.org/1998/Math/MathML"> A A </math>A 是正交矩阵，且 <math xmlns="http://www.w3.org/1998/Math/MathML"> A − 1 = A T A^{-1} = A^T </math>A−1=AT。

结论

我们证明了：若实方阵 <math xmlns="http://www.w3.org/1998/Math/MathML"> A A </math>A 保持所有向量的欧几里得范数不变，则 <math xmlns="http://www.w3.org/1998/Math/MathML"> A A </math>A 必为正交矩阵，其逆等于其转置。