张量分析 | 指标记号 / 张量记号

注:本文为 "指标记号与张量记号" 相关合辑。

英文引文,机翻未校。

如有内容异常,请看原文。


Index Notation and Tensor Notation

指标记号与张量记号

Introduction

引言

The full notation and array notation are very helpful when introducing the operations and rules in tensor analysis. However, tensor notation and index notation are more commonly used in the context of partial differential equations and tensor analysis. The tensor notation just requires the utilization of different symbols for tensors of different orders and the use of appropriate symbols as operators connecting these tensors. The tensor notation thus enables us to write PDEs in a concise way, which is also independent of the adopted coordinate system. But in many cases, the index notation is preferred as it is proven to be much more powerful for occasions such as derivations. In this chapter, we will start from the basic rules of the index notation, then move to the use of the index notation for tensor algebra, and finally reach the calculus in terms of the index notation. At the end of the chapter, two examples will be given to show the algebraic manipulations, i.e., derivations, using the index notation.

在介绍张量分析的运算与规则时,全记号和数组记号十分实用。但在偏微分方程与张量分析的研究中,张量记号和指标记号的应用更为广泛。张量记号仅要求为不同阶数的张量使用不同符号,并采用恰当的符号作为连接这些张量的算符。借助张量记号,我们能够以简洁的形式书写偏微分方程,且该形式与所采用的坐标系无关。不过在诸多情况下,指标记号更受青睐,因为事实证明,它在推导等场景中具有更强的实用性。本章将从指标记号的基本规则入手,接着介绍指标记号在张量代数中的应用,最后阐述指标记号下的微积分运算。本章末尾将给出两个实例,展示如何运用指标记号进行代数运算,即推导过程。

Rules of Index Notation

指标记号的规则

In the index notation, indices are categorized into two groups: free indices and dummy indices. A free index means an "independent dimension" or an order of the tensor whereas a dummy index means summation. The following three basic rules must be met for the index notation:

在指标记号中,指标分为两类:自由指标和哑指标。自由指标代表张量的 "独立维度" 或阶数,而哑指标表示求和运算。指标记号需遵循以下三条基本规则:

  1. The same index (subscript) may not appear more than twice in a product of two (or more) vectors or tensors. Thus

    同一个指标(下标)在两个(或多个)矢量或张量的乘积中出现的次数不得超过两次。因此
    A i k u k , A i k B k j , A i j B j k C n k A_{i k} u_{k}, A_{i k} B_{k j}, A_{i j} B_{j k} C_{n k} Aikuk,AikBkj,AijBjkCnk

    are valid, but

    是有效的,而

    A k k u k , A i k B k k , A i j B i k C i k A_{k k} u_{k}, A_{i k} B_{k k}, A_{i j} B_{i k} C_{i k} Akkuk,AikBkk,AijBikCik

    are meaningless.

    是无意义的。

  2. Free indices on each term of an equation must agree. Thus

    方程中每一项的自由指标必须保持一致。因此
    u i = v i + w i , u i = A i k B k j v j + C i k w k . \begin {aligned} u_i &= v_i + w_i, \\ u_i &= A_{ik} B_{kj} v_j + C_{ik} w_k. \end {aligned} uiui=vi+wi,=AikBkjvj+Cikwk.

    are valid, but

    是有效的,而
    u i = A i j , u j = A i k u k , u i = A i k v k + w j . \begin {aligned} u_i &= A_{ij}, \\ u_j &= A_{ik} u_k, \\ u_i &= A_{ik} v_k + w_j. \end {aligned} uiujui=Aij,=Aikuk,=Aikvk+wj.

    are meaningless.

    是无意义的。

  3. Free and dummy indices may be changed without altering the meaning of an expression under the condition that Rules 1 and 2 are not violated. Thus

    在不违反规则 1 和规则 2 的前提下,自由指标和哑指标可进行替换,且不会改变表达式的含义。因此
    u j = A j k v k ⇔ u i = A i k v k ⇔ u j = A j i v i . u_{j}=A_{j k} v_{k} \Leftrightarrow u_{i}=A_{i k} v_{k} \Leftrightarrow u_{j}=A_{j i} v_{i} . uj=Ajkvk⇔ui=Aikvk⇔uj=Ajivi.

Tensor Algebra in Index Notation

指标记号下的张量代数

First, according to the meaning of free dimension, a vector u and a second-order tensor A should be written as u i u_{i} ui and A i j A_{i j} Aij , respectively. The free indices can be changed to other symbols. The basic operations in tensor algebra can be expressed using the index notation as follows:

首先,根据自由维度的定义,矢量 u u u 和二阶张量 A A A 应分别表示为 u i u_i ui 和 A i j A_{ij} Aij,自由指标可替换为其他符号。张量代数中的基本运算可通过指标记号表示如下:

Addition 加法 u i = v i + w i ⇔ u = v + w u_i = v_i + w_i \Leftrightarrow \mathbf {u} = \mathbf {v} + \mathbf {w} ui=vi+wi⇔u=v+w
Dot product 点积 λ = u ⋅ v ⇔ λ = u i v i \lambda = \mathbf {u} \cdot \mathbf {v} \Leftrightarrow \lambda = u_i v_i λ=u⋅v⇔λ=uivi
Vector product 叉积 / 向量积 u = v × w ⇔ u i = ε i j k v j w k \mathbf {u} = \mathbf {v} \times \mathbf {w} \Leftrightarrow u_i = \varepsilon_{ijk} v_j w_k u=v×w⇔ui=εijkvjwk
Dyadic product 并矢积 A = u v ⇔ A i j = u i v j \mathbf {A} = \mathbf {u}\mathbf {v} \Leftrightarrow A_{ij} = u_i v_j A=uv⇔Aij=uivj
Addition 加法 C = A + B ⇔ C i j = A i j + B i j \mathbf {C} = \mathbf {A} + \mathbf {B} \Leftrightarrow C_{ij} = A_{ij} + B_{ij} C=A+B⇔Cij=Aij+Bij
Transpose 转置 A = B T ⇔ A i j = B j i \mathbf {A} = \mathbf {B}^T \Leftrightarrow A_{ij} = B_{ji} A=BT⇔Aij=Bji
Scalar products 标量积 λ = A : B ⇔ λ = A i j B i j λ = A ⋅ ⋅ B ⇔ λ = A i j B j i \lambda = \mathbf {A} : \mathbf {B} \Leftrightarrow \lambda = A_{ij} B_{ij} \\ \lambda = \mathbf {A} \cdot \cdot \mathbf {B} \Leftrightarrow \lambda = A_{ij} B_{ji} λ=A:B⇔λ=AijBijλ=A⋅⋅B⇔λ=AijBji
Inner product of a tensor and a vector 张量与向量的内积 u = A v ⇔ u i = A i j v j \mathbf {u} = \mathbf {A}\mathbf {v} \Leftrightarrow u_i = A_{ij} v_j u=Av⇔ui=Aijvj
Inner product of two tensors 两个张量的内积 C = A ⋅ B ⇔ C i j = A i k B k j \mathbf {C} = \mathbf {A} \cdot \mathbf {B} \Leftrightarrow C_{ij} = A_{ik} B_{kj} C=A⋅B⇔Cij=AikBkj
Determinant 行列式 λ = det ⁡ A ⇔ λ = 1 6 ε i j k ε l m n A l i A m j A n k = ε i j k A i 1 A j 2 A k 3 ⇔ ε l m n λ = ε i j k A l i A m j A n k = ε i j k A i l A j m A k n \begin {aligned} \lambda = \det\mathbf {A} &\Leftrightarrow \lambda = \tfrac {1}{6}\varepsilon_{ijk}\varepsilon_{lmn} A_{li} A_{mj} A_{nk} = \varepsilon_{ijk} A_{i1} A_{j2} A_{k3} \\ &\Leftrightarrow \varepsilon_{lmn}\lambda = \varepsilon_{ijk} A_{li} A_{mj} A_{nk} = \varepsilon_{ijk} A_{il} A_{jm} A_{kn} \end {aligned} λ=detA⇔λ=61εijkεlmnAliAmjAnk=εijkAi1Aj2Ak3⇔εlmnλ=εijkAliAmjAnk=εijkAilAjmAkn
Inverse 逆(张量) A j i − 1 = 1 2 det ⁡ ( A ) ε i p q ε j k l A p k A q l A_{ji}^{-1} = \dfrac {1}{2\det (\mathbf {A})}\varepsilon_{ipq}\varepsilon_{jkl} A_{pk} A_{ql} Aji−1=2det(A)1εipqεjklApkAql

Calculus Using Index Notation

指标记号下的微积分运算

The derivative with respect to one coordinate in the index notation can be represented by putting the corresponding symbols for the coordinate after a comma in the indices. The gradient of a scalar u thus can be written as

在指标记号中,对某一坐标的偏导数可通过在指标中添加逗号,并在逗号后标注该坐标对应的符号来表示。因此,标量 u u u 的梯度可写为:
∇ u = u , i \nabla u=u_{, i} ∇u=u,i

where i is the free index. Therefore, u , i u_{, i} u,i is still a vector since it has one free index like u i u_{i} ui . Accordingly, the gradient of a vector u is written as

其中 i i i 为自由指标。因此, u , i u_{,i} u,i 与 u i u_i ui 一样拥有一个自由指标,故其仍为矢量。相应地,矢量 u u u 的梯度可表示为:
∇ u = u i , j , \nabla u=u_{i, j}, ∇u=ui,j,
u i , j u_{i, j} ui,j has two free indices; hence, it is a second-order tensor.
u i , j u_{i,j} ui,j 含有两个自由指标,因此它是一个二阶张量。

The divergence of a vector, which can be viewed as the combination of the gradient vector and contraction, requires the contraction of two indices into one via Kronecker delta. Therefore, we will have this operation in the index notation as

矢量的散度可看作是梯度矢量与缩并运算的结合,需通过克罗内克 δ 将两个指标缩并为一个。因此,该运算在指标记号中可表示为:
∇ ⋅ u = u i , i . \nabla \cdot u=u_{i, i} . ∇⋅u=ui,i.

The divergence of a second-order tensor A is

二阶张量 A A A 的散度为:
∇ ⋅ A = A i j , j . \nabla \cdot A=A_{i j, j} . ∇⋅A=Aij,j.

Likewise, the curl of a vector u is

同理,矢量 u u u 的旋度为:
∇ × u = ε i j k u j , k . \nabla × u=\varepsilon_{i j k} u_{j, k} . ∇×u=εijkuj,k.

The Laplacian of a quantity involves both the gradient and the divergence. For a scalar u , we apply the gradient and divergence sequentially, accordingly,

物理量的拉普拉斯算子同时涉及梯度和散度运算。对于标量 u u u,依次进行梯度和散度运算可得:
Δ u = u , i i . \Delta u=u_{, ii} . Δu=u,ii.

For a vector u , we will have

对于矢量 u u u,其拉普拉斯算子为:
Δ u = u i , j j . \Delta u=u_{i, j j} . Δu=ui,jj.

The derivative ∂ x i / ∂ x j \partial x_{i} / \partial x_{j} ∂xi/∂xj can be deduced by noting that ∂ x i / ∂ x j = 1 \partial x_{i} / \partial x_{j}=1 ∂xi/∂xj=1 when i = j i=j i=j and ∂ x i / ∂ x j = 0 \partial x_{i} / \partial x_{j}=0 ∂xi/∂xj=0 when i ≠ j i ≠j i=j . Therefore ∂ x i ∂ x j = δ i j \frac {\partial x_{i}}{\partial x_{j}}=\delta_{i j} ∂xj∂xi=δij

偏导数 ∂ x i / ∂ x j \partial x_i/\partial x_j ∂xi/∂xj 可由以下关系推导得出:当 i = j i=j i=j 时, ∂ x i / ∂ x j = 1 \partial x_i/\partial x_j=1 ∂xi/∂xj=1;当 i ≠ j i≠j i=j 时, ∂ x i / ∂ x j = 0 \partial x_i/\partial x_j=0 ∂xi/∂xj=0。因此有 ∂ x i ∂ x j = δ i j \frac {\partial x_{i}}{\partial x_{j}}=\delta_{i j} ∂xj∂xi=δij。

The same argument can be used for higher-order tensors

该推导思路同样适用于高阶张量,即:
∂ A i j ∂ A k l = δ i k δ j l . \frac {\partial A_{i j}}{\partial A_{k l}}=\delta_{i k} \delta_{j l} . ∂Akl∂Aij=δikδjl.

Examples of Algebraic Manipulations Using Index Notation

指标记号下的代数运算实例

  1. Let a, b, c,d be vectors. Prove that

    设 a a a、 b b b、 c c c、 d d d 为矢量,证明:
    ( a × b ) ⋅ ( c × d ) = ( a ⋅ c ) ( b ⋅ d ) − ( b ⋅ c ) ( a ⋅ d ) . ( 8.14 ) (a× b)\cdot (c× d)=(a\cdot c)(b\cdot d)-(b\cdot c)(a\cdot d). (8.14) (a×b)⋅(c×d)=(a⋅c)(b⋅d)−(b⋅c)(a⋅d).(8.14)

    Solution

    证明

    Let us first express the left-hand side of the equation using index notation. Please check the rules for cross products and dot products of vectors to see how this is done.

    首先将等式左侧通过指标记号表示,可结合矢量叉乘和点乘的规则进行推导:
    ( a × b ) ⋅ ( c × d ) = ε i j k a j b k ε i m n c m d n ( 8.15 ) (a × b) \cdot (c × d)=\varepsilon_{i j k} a_{j} b_{k} \varepsilon_{i m n} c_{m} d_{n} (8.15) (a×b)⋅(c×d)=εijkajbkεimncmdn(8.15)

    Recall the identity

    由置换符号的恒等式:
    ε i j k ε i m n = δ j m δ k n − δ j n δ m k . \varepsilon_{i j k} \varepsilon_{i m n}=\delta_{j m} \delta_{k n}-\delta_{j n} \delta_{m k} . εijkεimn=δjmδkn−δjnδmk.

    Then we will have

    可得:
    ε i j k a j b k ε i m n c m d n = ( δ j m δ k n − δ j n δ m k ) a j b k c m d n \varepsilon_{i j k} a_{j} b_{k} \varepsilon_{i m n} c_{m} d_{n}=\left (\delta_{j m} \delta_{k n}-\delta_{j n} \delta_{m k}\right) a_{j} b_{k} c_{m} d_{n} εijkajbkεimncmdn=(δjmδkn−δjnδmk)ajbkcmdn

    Multiply out, and note that

    并注意到克罗内克 δ 的运算性质:
    δ k n a k = a n . \delta_{k n} a_{k}=a_{n} . δknak=an.

    and


    δ j m a j = a m \delta_{j m} a_{j}=a_{m} δjmaj=am

    That is, multiplying a Kronecker delta has the effect of switching indices; hence

    即与克罗内克 δ 相乘具有指标替换的效果,因此展开上式可得:
    ( δ j m δ k n − δ j n δ m k ) a j b k c m d n = a m b n c m d n − a n b m c m d n \left (\delta_{j m} \delta_{k n}-\delta_{j n} \delta_{m k}\right) a_{j} b_{k} c_{m} d_{n}=a_{m} b_{n} c_{m} d_{n}-a_{n} b_{m} c_{m} d_{n} (δjmδkn−δjnδmk)ajbkcmdn=ambncmdn−anbmcmdn

    Finally, note that a m c m = a ⋅ c a_{m} c_{m}=a \cdot c amcm=a⋅c and similarly for other products with the same index so that

    最后,由哑指标的求和性质可知 a m c m = a ⋅ c a_m c_m=a\cdot c amcm=a⋅c,其余具有相同哑指标的乘积同理,因此:
    a m b n c m d n − a n b m c m d n = a m c m b n d n − b m c m a n d n = ( a ⋅ c ) ( b ⋅ d ) − ( b ⋅ c ) ( a ⋅ d ) \begin {aligned} a_{m} b_{n} c_{m} d_{n}-a_{n} b_{m} c_{m} d_{n} & =a_{m} c_{m} b_{n} d_{n}-b_{m} c_{m} a_{n} d_{n} \\ & =(a \cdot c)(b \cdot d)-(b \cdot c)(a \cdot d) \end {aligned} ambncmdn−anbmcmdn=amcmbndn−bmcmandn=(a⋅c)(b⋅d)−(b⋅c)(a⋅d)

  2. The stress-strain relation for linear elasticity may be expressed as

    线弹性力学的本构关系(应力 - 应变关系)可表示为:
    σ i j = E 1 + v ( ε i j + v 1 − 2 v ε k k δ i j ) \sigma_{i j}=\frac {E}{1+v}\left (\varepsilon_{i j}+\frac {v}{1-2 v} \varepsilon_{k k} \delta_{i j}\right) σij=1+vE(εij+1−2vvεkkδij)

    where σ i j \sigma_{i j} σij and ε i j \varepsilon_{i j} εij are the components of the stress and strain tensor and E and v are Young's modulus and Poisson's ratio, respectively. Find an expression for strain in terms of stress.

    其中 σ i j \sigma_{ij} σij 和 ε i j \varepsilon_{ij} εij 分别为应力张量和应变张量的分量, E E E 和 v v v 分别为杨氏模量和泊松比。试推导以应力表示的应变表达式。

    Solution

    求解

    Set i = j i=j i=j to see that

    令 i = j i=j i=j,对指标进行缩并可得:
    σ i i = E 1 + v ( ε i i + v 1 − 2 v ε k k δ i i ) . \sigma_{i i}=\frac {E}{1+v}\left (\varepsilon_{i i}+\frac {v}{1-2 v} \varepsilon_{k k} \delta_{i i}\right) . σii=1+vE(εii+1−2vvεkkδii).

    Recall that δ i i = 3 \delta_{ii}=3 δii=3 , and notice that we can replace the remaining i i ii ii by k k k k kk , so that

    由克罗内克 δ 的性质可知 δ i i = 3 \delta_{ii}=3 δii=3,且可将式中其余的缩并指标 i i ii ii 替换为 k k kk kk,因此:
    σ k k = E 1 + v ( ε k k + v 1 − 2 v 3 ε k k ) = E 1 − 2 v ε k k \sigma_{k k}=\frac {E}{1+v}\left (\varepsilon_{k k}+\frac {v}{1-2 v} 3 \varepsilon_{k k}\right)=\frac {E}{1-2v} \varepsilon_{k k} σkk=1+vE(εkk+1−2vv3εkk)=1−2vEεkk

    and

    进而可得:
    ε k k = 1 − 2 v E σ k k . \varepsilon_{k k}=\frac {1-2 v}{E} \sigma_{k k} . εkk=E1−2vσkk.

    Now, substitute for ε k k \varepsilon_{k k} εkk in the given stress-strain relation and we obtain

    将 ε k k \varepsilon_{kk} εkk 代入原应力 - 应变关系中,可得:
    σ i j = E 1 + v ( ε i j + v E σ k k δ i j ) \sigma_{i j}=\frac {E}{1+v}\left (\varepsilon_{i j}+\frac {v}{E} \sigma_{k k} \delta_{i j}\right) σij=1+vE(εij+Evσkkδij)

    and

    整理后得到以应力表示的应变表达式:
    ε i j = 1 + v E ( σ i j − v 1 + v σ k k δ i j ) . \varepsilon_{i j}=\frac {1+v}{E}\left (\sigma_{i j}-\frac {v}{1+v} \sigma_{k k} \delta_{i j}\right) . εij=E1+v(σij−1+vvσkkδij).


Free and Dummy Indices in Tensor Notation

张量记号中的自由指标与哑指标

SciencePedia

Key Takeaways

  • Free indices must be identical in every term of a tensor equation, defining the equation's subject and the tensor's rank.

    自由指标在张量方程的每一项中必须完全相同,它们确定方程的描述对象与张量的阶数。

  • Dummy indices appear once as a superscript and once as a subscript in a single term, indicating a summation or "contraction" over that index.

    哑指标在同一项中以上标与下标各出现一次,表示对该指标进行求和或「缩并」。

  • The Einstein summation convention provides a grammar for physical laws, ensuring consistency and revealing the structure of theories from general relativity to solid mechanics.

    爱因斯坦求和约定为物理定律提供了一套语法规则,保证理论自洽,并揭示从广义相对论到固体力学等理论的结构。

  • This notation acts as a blueprint for modern computation, allowing scientists to predict the computational cost of complex simulations by counting indices.

    这套记号是现代计算的设计蓝图,科学家可通过统计指标数量预估复杂模拟的计算代价。

Introduction

引言

In the worlds of physics and mathematics, equations are often adorned with a seemingly complex array of subscripts and superscripts. This notation, known as the Einstein summation convention, is far from a mere stylistic choice; it is a powerful language designed to express universal physical laws in a coordinate-independent way. However, mastering this language requires understanding its fundamental grammar, particularly the distinction between its two main players: free and dummy indices. This article demystifies this notation, addressing the common challenge of interpreting these indices correctly. First, in "Principles and Mechanisms," we will dissect the rules governing free and dummy indices, learning how they ensure the validity of tensor equations. Following this, under "Applications and Interdisciplinary Connections," we will explore how this elegant shorthand becomes a profound tool, shaping theories in general relativity, guiding calculations in solid mechanics, and even powering modern computational science.

在物理学与数学中,方程常带有形式上看似复杂的下标与上标组合。这套记号被称为爱因斯坦求和约定 ,它并非单纯的书写风格选择,而是一套用于以坐标无关方式表述普适物理定律的强大语言。掌握这套语言需要理解其基本语法,尤其是两类核心指标 ------自由指标哑指标------ 的区别。本文将对这套记号进行解析,解决正确理解指标时常见的困难。首先在「原理与机制」部分,我们拆解支配自由指标与哑指标的规则,学习它们如何保证张量方程的合法性。随后在「应用与学科交叉」部分,我们将展示这套简洁记号如何成为深刻工具,支撑广义相对论理论、指导固体力学计算,并推动现代计算科学发展。

Principles and Mechanisms

原理与机制

The sub- and superscripts that adorn equations in advanced physics and mathematics are not arbitrary decorations. This notation, known as theEinstein summation convention , is a precise and powerful language developed for clarity, not obfuscation. It is designed to express the profound idea that physical laws are independent of the observer's coordinate system. This convention allows for the formulation of physical laws in a universal, or covariant, form that remains unchanged under coordinate transformations.

高等物理与数学方程中的上标与下标并非随意装饰。这套被称为爱因斯坦求和约定的记号,是一套精确而强大的语言,其目的是清晰而非晦涩。它用于表述物理定律与观测者坐标系无关这一深刻思想,并允许物理定律以普适(协变)形式表述,在坐标变换下保持不变。

To learn this language, we must first meet its two main characters: thefree index and thedummy index .

要掌握这门语言,首先需要认识它的两个核心要素:自由指标哑指标

The Law of the Free Index: What an Equation is About

自由指标法则:方程描述的对象

Think of a tensor equation as a declarative sentence. The free indices tell you what the sentence is about---its subject. They are the indices that appear exactly once in every single term of an equation. For an equation to make any sense, it's like saying every part of your sentence has to agree on the subject. If the left side of an equation is a vector---a quantity with a direction, which we can denote with a single free index like A i A^i Ai---then the right side must also, after all its internal machinations are done, be a vector of the same type, B i B^i Bi.

可以把张量方程看作陈述句。自由指标指明句子的主题,它们在方程的每一项中恰好出现一次 。一个有意义的方程,要求所有部分都对应同一主题。若方程左侧是矢量(带有方向的量,可用单个自由指标如 A i A^i Ai 表示),则右侧在所有内部运算完成后也必须是同类型矢量 B i B^i Bi。

You cannot, for instance, add a vector pointing north to a temperature. They are different kinds of beasts. This is the rule of tensor algebra, and it's enforced by the free indices. Consider a simple, but invalid, proposed equation: F i = T i j V j + W i F^i = T^{ij} V_j + W_i Fi=TijVj+Wi. The term on the left, F i F^i Fi, tells us we are talking about a quantity of type "upper- i i i". Looking at the right side, the first term, T i j V j T^{ij} V_j TijVj, has its j j j index summed over (we'll get to that in a moment), leaving a free index i i i in the upper position. So far, so good! It's an "upper- i i i" quantity. But look at the second term, W i W_i Wi. Its free index i i i is in the lower position. This is a different kind of object, a "lower- i i i". You can't add an "upper- i i i" to a "lower- i i i". The equation is trying to add apples and oranges.

例如,不能将指向北方的矢量与温度相加,二者属于不同类型的量。这是张量代数的规则,由自由指标保证。考虑一个简单但不合法的方程:
F i = T i j V j + W i F^i = T^{ij} V_j + W_i Fi=TijVj+Wi

左侧项 F i F^i Fi 表明这是「上标 i i i」型量。右侧第一项 T i j V j T^{ij} V_j TijVj 中指标 j j j 被求和(稍后说明),剩余上标自由指标 i i i,这部分是合法的「上标 i i i」量。但第二项 W i W_i Wi 的自由指标 i i i 为下标 ,属于「下标 i i i」型量。上标 i i i 量与下标 i i i 量不能相加,该方程相当于把苹果和橙子相加。

This rule, theconservation of free indices , is absolute. Every term, on both sides of the equals sign, must have the exact same set of free indices, in the exact same up-or-down positions. An equation like A i j = B j k C k A^j_i = B^{jk} C_k Aij=BjkCk is nonsense for the same reason. The left side, A i j A^j_i Aij, has two free indices, i i i (up) and j j j (down). The right side, after its internal summation over k k k, is left with only a single free index, j j j (down). The index i i i has vanished! It's like having an equation that says "a velocity is equal to a pressure." It's not just wrong; it's meaningless.
自由指标守恒 是绝对规则。等号两侧每一项都必须拥有完全相同的自由指标集合,且上下位置严格一致。
A i j = B j k C k A^j_i = B^{jk} C_k Aij=BjkCk

这样的方程同样无意义:左侧 A i j A^j_i Aij 有两个自由指标 i i i(上)与 j j j(下);右侧对 k k k 内部求和后只剩一个自由指标 j j j(下),指标 i i i 消失。这就如同说「速度等于压强」,不仅错误,而且毫无意义。

The number of free indices tells you therank of the tensor.

自由指标的数量决定张量的阶数

-**Zero free indices:**A scalar (a single number, like temperature).
零个自由指标:标量(单一数值,如温度)。

-**One free index:**A vector (a quantity with magnitude and direction).
一个自由指标:矢量(具有大小与方向的量)。

-**Two free indices:**A rank-2 tensor (like stress, σ i j \sigma_{ij} σij, or the metric, g μ ν g_{\mu\nu} gμν).
两个自由指标 :二阶张量(如应力 σ i j \sigma_{ij} σij 或度规 g μ ν g_{\mu\nu} gμν)。

  • And so on.
    以此类推。

For a valid equation relating tensors, the free indices are the public-facing identity of the object, and they must be consistent across the board.

对合法的张量方程,自由指标是对象的外在标识,必须在全方程保持一致。

The Secret Life of the Dummy Index: The Workers Behind the Scenes

哑指标的内在作用:幕后运算单元

So, what about those other indices, the ones that don't survive to the end? These are thedummy indices , and they are the workhorses of the notation. A dummy index is one that appears exactly twice in a single term, once as a superscript and once as a subscript. (We'll address a small exception to this up/down rule in a moment). When you see this pairing, it's a quiet instruction: "sum over all possible values of this index."

那些最终不保留的指标就是哑指标 ,它们是这套记号的运算核心。哑指标在同一项中恰好出现两次 ,一次上标、一次下标(稍后说明上下规则的特例)。看到这种成对出现,就表示:对该指标的所有可能取值求和

For example, in the expression for index lowering, v k = g k j v j v_k = g_{kj} v^j vk=gkjvj, the index j j j appears once down in g k j g_{kj} gkj and once up in v j v^j vj. It is therefore a dummy index. The expression is shorthand for the sum:

例如指标下降表达式: v k = g k j v j v_k = g_{kj} v^j vk=gkjvj,指标 j j j 在 g k j g_{kj} gkj 中为下标,在 v j v^j vj 中为上标,因此是哑指标。该式是求和式的简写:
v k = ∑ j = 0 D − 1 g k j v j v_k = \sum_{j=0}^{D-1} g_{kj} v^j vk=j=0∑D−1gkjvj

where D D D is the number of dimensions in our space. Notice how j j j is gone from the final result; it has been summed out of existence. The only index left is k k k, the free index.

其中 D D D 是空间维数。可见 j j j 在最终结果中消失,已被求和消去,仅剩自由指标 k k k。

This summation process is calledcontraction . It's the operation that allows us to combine tensors to create new ones. Let's look at the equation for elastic stress:

这一求和过程称为缩并 ,是组合张量生成新张量的基本运算。以弹性应力方程为例:
σ i j = λ δ i j ϵ k k + 2 μ ϵ i j \sigma_{ij} = \lambda\delta_{ij}\epsilon_{kk} + 2\mu\epsilon_{ij} σij=λδijϵkk+2μϵij

  • The free indices are i i i and j j j. They appear on the left, and in both terms on the right.

    自由指标为 i i i 与 j j j,出现在左侧及右侧两项中。

  • In the first term on the right, the index k k k appears twice as a subscript in ϵ k k \epsilon_{kk} ϵkk. This is the trace of the strain tensor, a sum over the diagonal components ( ϵ 11 + ϵ 22 + ϵ 33 \epsilon_{11}+\epsilon_{22}+\epsilon_{33} ϵ11+ϵ22+ϵ33), and k k k is the dummy index for this operation.

    右侧第一项中指标 k k k 在 ϵ k k \epsilon_{kk} ϵkk 中以下标出现两次,代表应变张量的迹,即对角分量之和 ϵ 11 + ϵ 22 + ϵ 33 \epsilon_{11}+\epsilon_{22}+\epsilon_{33} ϵ11+ϵ22+ϵ33, k k k 是该运算的哑指标。

One of the most beautiful things about dummy indices is that their name doesn't matter. They are anonymous workers. The expression A i B i A^iB_i AiBi is a scalar. The expression A k B k A^kB_k AkBk is the exact same scalar . The choice of letter is purely a matter of convenience. This might seem trivial, but it's a statement about abstraction. However, you must be careful. Within a single equation, if you have multiple, independent summations, you must use different dummy letters for each to avoid confusion.

哑指标的一个优美性质是其符号无关紧要。 A i B i A^iB_i AiBi 是标量, A k B k A^kB_k AkBk 是完全相同 的标量,字母选择仅为方便。这看似平凡,却体现了抽象性。但需注意:同一方程中若存在多个独立求和,必须使用不同哑指标字母以避免混淆。

The Ultimate Contraction: The Scalar Invariant

极致缩并:标量不变量

What happens if we keep contracting indices until there are no free indices left? We get something truly special: ascalar invariant . This is a quantity with zero free indices---a pure number whose value all observers will agree upon, regardless of their coordinate system. It represents a fundamental, objective piece of reality.

若持续缩并直至无自由指标剩余,将得到一类特殊量:标量不变量。它不含自由指标,是所有观测者在任意坐标系下都认同的纯数值,代表客观的基本物理量。

One of the most famous examples comes from electromagnetism. The electromagnetic field is described by a tensor F μ ν F_{\mu\nu} Fμν. We can construct a quantity like this: g μ α g ν β F μ ν F α β g^{\mu\alpha} g^{\nu\beta} F_{\mu\nu} F_{\alpha\beta} gμαgνβFμνFαβ. Let's count the indices. The index μ \mu μ appears once up (in F μ ν F_{\mu\nu} Fμν) and once down (in g μ α g_{\mu\alpha} gμα). It's a dummy. The same is true for ν \nu ν, α \alpha α, and β \beta β. Every single index is paired up and summed over. There are no free indices left. The result is a scalar. This particular scalar is proportional to E 2 − c 2 B 2 E^2 - c^2B^2 E2−c2B2, a fundamental invariant of the electromagnetic field. It's a way of asking the universe a question and getting a single numerical answer that is true for everyone. This is the ultimate goal of writing physics in the language of tensors.

电磁学中最著名的例子之一:电磁场由张量 F μ ν F_{\mu\nu} Fμν 描述,可构造如下量: g μ α g ν β F μ ν F α β g^{\mu\alpha} g^{\nu\beta} F_{\mu\nu} F_{\alpha\beta} gμαgνβFμνFαβ 指标 μ \mu μ 在 F μ ν F_{\mu\nu} Fμν 中为上标,在 g μ α g_{\mu\alpha} gμα 中为下标,是哑指标。 ν \nu ν、 α \alpha α、 β \beta β 同理。所有指标均成对求和,无自由指标剩余,结果为标量。该标量与 E 2 − c 2 B 2 E^2 - c^2B^2 E2−c2B2 成正比,是电磁场的基本不变量。它以单一数值回答对宇宙的提问,对所有观测者都成立,这正是用张量语言描述物理的最终目标之一。

A Note on Flat Space: The Cartesian Shortcut

平直空间注记:笛卡尔坐标系捷径

Now for that exception I mentioned. You may have heard that a dummy index must appear once up and once down. This is absolutely true for the mathematics of general relativity and curved spaces, where the distinction between contravariant (upper) and covariant (lower) vectors is crucial for ensuring coordinate independence. The machinery for this is themetric tensor , g i j g_{ij} gij, which acts as a translator, lowering an index ( v i = g i j v j v_i = g_{ij} v^j vi=gijvj) or, with its inverse g i j g^{ij} gij, raising one ( v i = g i j v j v^i = g^{ij} v_j vi=gijvj).

下面说明前面提到的特例。哑指标必须 一上一下出现,这在广义相对论与弯曲空间数学中严格成立,因为逆变(上标)与协变(下标)矢量的区分对保证坐标无关性至关重要。实现这一区分的工具是度规张量 g i j g_{ij} gij,它可实现指标下降: v i = g i j v j v_i = g_{ij} v^j vi=gijvj 或通过逆度规 g i j g^{ij} gij 实现指标上升: v i = g i j v j v^i = g^{ij} v_j vi=gijvj

However, in the familiar, flat Euclidean space of introductory physics and solid mechanics, described by a simple Cartesian grid, the metric tensor is just the identity matrix ( δ i j \delta_{ij} δij). In this special case, raising and lowering an index doesn't change the numerical value of its components. Because of this, it has become common practice to be a bit lazy with the index positions. You will often see expressions like A i j B i k A_{ij} B_{ik} AijBik, where the index i i i is summed over despite both instances being subscripts. For instance, in an expression like A i j B i k C j A_{ij} B_{ik} C^j AijBikCj, the indices i i i and j j j are both treated as dummy indices being summed over, leaving k k k as the single free index.

但在基础物理与固体力学常用的平直欧几里得空间(笛卡尔网格)中,度规张量为单位矩阵 δ i j \delta_{ij} δij。此时升降指标不改变分量数值,因此实践中常放宽指标位置要求。常会见到 A i j B i k A_{ij} B_{ik} AijBik 这样的表达式,指标 i i i 虽均为下标仍被求和。例如: A i j B i k C j A_{ij} B_{ik} C^j AijBikCj 中 i i i 与 j j j 均作为哑指标被求和,仅剩自由指标 k k k。

This is a contextual shortcut. It works perfectly well in a Cartesian frame, but it's important to remember that it's a special case. The more general and robust rule---one up, one down---is what gives tensor notation its full power to describe the universe on its own terms, free from the prisons of our parochial coordinate systems. And embracing that power is what this beautiful language is all about.

这是场景化捷径,在笛卡尔坐标系中有效,但属于特例。更普适、更严格的规则 ------ 一上一下 ------ 才赋予张量记号完整能力,使其摆脱局部坐标系限制,以自然方式描述宇宙。这正是这套优美语言的价值所在。

Applications and Interdisciplinary Connections

应用与学科交叉

So, we have learned the rules of this little game---this "summation convention" where we drop the sigma signs and let repeated indices fend for themselves. You might be thinking it's just a bit of notational laziness, a convenient shorthand for physicists who couldn't be bothered to write ∑ \sum ∑ all day. And, well, you're not entirely wrong! But it turns out to be one of those wonderfully deep "shorthands" that, by making things simpler, reveals the hidden structure of the world. This isn't just about saving ink; it's the natural language for expressing physical laws, a grammar that keeps our theories honest, and a blueprint for some of the most powerful computational tools we have today. Let's see how this simple idea blossoms across science.

我们已经学习了这套求和约定的规则:省略求和号 ∑ \sum ∑,由重复指标自动表示求和。你可能认为这只是书写简化,方便物理学家偷懒少写求和号。这种看法并非完全错误,但这套简洁记号实则意义深刻:它在简化书写的同时,揭示世界的隐藏结构。它不仅节省笔墨,更是表述物理定律的自然语言、保证理论自洽的语法规则,以及现代强大计算工具的设计蓝图。下面看这一简单思想如何在各学科中绽放。

The Grammar of Physics: Keeping Our Stories Straight

物理语法:保证理论自洽

Before you can write a correct physical law, you need a language with rules. You can't say "a force equals a velocity," because the units are all wrong. The summation convention provides a powerful set of grammatical rules for the language of tensors. A "free index"---one that isn't summed over---tells you the character of an object.

写出正确物理定律前,需要一套有规则的语言。不能说「力等于速度」,因为量纲完全不同。求和约定为张量语言提供了严格语法规则。自由指标(不被求和的指标)决定对象的类型:

  • An object with no free indices, like A i B i A^iB_i AiBi, is a scalar.

    无自由指标(如 A i B i A^iB_i AiBi):标量

  • An object with one, like V j V_j Vj, is a vector.

    一个自由指标(如 V j V_j Vj):矢量

  • An object with two, T i j T_{ij} Tij, is a rank-2 tensor, and so on.

    两个自由指标(如 T i j T_{ij} Tij):二阶张量

The cardinal rule is simple: in any valid equation, the free indices on the left side must exactly match the free indices on the right side, term by term.

依此类推,基本规则很简单:合法方程中,左侧每一项的自由指标必须与右侧逐项完全匹配。

This rule is our first line of defense against writing nonsense. If you were to write down an equation like A i j = E k ( i j ) A_{ij} = E_k (ij) Aij=Ek(ij), the notation itself screams that something is wrong. The left side is a rank-2 tensor with two free indices, i i i and j j j. But the right side has three free indices, i i i, j j j, and k k k! You are trying to equate a matrix to a three-dimensional cube of numbers. The equation is "ungrammatical" and physically meaningless.

这一规则是避免写出荒谬方程的第一道防线。例如: A i j = E k ( i j ) A_{ij} = E_k (ij) Aij=Ek(ij) 左侧是含 i i i、 j j j 两个自由指标的二阶张量,右侧却有 i i i、 j j j、 k k k三个自由指标,相当于把矩阵等同于三维数块,方程「语法错误」,物理上无意义。

This rule also tells us how things can be added together. Consider a more complex physical relationship, like R k = A i B i ∂ k S + T j k V j R_k = A^iB_i\partial_k S + T_{jk} V^j Rk=AiBi∂kS+TjkVj. Let's dissect it. In the first term, A i B i ∂ k S A^iB_i\partial_k S AiBi∂kS, the index i i i is a dummy index---it's summed over and disappears, leaving only the free index k k k. So, this term represents a covector (a rank-1 covariant tensor). In the second term, T j k V j T_{jk} V^j TjkVj, the index j j j is the dummy, and again, only k k k remains free. This term, too, is a covector. The equation is telling us that one covector, R k R_k Rk, is the sum of two other covectors. The grammar checks out. Each term "lives" in the same kind of mathematical space, and we are free to add them. The notation automatically prevents us from adding apples to oranges.

该规则也指导合法加法运算。考虑更复杂的物理关系: R k = A i B i ∂ k S + T j k V j R_k = A^iB_i\partial_k S + T_{jk} V^j Rk=AiBi∂kS+TjkVj

第一项 A i B i ∂ k S A^iB_i\partial_k S AiBi∂kS 中指标 i i i 是哑指标,求和后消失,仅剩自由指标 k k k,表示协变矢量(一阶协变张量)。第二项 T j k V j T_{jk} V^j TjkVj 中指标 j j j 是哑指标,同样仅剩自由指标 k k k,也是协变矢量。方程表明协变矢量 R k R_k Rk 是另外两个协变矢量之和,语法合法,各项属于同一类数学空间,可以相加。记号体系自动避免了不同类型量的非法相加。

This game of "spot the free index" also tells us what we end up with after a complicated calculation. If a theorist mixes together four different tensors in a flurry of contractions, like A i j B k l m D i k D j l A^{ij} B_{klm} D_i {}^kD_j {}^l AijBklmDikDjl, how do they know what they've created? We just follow the indices! The indices i , j , k , i,j,k, i,j,k, and l l l each appear once up and once down, so they are all dummy indices, summed away into oblivion. The only index left standing is the lonely m m m. The result, therefore, is an object with one upper index, Q m Q^m Qm---a contravariant vector. The abstract rules of indices distill a complex interaction into a simple statement about the character of the final result.

「寻找自由指标」也能判断复杂计算的结果类型。若理论学家将四个张量通过多次缩并组合,例如: A i j B k l m D i k D j l A^{ij} B_{klm} D_i {}^kD_j {}^l AijBklmDikDjl 指标 i , j , k , l i,j,k,l i,j,k,l 均一上一下出现,均为哑指标并被求和消去,仅剩指标 m m m。因此结果是带有一个上标的量 Q m Q^m Qm,即逆变矢量。指标抽象规则将复杂相互作用提炼为对结果类型的简单判断。

The Language of Fields and Spacetime

场与时空的语言

The true power of this notation shines when we use it not just to check equations, but to write them. It provides an astonishingly compact and elegant way to describe the fundamental workings of the universe.

这套记号的真正威力不仅在于检验方程,更在于书写方程。它以极度简洁优美的方式描述宇宙基本运行规律。

Take Einstein's theory of general relativity. In the curved spacetime of our universe, the distinction between vectors with "upper" indices (contravariant) and "lower" indices (covariant) becomes physically meaningful. They are two different ways of describing the same physical arrow, and the dictionary for translating between them is the metric tensor, g i j g_{ij} gij. To change a twice-covariant tensor A m n A_{mn} Amn into its twice-contravariant cousin, you don't do some complicated dance. You simply "raise" the indices using the inverse metric, g i j g^{ij} gij. The operation is written as A k l = g k m g l n A m n A^{kl} = g^{km} g^{ln} A_{mn} Akl=gkmglnAmn. Notice the beautiful mechanics: the dummy index m m m in g k m g^{km} gkm finds the m m m in A m n A_{mn} Amn and contracts, raising the first index. The dummy index n n n in g l n g^{ln} gln does the same for the second. What's left are the free indices k k k and l l l upstairs. This is not just a mathematical trick; it's a statement about the geometry of spacetime, written with an elegance that almost hides its depth.

以爱因斯坦广义相对论为例。在宇宙弯曲时空中,「上标」(逆变)与「下标」(协变)矢量的区分具有物理意义,它们是同一物理矢量的两种描述方式,相互转换的工具是度规张量 g i j g_{ij} gij。将二阶协变张量 A m n A_{mn} Amn 变为二阶逆变张量,只需用逆度规 g i j g^{ij} gij 「上升」指标: A k l = g k m g l n A m n A^{kl} = g^{km} g^{ln} A_{mn} Akl=gkmglnAmn 其机制十分优美: g k m g^{km} gkm 中的哑指标 m m m 与 A m n A_{mn} Amn 中的 m m m 缩并,上升第一个指标; g l n g^{ln} gln 中的哑指标 n n n 上升第二个指标,剩余上标自由指标 k k k 与 l l l。这不仅是数学技巧,更是对时空几何的深刻表述,形式简洁而内涵深刻。

This elegance extends to other areas of continuum physics. Consider heat flowing through an anisotropic crystal, where heat flows more easily in some directions than others. The law governing this is captured by the equation

这种简洁性同样适用于连续介质物理。以各向异性晶体中的热传导为例,控制方程为:
ρ c ∂ t T = ∂ i ( K i j ∂ j T ) + q ˙ \rho c\partial_t T = \partial_i (K^{ij}\partial_j T) + \dot q ρc∂tT=∂i(Kij∂jT)+q˙

Let's read this story, from right to left, following the indices. First, we have the temperature T T T, a scalar field. The operator ∂ j \partial_j ∂j takes its gradient, ∂ j T \partial_j T ∂jT, producing a covector indicating the direction of steepest temperature change. This is then contracted with the material's conductivity tensor, K i j K^{ij} Kij. The dummy index j j j is summed over, leaving a free index i i i. Finally, the operator ∂ i \partial_i ∂i takes the divergence of the resulting vector field. The repeated index i i i is summed, resulting in a scalar term representing the net heat conduction. The rules of indices guide us perfectly through the physics.

从右向左按指标解读:

温度 T T T 是标量场;算符 ∂ j \partial_j ∂j 给出其梯度 ∂ j T \partial_j T ∂jT,得到表示温度最速下降方向的协变矢量;与材料电导张量 K i j K^{ij} Kij 缩并,哑指标 j j j 被求和,剩余自由指标 i i i;最后算符 ∂ i \partial_i ∂i 对所得矢量场求散度,重复指标 i i i 被求和,得到表示净热传导的标量项。指标规则完美引导物理意义的解读。

Perhaps one of the most stunning examples comes from solid mechanics. If you have a block of material and you deform it, how can you be sure you're describing a physically possible deformation---one without impossible gaps or overlaps appearing inside the material? The answer lies in the Saint-Venant compatibility conditions. In their full glory, they are a mess of partial derivatives. But in index notation, they become a statement of breathtaking simplicity:

固体力学中一个极为惊艳的例子是圣维南协调条件:描述材料变形时,如何保证变形物理可行(内部无不可实现的空隙或重叠)?偏导数形式的原始表达式十分复杂,但在指标记号下极为简洁:
ϵ i p q ϵ j r s ε q r , p s = 0 \epsilon_{ipq}\epsilon_{jrs}\varepsilon_{qr,ps} = 0 ϵipqϵjrsεqr,ps=0

Here, ε q r \varepsilon_{qr} εqr is the strain tensor. The expression on the left is a rank-2 tensor, because i i i and j j j are the free indices. Setting it to zero means every one of its components must be zero. Because this tensor happens to be symmetric in i i i and j j j, this single, compact equation actually contains six separate, complex differential equations. The simple grammatical rule that free indices must match (here, i i i and j j j on the left and no indices on the right for zero) encapsulates a profound physical constraint on the continuous nature of matter.

其中 ε q r \varepsilon_{qr} εqr 是应变张量。左侧表达式因自由指标 i i i、 j j j 而为二阶张量,令其为零即所有分量为零。由于该张量关于 i i i、 j j j 对称,这一紧凑方程实际包含 6 个 独立复杂微分方程。自由指标匹配这一简单语法规则(左侧 i i i、 j j j,右侧零无指标),封装了对物质连续性的深刻物理约束。

The Blueprint for Modern Computation

现代计算的设计蓝图

In recent decades, this century-old notation has found a vibrant new life at the heart of the computational revolution. It turns out that the language of theoretical physics is also the perfect language for telling a computer how to handle the massive, multi-dimensional datasets of the modern world.

近几十年来,这套百年记号在计算革命核心地带重获新生。理论物理语言同样是指导计算机处理现代海量高维数据的理想语言。

Consider the challenge of analyzing brain activity from an EEG, which gives you a flood of data: voltage at each electrode, at each moment in time, for every frequency component. You can arrange this data into a giant three-dimensional array, or a rank-3 tensor V i t c V_{itc} Vitc. How do you find meaningful patterns? For instance, how is the activity in one electrode, i i i, related to the activity in another, j j j? You compute the covariance matrix, R i j R_{ij} Rij. The formula, written in index notation, is an instruction to the computer:

以脑电图 EEG 脑活动分析为例:数据包含每个电极、每个时刻、每个频率成分的电压,可排列为三维数组(三阶张量) V i t c V_{itc} Vitc。如何发现有意义的模式?例如电极 i i i 与电极 j j j 活动的关联?可计算协方差矩阵 R i j R_{ij} Rij,指标记号下的公式直接作为计算机指令:
R i j = 1 T C V i t c V j t c R_{ij} = \frac {1}{T C} V_{itc} V_{jtc} Rij=TC1VitcVjtc

The free indices i i i and j j j tell the computer what the final output should be---a matrix indexed by pairs of electrodes. The dummy indices, t t t and c c c, tell it exactly what to do: for each pair ( i , j ) (i,j) (i,j), multiply the corresponding values and sum them up over all of time and frequency. This is the language behind many modern data analysis techniques, from machine learning to signal processing.

自由指标 i i i、 j j j 告诉计算机最终输出是按电极对索引的矩阵;哑指标 t t t、 c c c 指明运算:对每一对 ( i , j ) (i,j) (i,j),将对应数值相乘并对所有时间与频率求和。这是机器学习、信号处理等现代数据分析技术的底层语言。

This idea of representing contractions graphically has given rise to the field of "tensor networks," where a calculation like

将缩并图形化表示催生了张量网络 领域。形如
D k = ∑ i , j A i , j B j , k C i D_k = \sum_{i,j} A_{i,j} B_{j,k} C_i Dk=i,j∑Ai,jBj,kCi

is drawn as a diagram of nodes (the tensors) connected by lines (the dummy indices). The "open" lines that don't connect to anything else are the free indices of the final result. This graphical language, whose rules are precisely the rules of free and dummy indices, is revolutionizing how we simulate complex quantum systems.

的计算可画为由节点(张量)与连线(哑指标)构成的图。不连接任何其他节点的「开放」连线是最终结果的自由指标。这套以自由指标与哑指标规则为基础的图形语言,正革新复杂量子系统的模拟方式。

Finally, and perhaps most practically, the summation convention gives us an almost magical way to predict the cost of a large-scale scientific simulation. Consider the formidable CCSD (T) method in quantum chemistry, a "gold standard" for calculating molecular energies. How long does it take to run? We don't need to be experts in the algorithm; we just need to look at the equations. The most computationally expensive step involves contracting tensors in a way that can be represented schematically by an expression like

最后,也是最实用的一点:求和约定提供了预估大规模科学模拟计算代价的直观方法。以量子化学中计算分子能量的「金标准」CCSD (T) 方法为例,无需精通算法,只需观察方程。计算代价最高的步骤涉及张量缩并,可示意表示为:
∑ i j k ∑ a b c ∑ d t i j a d ( k d ∣ ∣ b c ) ... \sum_{ijk}\sum_{abc}\sum_d t_{ijad}(kd\mid\mid bc)\dots ijk∑abc∑d∑tijad(kd∣∣bc)...

Just count the summation indices: i , j , k , a , b , c , d i,j,k,a,b,c,d i,j,k,a,b,c,d. There are seven of them! If the size of our system (roughly, the number of orbitals) is N N N, then the number of operations will scale as

统计求和指标: i , j , k , a , b , c , d i,j,k,a,b,c,d i,j,k,a,b,c,d,共7 个 。若系统尺度(近似为轨道数)为 N N N,运算量随 N 7 N^7 N7 增长。
N × N × N × N × N × N × N = N 7 N\times N\times N\times N\times N\times N\times N = N^7 N×N×N×N×N×N×N=N7

This tells a chemist, before they even begin, that doubling the size of their molecule will make the calculation 2 7 = 128 2^7 = 128 27=128 times longer. This simple act of counting indices directly translates an abstract piece of mathematics into a concrete prediction about time, money, and the limits of what is computationally possible.

化学家在计算开始前就可判断:分子尺度加倍,计算时间变为 2 7 = 128 2^7 = 128 27=128 倍。简单统计指标即可将抽象数学转化为对时间、成本与计算可行性的具体预估。

So you see, this little convention of dropping summation signs is far more than a convenience. It is a deep principle that enforces logical consistency, a language of beautiful brevity for the laws of nature, and a powerful blueprint for computation. It is a thread that connects the geometry of the cosmos, the behavior of matter, and the frontier of what we can simulate and understand.

由此可见,省略求和号这一小约定远不只是方便。它是保证逻辑自洽的深刻原理,是描述自然定律的简洁优美语言,也是支撑计算的强大蓝图。它将宇宙几何、物质行为与我们可模拟、可理解的前沿领域紧密连接在一起。


行列式的指标表示与符号含义

1 列维‑奇维塔 (Levi-Civita) 符号

定义

3‑维空间中的全反对称单位张量(置换符号) ε i j k \varepsilon_{ijk} εijk 定义为

ε i j k = {    1 , ( i , j , k ) 为 ( 1 , 2 , 3 ) 的偶置换 , − 1 , ( i , j , k ) 为 ( 1 , 2 , 3 ) 的奇置换 ,    0 , 指标存在重复 . \varepsilon_{ijk}= \begin {cases} \; 1, & (i,j,k)\ \text {为}\ (1,2,3)\ \text {的偶置换},\\[4pt] -1, & (i,j,k)\ \text {为}\ (1,2,3)\ \text {的奇置换},\\[4pt] \; 0, & \text {指标存在重复}. \end {cases} εijk=⎩ ⎨ ⎧1,−1,0,(i,j,k) 为 (1,2,3) 的偶置换,(i,j,k) 为 (1,2,3) 的奇置换,指标存在重复.

  • 置换的定义:若通过偶数次互换得到 ( i , j , k ) (i,j,k) (i,j,k) 则为偶置换;若奇数次则为奇置换。
  • 在成分上, i , j , k ∈ { 1 , 2 , 3 } i,j,k\in\{1,2,3\} i,j,k∈{1,2,3}。

2 行列式的指标形式

设 A A A 为 3 阶方阵,其行列式可写为

det ⁡ A = ε i j k A i 1 A j 2 A k 3 = 1 6   ε i j k ε l m n A l i A m j A n k . \det A = \varepsilon_{ijk} A_{i1} A_{j2} A_{k3} = \frac {1}{6}\,\varepsilon_{ijk}\varepsilon_{lmn} A_{li} A_{mj} A_{nk}. detA=εijkAi1Aj2Ak3=61εijkεlmnAliAmjAnk.

令 λ = det ⁡ A \lambda=\det A λ=detA,则等价关系可写成

λ = det ⁡ A ⟺ λ = 1 6   ε i j k ε l m n A l i A m j A n k = ε i j k A i 1 A j 2 A k 3 , ⟺ ε l m n λ = ε i j k A l i A m j A n k = ε i j k A i l A j m A k n . \begin {aligned} \lambda =\det A &\Longleftrightarrow &\lambda &=\frac {1}{6}\,\varepsilon_{ijk}\varepsilon_{lmn} A_{li} A_{mj} A_{nk} &&=\varepsilon_{ijk} A_{i1} A_{j2} A_{k3},\\[4pt] &\Longleftrightarrow &\varepsilon_{lmn}\lambda &=\varepsilon_{ijk} A_{li} A_{mj} A_{nk} &&=\varepsilon_{ijk} A_{il} A_{jm} A_{kn}. \end {aligned} λ=detA⟺⟺λεlmnλ=61εijkεlmnAliAmjAnk=εijkAliAmjAnk=εijkAi1Aj2Ak3,=εijkAilAjmAkn.

3 公式组成部分

3.1 公式结构

det ⁡ A ⏟ 行列式(标量) = ε i j k ⏟ 行置换符号 A i 1 A j 2 A k 3 ⏟ 列指标固定的矩阵元素 = 1 6 ⏟ 归一化 ( 3 ! ) ε i j k ε l m n ⏟ 双置换符号 A l i A m j A n k ⏟ 全指标自由的矩阵元素 \underbrace {\det A}{\text {行列式(标量)}}= \underbrace {\varepsilon{ijk}}{\text {行置换符号}} \underbrace {A{i1} A_{j2} A_{k3}}{\text {列指标固定的矩阵元素}}= \underbrace {\frac {1}{6}}{\text {归一化 }(3!)} \underbrace {\varepsilon_{ijk}\varepsilon_{lmn}}{\text {双置换符号}} \underbrace {A{li} A_{mj} A_{nk}}_{\text {全指标自由的矩阵元素}} 行列式(标量) detA=行置换符号 εijk列指标固定的矩阵元素 Ai1Aj2Ak3=归一化 (3!) 61双置换符号 εijkεlmn全指标自由的矩阵元素 AliAmjAnk

3.2 ε i j k A i 1 A j 2 A k 3 \varepsilon_{ijk} A_{i1} A_{j2} A_{k3} εijkAi1Aj2Ak3

符号 含义
ε i j k \varepsilon_{ijk} εijk 列维‑奇维塔符号,提供行列式展开中的符号交替。
A i 1 A_{i1} Ai1 矩阵 A A A 的第 i i i 行、第 1 列元素。
A j 2 A_{j2} Aj2 矩阵 A A A 的第 j j j 行、第 2 列元素。
A k 3 A_{k3} Ak3 矩阵 A A A 的第 k k k 行、第 3 列元素。

整体含义 :遵循爱因斯坦求和约定( i , j , k = 1 , 2 , 3 i,j,k=1,2,3 i,j,k=1,2,3)求和,等价于对第三列展开的 3 阶行列式。

3.3 1 6   ε i j k ε l m n A l i A m j A n k \dfrac {1}{6}\,\varepsilon_{ijk}\varepsilon_{lmn} A_{li} A_{mj} A_{nk} 61εijkεlmnAliAmjAnk

符号 含义
1 6 \dfrac {1}{6} 61 归一化常数,详见 3.4 节推导。
ε i j k \varepsilon_{ijk} εijk 第一组列维‑奇维塔符号,控制行指标的置换奇偶性。
ε l m n \varepsilon_{lmn} εlmn 第二组列维‑奇维塔符号,控制列指标的置换奇偶性。
A l i A_{li} Ali A A A 的第 l l l 行、第 i i i 列元素。
A m j A_{mj} Amj A A A 的第 m m m 行、第 j j j 列元素。
A n k A_{nk} Ank A A A 的第 n n n 行、第 k k k 列元素。

整体含义 :对全部指标 i , j , k , l , m , n i,j,k,l,m,n i,j,k,l,m,n 按爱因斯坦求和(即 1 ≤ i , j , k , l , m , n ≤ 3 1\le i,j,k,l,m,n\le3 1≤i,j,k,l,m,n≤3)求和,得到一种不再固定列号的完全指标对称形式。

3.4 归一化系数 1 6 \dfrac {1}{6} 61 的推导

问题 :为什么双 ε \varepsilon ε 表达式中的系数是 1 6 \dfrac {1}{6} 61?

推导过程

1.求和范围

  • 在 ε i j k ε l m n A l i A m j A n k \varepsilon_{ijk}\varepsilon_{lmn} A_{li} A_{mj} A_{nk} εijkεlmnAliAmjAnk 中,
    i , j , k , l , m , n ∈ { 1 , 2 , 3 } , i,j,k,l,m,n\in\{1,2,3\}, i,j,k,l,m,n∈{1,2,3},
    分别出现 3 3 3 种取值,形成 3 3 = 27 3^3=27 33=27 项。全部组合共 27 × 27 = 729 27\times27=729 27×27=729 项。

2.利用符号的筛选性质

  • ε i j k ≠ 0 \varepsilon_{ijk}\neq0 εijk=0 仅当 ( i , j , k ) (i,j,k) (i,j,k) 是 ( 1 , 2 , 3 ) (1,2,3) (1,2,3) 的置换,故非零项数为 3 ! = 6 3!=6 3!=6。
  • 同理 ( l , m , n ) (l,m,n) (l,m,n) 也只能是置换,故也有 6 6 6 种非零排列。
  • 因此有效非零组合为 6 × 6 = 36 6\times6=36 6×6=36 项。

3.与标准行列式对比

  • 标准展开式为
    det ⁡ A = ε i j k A i 1 A j 2 A k 3 , \det A=\varepsilon_{ijk} A_{i1} A_{j2} A_{k3}, detA=εijkAi1Aj2Ak3,
    其中只对行指标 ( i , j , k ) (i,j,k) (i,j,k) 的 6 6 6 种排列求和。

4.等式的等价性

  • 通过直接计数,可得
    ε i j k ε l m n A l i A m j A n k = 6 det ⁡ A . \varepsilon_{ijk}\varepsilon_{lmn} A_{li} A_{mj} A_{nk}=6\det A. εijkεlmnAliAmjAnk=6detA.

5.求归一化系数

  • 为使等式写成 det ⁡ A \det A detA 形式,需两边同时除以 6 6 6,得到
    det ⁡ A = 1 6   ε i j k ε l m n A l i A m j A n k . \det A=\frac {1}{6}\,\varepsilon_{ijk}\varepsilon_{lmn} A_{li} A_{mj} A_{nk}. detA=61εijkεlmnAliAmjAnk.

结论 :系数 1 6 \frac {1}{6} 61 派生于 3 ! = 6 3! = 6 3!=6,即对行、列分别全排列后的重复计数。

n n n 维推广

对 n n n 阶方阵 A A A,行列式的双 ε \varepsilon ε 表示为

det ⁡ A = 1 n !   ε i 1 i 2 ... i n   ε j 1 j 2 ... j n   A j 1 i 1 A j 2 i 2 ... A j n i n \det A = \frac {1}{n!}\, \varepsilon_{i_1 i_2 \dots i_n}\, \varepsilon_{j_1 j_2 \dots j_n}\, A_{j_1 i_1} A_{j_2 i_2}\dots A_{j_n i_n} detA=n!1εi1i2...inεj1j2...jnAj1i1Aj2i2...Ajnin

维数 n n n 归一化系数 说明
2 1 2 \frac {1}{2} 21 2 ! = 2 2! = 2 2!=2
3 1 6 \frac {1}{6} 61 3 ! = 6 3! = 6 3!=6
4 1 24 \frac {1}{24} 241 4 ! = 24 4! = 24 4!=24
一般 n n n 1 n ! \frac {1}{n!} n!1 由全排列数 n ! n! n! 产生

本质 :消除对同一行列式展开的重复计数 ------ 双 ε \varepsilon ε 结构使行、列指标分别进行全排列,每种有效展开被计数 n ! n! n! 次。

3.5 符号 ε l m n λ \varepsilon_{lmn}\lambda εlmnλ 的含义

符号 \qquad \qquad 定义 性质
ε l m n \varepsilon_{lmn} εlmn 列维‑奇维塔符号(三阶反对称张量) 取值由指标 l , m , n l,m,n l,m,n 的置换性质决定,为 + 1 , − 1 , 0 +1,-1,0 +1,−1,0。
λ = det ⁡ A \lambda=\det A λ=detA 方阵 A A A 的行列式(标量) 仅数值,无方向性。
ε l m n λ \varepsilon_{lmn}\lambda εlmnλ 标量与置换符号的乘积 仍是三阶反对称张量,对应把固定列指标 ( 1 , 2 , 3 ) (1,2,3) (1,2,3) 推广到任意 ( l , m , n ) (l,m,n) (l,m,n)。

4 矩阵与行列式的区别

概念 定义 示例
矩阵 由数构成的二维阵列,用于表示线性变换的系数表。 A = ( 1 0 0 0 2 0 0 0 3 ) A=\begin {pmatrix} 1&0&0\\0&2&0\\0&0&3\end {pmatrix} A= 100020003
行列式 定义在方阵上的映射,输出为标量(纯数),反映体积缩放等几何属性。 det ⁡ A = 1 ⋅ 2 ⋅ 3 = 6 \det A = 1\cdot2\cdot3 = 6 detA=1⋅2⋅3=6

行列式的意义:

把一个矩阵(即一组坐标)转化为一个标量,用于描述线性变换的缩放、方向、体积等特征。

矩阵 A A A 本身是结构(数表),而 det ⁡ A \det A detA 是从结构中抽取出来的纯数。两者不可混淆。

5 行列式的几何意义

符号 几何解释
det ⁡ A \det A detA 线性变换对空间体积的缩放系数。正值表示方向保持(右手坐标系),负值表示方向反转(左手坐标系)。
ε l m n \varepsilon_{lmn} εlmn 坐标系的定向(右手或左手)的记号。
ε l m n λ \varepsilon_{lmn}\lambda εlmnλ 变换后体积元大小定向(右 / 左手)的组合。


reference