Machine Learning in Space Weather （2）：Background

Machine Learning in Space Weather

（2020 by Mandar Chandorkar ）

Background

Space weather is the branch of physics that studies the time varying phenomena in the solar system. The principal driver of space weather phenomena is the Sun, specifically its magnetic field variations and the solar wind. The effect of solar variations on the planetary environment are caused by the coupling between solar wind particles and the magnetic field produced by the Earth. This chapter gives a semi-quantitative treatment of various scientific ideas relevant to space weather research.

3.1. Space Plasma

Plasma, also known as the fourth state of matter due to its properties that differentiate it from the conventional gaseous state, is ubiquitous through-out the visible Universe. Plasma is a gas which is composed of roughly equal number of positive and negatively charged particles, a property known as charge quasi-neutrality. The term quasi-neutral is used because although the gas has almost equal amounts of positive and negative charges, the mixture is electromagnetically active. Due to incomplete charge shielding, long range electromagnetic fields play a big role in the dynamics of plasma.

等离子体，因其特性与常规气态不同，被称为物质的第四态，在可见宇宙中无处不在。等离子体是一种气体，由几乎相等数量的正电荷和负电荷粒子组成，这一特性被称为电荷准中性。之所以称为"准中性"，是因为虽然气体中正电荷和负电荷的数量几乎相等，但这种混合物在电磁上是活跃的。由于电荷屏蔽不完全，长程电磁场在等离子体的动力学中起着重要作用。

Debye Length

In a quasi-neutral plasma, due to the presence of partial electric shielding the potential due to the charges now takes the well known Debye form

where r is the spatial distance with respect to the charge and is the permittivity of vacuum. The electric potential decays with the Debye length scale λ_d at which a balance between thermal vibrations which can disturb quasi-neutrality, and electrostatic forces due to charge separation, is achieved. The Debye length scale depends on the electron temperature and plasma density.

In equation (3.2) above, the Debye length scale is expressed in terms of the Boltzmann constant k_b, the electron temperature T_e, free space permittivity , and electron charge e. One can visualise the positively charged ions having a cloud of electrons shielding them at the distance of λ_d.

It is also possible to take into account the shielding effect of the ions. The effective Debye length is now expressed as an addition of two terms: one for electrons (equation (3.2)) and a similar term for the ions by replacing T_e for the ion temperature T_i (n_i ≈ n_e).

Plasma Parameter

Consider a Debye sphere of radius λd. This sphere contains N_e = 4/3 π λ^3_d n_e electrons. The plasma parameter g is defined as N^{−1}_e . Rewriting this, we can say:

The description of plasma used in many applications in space is applicable when g << 1. In this situation the Debye shielding is significant, and the quasi-neutral plasma obeys collective statistical behavior. The plasma parameter g also correlates with the collision frequency. The collisions in plasma increase with increasing density and decreasing temperature, and if g → 0 the plasma becomes nearly collisionless. The collisionless property helps in making simplifying assumptions about plasma dynamics and serves as the starting point for the adiabatic theory of plasma motions in the Earth's magnetosphere which will be discussed in section 3.3.1.

德拜长度

在准中性等离子体中，由于部分电屏蔽的存在，电荷产生的电势现在呈现为众所周知的德拜形式：

其中 r 是相对于电荷的空间距离，是真空介电常数。电势随着德拜长度标度 λ_d 衰减，在这个长度标度上，能够扰乱准中性的热振动与电荷分离产生的静电力之间达到平衡。德拜长度标度取决于电子温度和等离子体密度。

在上面的方程（3.2）中，德拜长度标度用玻尔兹曼常数 k_b、电子温度 T_e、自由空间介电常数\epsilon_0 和电子电荷 e 来表示。可以想象正离子被一层在 \lambda_d 距离上屏蔽它们的电子云包围。也可以考虑离子的屏蔽效应。有效的德拜长度现在表示为两个项的和：一个用于电子（方程（3.2）），另一个用于离子，通过将 T_e 替换为离子温度 T_i (n_i ≈ n_e)。

等离子体参数

考虑一个半径为 \lambda_d 的德拜球。这个球包含N_e = 4/3 π λ^3_d n_e个电子。等离子体参数 g 定义为 N^{−1}_e。重写这个表达式，我们可以说：

在许多空间应用中使用的等离子体描述适用于 g << 1 的情况。在这种情况下，德拜屏蔽很重要，且准中性等离子体服从集体统计行为。等离子体参数 g 也与碰撞频率相关。等离子体中的碰撞随着密度的增加而增加，随着温度的降低而增加，如果 g -> 0，等离子体几乎变为无碰撞的。无碰撞特性。

3.2 Sun & the Solar Wind

The Sun is an almost perfectly spherical ball of plasma which is the the center of our solar system and the only source of light and energy for all living and meteorological processes on Earth. Apart from terrestrial weather, the Sun is also the primary driver of space weather which results from the interaction between the solar wind and planetary magnetospheres.

太阳是一个几乎完美的等离子体球体，是我们太阳系的中心，也是地球上所有生命和气象过程光和能量的唯一来源。除了地球天气外，太阳还是空间天气的主要驱动力，空间天气由太阳风与行星磁层的相互作用产生。

3.2.1 Structure

Figure 3.1 shows a cross section of the Sun with various layers. We give a brief description of them below.

Core:

The core of the Sun is the site for the thermonuclear fusion reactions which produce its energy. It extends from the center to about 20 − 25% of the solar radius [Garc´ıa et al., 2007]. It has a temperature close to 1.57 × 107 K and a density of 150 g/cm3 [Basu et al., 2009]. Nuclear fusion in the core takes place via the well known proton-proton chain (pp).

Radiative Zone:

The radiative zone extends from 25% to 70% of the solar radius. The nuclear reactions in the core are highly sensitive to temperature and pressure. In fact, they are almost shut off at the edge of the core. In the radiative zone, energy transfer takes place via photons (radiation) which bounce around nuclei until they reach the convective zone.

Convective Zone:

The convective zone lies between 70% of the solar radius to a point close to the solar surface. Density decreases dramatically going from the core to the radiative zone and subsequently the convective zone. In this region, the solar material behaves more like a fluid. Due to the temperature gradient which exists across it, the primary source of transport is here via convection.

Photosphere:

The photosphere is the visible 'surface' of the Sun, since the layers below it are all opaque to visible light. A layer of about 100 km thickness, the photosphere is also the region from where sunlight can freely escape into space. The photospheric surface has a number of features i.e. sunspots, granules and faculae. Sunspots (see section 3.2.3) are magnetic regions where the solar material has a lower temperature compared to its surroundings. Magnetic field lines are concentrated in sunspot regions, and the field strength in sunspots can often be thousands of times stronger than the on the Earth.

Chromosphere:

The Chromosphere extends for a distance of almost 5000 km after the photosphere. The chromosphere is known for the existence of features called spicules and prominences. The chromosphere has a red colour which is generally not visible due to the intense light given off by the photosphere but can be observed through a filter centered on the Hydrogen Hα spectral line.

Solar Transition Region:

A thin (100 km) region between the chromosphere and the solar corona where the temperature rises from about 8000 K to 5 × 105 K, the solar transition region might not be well defined at all altitudes; however its existence is evidenced by a bifurcation of the dynamics of the solar plasma. Below the transition region, the dynamics is dictated by gas pressure, fluid dynamics, and gravitation while above the region, the dynamics is dictated more by magnetic forces.

Corona:

An aura of plasma around the Sun that extends millions of kilometers into space, the corona can be observed during a total solar eclipse (figure 3.2) or with a coronagraph. The temperature of the corona is dramatically higher than the photosphere and chromosphere. The average temperature can range from 1 × 106 K to 2 × 106 K while in the hottest regions it can be as high as 2 × 107 K [Erd´elyi and Ballai, 2007]. Although the reason for this dramatic increase is still not well understood, there exist various explanations using concepts of magnetic reconnection [Russell, 2001, Erd´elyi and Ballai, 2007] and Alfv´en waves [Alfv´en, 1947]. There is a critical height in the corona, known as the source surface, below which the magnetic field controls the plasma completely. Above it the plasma carries the magnetic field with it into the interplanetary medium.

核心：

太阳的核心是热核聚变反应发生的场所，这些反应产生其能量。它从中心延伸到太阳半径的大约20-25% [Garc´ía等人，2007]。核心的温度接近1.57×10^7 K，密度为150 g/cm³ [Basu等人，2009]。核心的核聚变通过众所周知的质子-质子链（pp）进行。

辐射区 ：

辐射区从太阳半径的25%延伸到70%。核心中的核反应对温度和压力非常敏感。实际上，它们在核心边缘几乎被关闭。在辐射区，能量传递通过光子（辐射）进行，这些光子在原子核之间反弹，直到它们到达对流区。

对流区 ：

对流区位于太阳半径的70%到接近太阳表面的某一点之间。从核心到辐射区，再到对流区，密度急剧下降。在这个区域，太阳物质的行为更像流体。由于跨越该区域的温度梯度，这里的主要传输方式是通过对流。

光球层 ：

光球层是太阳的可见"表面"，因为下面的层对可见光都是不透明的。光球层大约100公里厚，也是太阳光可以自由逃逸到太空的区域。光球表面有许多特征，如太阳黑子、米粒组织和光斑。太阳黑子（见第3.2.3节）是磁场区域，其中太阳物质的温度比周围低。磁场线集中在太阳黑子区域，并且太阳黑子中的场强通常比地球上的强数千倍。

色球层 ：

色球层在光球层之后延伸了几乎5000公里。色球层以存在称为针状物和突出物的特征而闻名。色球层呈红色，但由于光球层发出的强烈光线，它通常不可见，但可以通过以氢Hα谱线为中心的滤光片观察到。

太阳过渡区 ：

太阳过渡区是色球层和日冕之间的一个薄（100公里）区域，其中温度从大约8000 K上升到5×10^5 K。太阳过渡区在所有高度上可能都没有明确定义；然而，它的存在由太阳等离子体动力学的分叉所证明。在过渡区以下，动力学由气体压力、流体动力学和重力决定；而在该区域以上，动力学更多地由磁力决定。

日冕：

日冕是太阳周围数百万公里远的等离子体光环，可以在日全食（图3.2）期间或通过日冕仪观察到。日冕的温度远高于光球层和色球层。平均温度范围可以从1×10^6 K到2×10^6 K，而在最热的区域，温度可以高达2×10^7 K [Erd´elyi和Ballai，2007]。尽管这种急剧增加的原因尚不完全清楚，但存在各种解释，使用了磁重联[Russell, 2001, Erd´elyi和Ballai, 2007]和阿尔文波[Alfv´en, 1947]的概念。在日冕中，有一个称为源表面的关键高度，低于该高度，磁场完全控制等离子体。在其上方，等离子体将磁场带入行星际介质中。

3.2.2 Solar Wind & Heliospheric Magnetic Field

The idea that the Sun was ejecting charged particles outwards into space was first hinted at after the solar storm of 1859 by Richard Carrington [Cliver and Dietrich, 2013] and later by George FitzGerald [Meyer-Vernet, 2007]. Arthur Eddington, in a footnote of an article about the comet Morehouse in 1910, was the first to suggest the existence of the solar wind, without naming it so [Durham, 2006].

In the 1950s, studies of the anti-solar orientation of the ion tails of Halley's comet led to the theory of solar corpuscular emission [Biermann, 1952, 1957, 1951]. Parker [1958b, 1960, 1965] argued that the corona cannot remain in hydrostatic equilibrium and that supersonic expansion of the corona is responsible for the outward expulsion of charged particles, which the author referred to as the solar wind. Parker [1958b] also proposed a spiral model for the Heliospheric Magnetic Field (HMF) and suggested that the solar wind carried with it the solar magnetic field. The Parker model was further supported its ability to explain the effect of the HMF on the modulation of galactic cosmic rays and their measured intensities close to the Earth [Parker, 1958a]. In 1959 the Soviet spacecraft Luna 1 was the first to directly observe the solar wind and measure its strength [Harvey, 2007]. Subsequently, the Mariner 2 mission recorded properties of the positive ion component of the solar wind and confirmed the Parker spiral HMF model [Neugebauer and Snyder, 1966].

The structure of the HMF is central to explaining the formation and propagation of the solar wind. The HMF in steady state points radially outward and rotates with the Sun, producing an Archimedean spiral structure as postulated in Parker [1958b] and shown schematically in figure 3.3. Photospheric observations of the magnetic field (see Global Oscillation Network Group https://gong.nso.edu) are often extrapolated to compute approximations to the coronal HMF topology. There exist a number of techniques used to perform such extrapolations: potential field based methods such as Potential-Field Source Surface (PFSS) [Schatten et al., 1969, Altschuler and Newkirk, 1969], PFSS variants such as Potential-Field Current Sheet (PFCS) [Schatten, 1971], Current-Sheet Source Surface (CSSS) [Zhao and Hoeksema, 1995], and several others. Apart from potential based models, there exist more involved techniques based on Magnetohydrodynamics (MHD) such as Magnetohydrodynamics Around a Sphere (MAS) [Linker et al., 1999], ENLIL [Odstrˇcil et al., 1996, Odstrˇcil and Pizzo, 1999a,b, Odstrˇcil, 2003, Odstrˇcil et al., 2004] and EUHFORIA [Pomoell and Poedts, 2018].

The HMF can be seen as a combination of two components: the poloidal magnetic field and the toroidal magnetic field. The two fields often exchange energy between themselves over the course of several years in a cyclical phenomenon known as the solar cycle (section 3.2.3). Interested readers can read Owens and Forsyth [2013] for an in-depth review on the phenomena that drive the HMF.

The expansion of the coronal magnetic field leads to an eventual opening of field lines at the source surface (see figure 3.3) and the ejection of the solar wind. This hot plasma consists mostly of protons, electrons and a small number of helium and heavy ions. The solar wind spirals outwards in all directions, carrying with it the magnetic field. Close to the Earth's magnetosphere, this wind has a nominal speed of about 400 km s−1 while its high speed component has an average velocity of ∼ 700 km s−1 (figure 3.4).

Near Earth Measurements

The solar wind has the heliospheric magnetic field frozen in1 , and as it propagates in the interplanetary medium, it carries the solar magnetic field with it [Alfv´en, 1942, 1943]. Important solar wind quantities such as: 1. solar wind speed, 2. proton density, and 3. magnetic field strength are recorded at the well known L1 Lagrangian point where the gravitational fields of the Earth and the Sun approximately balance out.

3.2.2 太阳风与日球层磁场

1859年太阳风暴之后，理查德·卡林顿（Richard Carrington）首次暗示了太阳正在向外太空喷射带电粒子的观点[Cliver and Dietrich, 2013]，随后乔治·菲茨杰拉德（George FitzGerald）也提出了类似看法[Meyer-Vernet, 2007]。1910年，亚瑟·爱丁顿（Arthur Eddington）在一篇关于莫豪斯彗星（Morehouse comet）的文章的脚注中，首次提出了太阳风的存在，尽管当时并未使用这一名称[Durham, 2006]。

20世纪50年代，对哈雷彗星离子尾反太阳方向的研究导致了太阳日冕发射理论的出现[Biermann, 1952, 1957, 1951]。帕克（Parker）[1958b, 1960, 1965]认为，日冕无法保持静力平衡，其超音速膨胀是导致带电粒子向外喷射的原因，帕克将这一现象称为太阳风。帕克[1958b]还提出了日球层磁场（HMF）的螺旋模型，并指出太阳风携带着太阳磁场。帕克模型进一步通过其解释HMF对银河宇宙射线调制作用及其近地测量强度的能力得到了支持[Parker, 1958a]。1959年，苏联的"月球1号"探测器首次直接观测到了太阳风并测量了其强度[Harvey, 2007]。随后，"水手2号"任务记录了太阳风正离子成分的特性，并证实了帕克的螺旋HMF模型[Neugebauer and Snyder, 1966]。

HMF的结构对于解释太阳风的形成和传播至关重要。在稳定状态下，HMF径向向外延伸并随太阳旋转，形成了帕克[1958b]所假设的阿基米德螺旋结构，如图3.3所示。通常，通过对光球层磁场（参见全球振荡网络组 NSO: Global Oscillation Network Group (GONG) ）的观测进行外推，可以计算日冕HMF拓扑结构的近似值。进行此类外推的技术有多种：基于势场的方法，如势场源表面（PFSS）[Schatten et al., 1969, Altschuler and Newkirk, 1969]；PFSS变体，如势场电流片（PFCS）[Schatten, 1971]、电流片源表面（CSSS）[Zhao and Hoeksema, 1995]等。除了基于势的模型外，还有基于磁流体动力学（MHD）的更复杂技术，如球体周围的磁流体动力学（MAS）[Linker et al., 1999]、ENLIL[Odstrˇcil et al., 1996, Odstrˇcil and Pizzo, 1999a,b, Odstrˇcil, 2003, Odstrˇcil et al., 2004]和EUHFORIA[Pomoell and Poedts, 2018]。

HMF可以视为由两个分量组成：极向磁场和环向磁场。这两个磁场在几年内会周期性地交换能量，这一现象被称为太阳周期（第3.2.3节）。感兴趣的读者可以阅读Owens和Forsyth[2013]的深入综述，了解驱动HMF的现象。

日冕磁场的扩展最终导致源表面（见图3.3）的磁场线开放和太阳风喷射。这种热等离子体主要由质子、电子以及少量的氦和重离子组成。太阳风在所有方向上螺旋向外延伸，并携带着磁场。接近地球磁层时，这股风的名义速度约为400 km/s，而其高速成分的平均速度约为700 km/s（见图3.4）。

近地测量

太阳风携带着冻结在其中的日球层磁场，并在行星际介质中传播时携带太阳磁场[Alfv´en, 1942, 1943]。在地球和太阳的引力场大致平衡的著名L1拉格朗日点，记录了重要的太阳风参数，如：1. 太阳风速度，2. 质子密度，3. 磁场强度。

3.2.3 Sunspots & Solar Cycle

Sunspots are temporarily occurring regions on the Sun's photosphere that appear as dark spots. They are areas of magnetic field concentration where the field lines often 'puncture' the solar surface inhibiting convection and producing regions with lower temperatures than the surroundings. Sunspots generally last anywhere between a few days to a few months. They can occur in pairs or groups and can accompany other phenomena such as coronal loops, prominences, and reconnection events.

Since the 19th century the number of sunspots on the Sun's surface have been recorded as the sunspot number (SSN). Sunspots populations increase and decrease, thereby behaving as markers for solar activity levels. The cyclical behavior of sunspot populations is called the sunspot cycle or solar cycle (figure 3.5).

Figure 3.5 depicts how the area occupied by sunspots changes with solar latitude and time. During the start of a solar cycle (solar minimum), sunspots start appearing at higher latitudes. Over the course of the cycle, they move towards the equatorial regions and their number increases to some maximum (solar maximum). Towards the end, the number of sunspots diminishes and the entire cycle starts over. This repetitive behavior happens over approximately 11 years.

Because sunspots are magnetic phenomena, the solar cycle represents cyclical behavior of the HMF. During solar minimum, the poloidal component of the solar magnetic field is at its strongest and it is the closest it can get to a magnetic dipole configuration. Towards solar maximum, energy is transferred from the poloidal component to the toroidal component, resulting in complex field configurations which are evidenced by larger numbers of sunspot clusters.

The solar cycle also gives rise to variations in solar irradiance [Willson et al., 1981]. Between 1645 and 1715, very few sunspots were observed, a period known as the Maunder minimum. This coincided with lower than average temperatures in Europe, which was called the little ice age. Although the Maunder minimum was a period of lower solar irradiance, recent research [Owens et al., 2017] has demonstrated that this was neither the only factor nor the most significant in causing lower than average temperatures during the little ice age.

In chapter 7, the sunspot number data as well as the flux tube expansion factor (fS or FTE) and the magnetic field strength computed by the CSSS model will be used to create a input data set for building the dynamic time lag regression model proposed therein. Using the input parameters, the DTLR model provides an estimate for the near Earth solar wind speed as well as the propagation time. Measurements of the solar wind speed will also be used in chapters 4 and 5 as inputs to the Dst forecasting models applied therein.

3.2.3 黑子与太阳周期

太阳黑子是太阳光球层上暂时出现的暗斑区域。它们是磁场集中的区域，其中磁场线经常"穿透"太阳表面，抑制对流，并产生比周围区域温度更低的区域。太阳黑子通常持续几天到几个月不等。它们可以成对或成群出现，并可能伴随其他现象，如日冕环、日珥和重联事件。

自19世纪以来，太阳表面上的黑子数量已被记录为黑子数（SSN）。黑子数量会增加和减少，从而作为太阳活动水平的标志。黑子数量的周期性行为被称为黑子周期或太阳周期（图3.5）。

图3.5展示了黑子占据的面积如何随太阳纬度和时间变化。在太阳周期的开始（太阳活动极小期），黑子开始在较高纬度出现。随着周期的进行，它们向赤道区域移动，数量增加到某个最大值（太阳活动极大期）。接近周期结束时，黑子数量减少，整个周期重新开始。这种重复行为大约每11年发生一次。

由于黑子是磁现象，太阳周期代表了地磁场（HMF）的周期性行为。在太阳活动极小期，太阳磁场的极向分量最强，最接近磁偶极子构型。随着太阳活动向极大期发展，能量从极向分量转移到环向分量，导致复杂的磁场构型，这表现为更大数量的黑子群。

太阳周期还会导致太阳辐照度的变化。在1645年至1715年期间，观测到的黑子数量非常少，这一时期被称为蒙德极小期。这与欧洲低于平均水平的温度相吻合，这一时期被称为小冰期。虽然蒙德极小期是太阳辐照度较低的时期，但最近的研究[表明，这既不是导致小冰期温度低于平均水平的唯一因素，也不是最重要的因素。

在第7章中，黑子数数据以及由CSSS模型计算得到的磁通管膨胀因子（fS或FTE）和磁场强度将用于创建动态时间滞后回归模型（DTLR）的输入数据集。使用这些输入参数，DTLR模型可以提供近地太阳风速度以及传播时间的估计。太阳风速度的测量也将在第4章和第5章中用作Dst预报模型的输入。

3.3 Magnetosphere

The Earth's magnetosphere (figure 3.6) is a region surrounding the planet where its magnetic field dominates the interplanetary magnetic field. The Earth's magnetic field shields the atmosphere and terrestrial life from the impact of the solar wind.

As the solar wind approaches the Earth, it is slowed down and deflected by the Earth's magnetic field. Since the solar wind is supersonic when it arrives and slows down to subsonic levels, a shock wave is generated in the process (bow shock). Much of the solar wind kinetic energy is converted to thermal energy when it crosses the bow shock into the magnetosheath. The magnetosheath spans from the bow shock to the magnetopause. The magnetopause is the outer boundary of the Earth's magnetic shield. Its location is ∼ 10R_E (R_E = 6372 km, the radius of the Earth).

Earth's magnetic shielding is not perfect, and some particles manage to get trapped inside the cavity of the magnetosphere. This region of trapped plasma is known as the the van Allen radiation belts. Particles trapped in the radiation belts execute complex motions which can be approximately modelled using ideas from adiabatic theory and diffusion described in section 3.3.1 below. The plasmasphere is the inner region of the radiation belts which contains cold, dense plasma. The portion of the magnetosphere facing away from the Sun (called the nightside) is stretched out in a tail-like shape by the deflected solar wind, hence referred to as the magnetotail. The magnetotail has an approximate extent of up to 1000R_E.

3.3.1 Particle Motions & Adiabatic Theory

This section gives a quick introduction to the theory of charged particle motions in the magnetosphere. The reader may refer to Roederer [1970] for an in depth treatment of this subject. To understand the motions of charged particles in the magnetosphere, the role of electric and magnetic forces must be understood.

It is well known from classical electromagnetism that the force exerted on a particle with charge q by a magnetic field B and an electric field E is given by the well known Lorentz force (equation (3.3)).

The first component of equation (3.3) (qE) is either parallel or opposite to the local electric field depending on the charge of the particle. The second component qv × B involves a vector cross product so it is always perpendicular to the plane spanned by vectors v and B . In order to understand its effects, we can decompose the particle velocity in two components; v_{||} parallel to B and v⊥ perpendicular to B. If E = 0, then the particle executes a circular motion with properties shown in equation (3.4). Here ρ is the gyroradius and ω is the gyrofrequency or cyclotron frequency. In the case where , the trajectory is helical.

Apart from the gyro motion, there are some important drift forces that significantly influence particle motions.

• Electric field drift: If E has a component E_⊥ perpendicular to B , the electric field accelerates and decelerates the particle in the two hemispheres of the orbit. The orbit becomes a distorted circle, and the particle drifts in a direction perpendicular to the electric field with a velocity v_d = E × B/B^2 .

• Magnetic gradient drift: When the magnetic field varies in space (as is the case of the Earth), a gradient in the field strength in the direction perpendicular to B gives rise to a gradient drift velocity given by v g = 1/2 m v^2⊥ B × ∇B/aB^3.

• Magnetic curvature drift: If the magnetic field has a curvature, this creates an additional drift motion with velocity v c = mv{||} B ×(b ˆ·∇)b ˆ qB2 (b = B/B ).

The equations of motion for charged particles in the general case of spatially varying electric and magnetic fields do not admit closed-form solutions. The motions are generally complex and require lengthy numerical integrations to be resolved.

The guiding center approximation helps us to decompose particle motions into three periodic components (figure 3.7): 1. gyration around magnetic field lines, 2. bounce motions between magnetic north and south poles, and 3. equatorial drift of electrons and protons, each with its own time scale.

Adiabatic Invariants

When a physical system with periodic motion is varied slowly as compared to the time period of its periodicity, the transformation can be characterized as adiabatic. Formally speaking, for systems which are described by Hamiltonian dynamics, we can write the equations of motion in terms of the canonical position q, the canonical momentum p, external parameters θ, and the system's Hamiltonian H(q, p|θ)：

If the system shown in equation (3.5) executes a periodic motion in the q, p phase space, it admits an adiabatic invariant A given in equation (3.6).

The quantity A would remain approximately constant if the external parameters θ were varied adiabatically (i.e. if changes in θ happen over a time period much greater than the period of oscillation of the system).

Applying the idea of adiabatic invariance to charged particle motion in the magnetosphere, it is possible to associate one adiabatic invariant with each periodic motion i.e. gyromotion M, bounce J, and drift Φ (equation (3.7)).

The first invariant M is associated with the Larmor gyration - it is the magnetic moment of the current generated by the circular motion of the particle around the field line.

The second invariant J is associated with the bounce motion between the two magnetic mirrors near the north and south poles (the quantity s is an appropriately chosen arc length coordinate along the bounce trajectory). The bounce motion between the magnetic poles can be explained by the conservation of the particle energy and the first invariant M. Because field strength |B| increases near the poles, v⊥ also increases to conserve M; however, to conserve energy, v_{||} decreases until the particle can no longer move farther along the field line (and bounces back).

The third invariant Φ, associated with equatorial drift motion, is actually the magnetic flux through the barrel shape envelope of the particle drift. A particle's guiding magnetic field line can be identified by its radial position r and its longitude . The magnetic flux of the drift can then be computed by integrating over .

Associated with each adiabatic invariant is a timescale which determines how easily its conservation can be violated. The timescales for M, J, and Φ are the time periods of the gyromotion, bounce motion, and equatorial drift motion respectively. Since it takes much a longer time for the particles to complete a drift motion around the Earth as compared to bounce and gyromotion (in that order), the invariance of Φ is most easily violated - a fact which is used in the simplification of the Fokker-Planck diffusion system described below.

Plasma Diffusion

Because we consider populations of charged particles, it is natural to employ some kind of distribution based picture for magnetospheric plasma. The adiabatic invariants give us a phase space or coordinate system by which we can express quantities of interest.

The main quantity of interest in this case is the phase space density f(t,M, J, Φ) which is a function of time and three invariants. The phase space density tells us the number of particles in a particular region of the phase space, and at a particular point of time. Diffusion behavior arises when one or more of the invariants are violated, which can happen due to a number of reasons such as: 1. non-adiabatic variations of the magnetic field, 2. external forces, 3. interaction with electromagnetic waves, and 4. collisions with atmosphere/ionosphere. The plasma diffusion system [Schulz and Lanzerotti, 1974] can be written as a generalized Fokker-Planck system as shown in equation (3.10).

It is possible to simplify this system by considering the two main categories of diffusion: radial diffusion and pitch angle diffusion. Radial diffusion allows particles to move farther or closer to the Earth, and pitch angle diffusion moves the magnetic mirror points along the field lines.

Rewriting Φ ∝ 1/l, the third invariant can be expressed in terms of the drift shell l (larger value of l implies greater distance from the Earth). The radial diffusion system can be obtained from equation (3.10) by keeping M and J fixed, considering diffusion in ` (violation of Φ invariance), and by approximating pitch angle diffusion as a loss process [Walt, 1970, Roederer, 1970]. The resulting system is shown in equation (3.14).

The first term on the right hand side, , models diffusive phenomena in Φ but is expressed in the drift shell coordinate `. Pitch angle diffusion is approximated using a loss process λ(l, t)f, where λ(l, t) is the loss rate. As an alternative it is also possible to express the loss rate as a loss time scale τ (l, t) = 1/λ(l,t) , but in this thesis we will use the former convention.

The radial diffusion system in equation (6.1) is the starting point for chapter 6 where a surrogate model of the phase space density ˆf is built to perform Bayesian inference over the parameters of the diffusion coefficient κ and loss rate λ.

3.3 磁层

地球的磁层（图3.6）是环绕地球的一个区域，其磁场在此区域中占据主导地位，超越了行星际磁场。地球的磁场保护着大气层和陆地生物免受太阳风的影响。

当太阳风接近地球时，它会被地球的磁场减速并偏转。由于太阳风到达时是超音速的，并减速到亚音速水平，因此在此过程中会产生冲击波（弓形激波）。当太阳风穿过弓形激波进入磁鞘时，其大部分动能会转化为热能。磁鞘从弓形激波延伸到磁层顶。磁层顶是地球磁盾的外边界，其位置大约在地球半径的10倍（R_E=6372公里，即地球半径）处。

地球的磁屏蔽并非完美无缺，一些粒子会设法被捕获在磁层的空腔内。这个被捕获的等离子体区域被称为范艾伦辐射带。辐射带中被捕获的粒子执行复杂的运动，这些运动可以大致使用下文3.3.1节中描述的绝热理论和扩散理论的思想进行建模。等离子体层是辐射带的内层区域，包含冷而密集的等离子体。磁层中背对太阳的部分（称为夜侧）被偏转的太阳风拉伸成尾状，因此被称为磁尾。磁尾的大致范围可达地球半径的1000倍。

3.3.1 粒子运动与绝热理论

本节简要介绍了磁层中带电粒子运动的理论。读者可参阅Roederer [1970] 以深入了解这一主题。为了理解磁层中带电粒子的运动，必须了解电场和磁场力的作用。

根据经典电磁学的知识，带电粒子q在磁场B和电场E中受到的力由著名的洛伦兹力（方程（3.3））给出。

方程（3.3）的第一部分（qE）要么与局部电场平行，要么与局部电场方向相反，这取决于粒子的电荷。第二部分qv × B涉及向量叉积，因此它总是垂直于由向量v和B张成的平面。为了理解其影响，我们可以将粒子速度分解为两个分量：v_{||}（平行于B）和v⊥（垂直于B）。如果E=0，则粒子将执行具有方程（3.4）所示性质的圆周运动。这里的ρ是回旋半径，ω是回旋频率或回旋频率。在v_{||} ≠ 0的情况下，轨迹是螺旋形的。

除了回旋运动外，还有一些重要的漂移力对粒子运动产生显著影响。

电场漂移：如果E有垂直于B的分量E_⊥，则电场会在轨道的两个半球中加速和减速粒子。轨道会变成扭曲的圆形，粒子会以速度v_d = E × B/B^2在垂直于电场的方向上漂移。
磁场梯度漂移：当磁场在空间中变化时（如地球的情况），在垂直于B的方向上，场强的梯度会产生梯度漂移速度，由v_g = 1/2 mv2_⊥ B × ∇B/aB3给出。
磁场曲率漂移：如果磁场具有曲率，这会产生一个额外的漂移运动，其速度为v_c = mv_{||} B×(bˆ·∇)bˆ/qB^2（其中b = B/B）。

在空间变化的电场和磁场的一般情况下，带电粒子的运动方程不允许封闭形式的解。这些运动通常很复杂，需要长时间的数值积分来解决。

引导中心近似有助于我们将粒子运动分解为三个周期性分量（图3.7）：1. 绕磁场线的回旋运动；2. 在磁北极和南极之间的弹跳运动；3. 电子和质子的赤道漂移，每种运动都有其自己的时间尺度。

绝热不变量

当与周期性运动的物理系统相比，其变化非常缓慢时，这种变化可以描述为绝热的。正式地说，对于由哈密顿动力学描述的系统，我们可以使用正则位置q、正则动量p、外部参数θ以及系统的哈密顿量H(q, p|θ)来写出运动方程：

如果方程（3.5）所示的系统在q，p相空间中执行周期性运动，则它承认一个由方程（3.6）给出的绝热不变量A。

如果外部参数θ是绝热变化的（即θ的变化发生在一个远大于系统振荡周期的时间段内），则量A将保持大致恒定。

将绝热不变量的概念应用于磁层中带电粒子的运动，我们可以将每个周期性运动（即回旋运动M、弹跳运动J和漂移运动Φ，方程（3.7））与一个绝热不变量相关联。

第一个不变量M与拉莫尔回旋运动相关，它是粒子绕磁场线做圆周运动所产生的电流的磁矩。

第二个不变量J与北极和南极附近两个磁镜之间的弹跳运动相关（s是沿弹跳轨迹适当选择的弧长坐标）。磁极之间的弹跳运动可以通过粒子能量的守恒和第一个不变量M的守恒来解释。因为磁场强度|B|在极点附近增加，为了保持M守恒，v⊥（垂直速度）也会增加；然而，为了保持能量守恒，v_{||}（平行速度）会减小，直到粒子无法再沿着磁场线进一步移动（然后弹回）。

第三个不变量Φ与赤道漂移运动相关，它实际上是粒子漂移的桶形包络中的磁通量。可以通过对粒子引导磁场线的径向位置r和经度进行积分来计算漂移的磁通量。

与每个绝热不变量相关联的是一个时间尺度，它决定了其守恒性可以被破坏的难易程度。M、J和Φ的时间尺度分别是回旋运动、弹跳运动和赤道漂移运动的时间周期。由于粒子完成绕地球的漂移运动所需的时间比弹跳运动和回旋运动长得多（按此顺序），因此Φ的不变性最容易被破坏------这一事实用于简化下文所述的福克-普朗克扩散系统。

等离子体扩散

由于我们考虑的是带电粒子群，因此自然而然地会采用某种基于分布的图景来描述磁层等离子体。绝热不变量为我们提供了一个相空间或坐标系，通过它可以表达我们感兴趣的量。

在这种情况下，我们主要关注的量是相空间密度f(t,M,J,Φ)，它是时间和三个不变量的函数。相空间密度告诉我们在相空间的特定区域和特定时间点上粒子的数量。当一个或多个不变量被破坏时，就会出现扩散行为，这可能是由于多种原因造成的，如：1. 磁场的非绝热变化；2. 外力；3. 与电磁波的相互作用；4. 与大气层/电离层的碰撞。等离子体扩散系统[Schulz 和 Lanzerotti, 1974]可以写成如方程（3.10）所示的广义福克-普朗克系统。

通过考虑扩散的两个主要类别：径向扩散和俯仰角扩散，可以简化这个系统。径向扩散允许粒子向地球更远或更近地移动，而俯仰角扩散则沿着磁场线移动磁镜点。

将Φ重新写为Φ ∝ 1/l，第三个不变量可以用漂移壳l来表示（l的较大值意味着离地球更远）。通过保持M和J不变，考虑Φ不变性的破坏（即Φ的扩散），并将俯仰角扩散近似为损失过程[Walt, 1970; Roederer, 1970]，可以从方程（3.10）中得到径向扩散系统。所得系统如方程（3.14）所示。

方程右侧的第一项，，模拟了Φ中的扩散现象，但它是用漂移壳坐标`来表示的。俯仰角扩散使用损失过程λ(l,t)f来近似，其中λ(l,t)是损失率。作为替代，也可以将损失率表示为损失时间尺度τ(l,t) = 1/λ(l,t)，但在本论文中，我们将使用前者。

方程（6.1）中的径向扩散系统是第6章的起点，在该章中，我们构建了一个相空间密度ˆf的替代模型，以对扩散系数κ和损失率λ的参数进行贝叶斯推断。

3.3.2 Current Systems & Geomagnetic Indices

As was noted earlier, the solar wind is largely deflected by the Earth's magnetic field but some particles still leak into the magnetosphere. This particle injection is governed by the interaction between the magnetic field carried by the solar wind and the Earth's magnetic field, also known as solar wind - magnetosphere coupling. It plays an important role in determining space weather conditions in the Earth's vicinity.

Solar wind plasma gets trapped in the Earth's magnetic field at a rate that is modulated by the solar wind - magnetosphere coupling. The drift motions of charged particles in the magnetosphere as discussed in section 3.3 lead to many current systems. The prominent current systems (pictured in figure 3.6) are 1. the ring current, 2. field aligned current, 3. tail current, and 4. magnetopause current. These current systems induce magnetic fields that interact with the Earth's magnetic field and mutate it. Weakening of the Earth's magnetic field strength due to strong ring currents leads to geomagnetic storm conditions which can have adverse impacts on orbiting satellites and ground based infrastructure.

For the purposes of space weather monitoring and forecasting, the state of the magnetosphere and geomagnetic phenomena are often represented by proxies known as geomagnetic indices. Geomagnetic indices give us the ability to summarize the state of the magnetosphere in terse framework. They are often calculated by averaging several ground based measurements of magnetic fluctuations, generally at a cadence of a few hours.

Chapter 4 gives a brief introduction to the popular geomagnetic indices and formulates gaussian process models for producing probabilistic one hour ahead forecasts of the Dst index. In chapter 5, we augment the Dst model from chapter 4 with long short-term memory (LSTM) networks and obtain five hour ahead forecasts of the Dst.

3.3.2 电流系统与地磁指数

如前所述，地球磁场在很大程度上会偏转太阳风，但仍有一些粒子会漏入磁层。这种粒子注入受太阳风携带的磁场与地球磁场（也称为太阳风-磁层耦合）之间相互作用的控制，它在决定地球附近的空间天气条件方面起着重要作用。

太阳风等离子体以受太阳风-磁层耦合调节的速率被地球磁场捕获。第3.3节中讨论的磁层中带电粒子的漂移运动导致了多个电流系统的形成。主要的电流系统（如图3.6所示）包括：1. 环电流；2. 场向电流；3. 尾电流；4. 磁层顶电流。这些电流系统产生的磁场与地球磁场相互作用并改变其状态。由于强烈的环电流导致地球磁场强度减弱，会引发地磁暴条件，从而对绕地卫星和地面基础设施产生不利影响。

为了进行空间天气监测和预报，磁层和地磁现象的状态通常由称为地磁指数的代理变量来表示。地磁指数使我们能够在简洁的框架内概括磁层的状态。它们通常通过对多个地面磁波动测量值进行平均计算得出，通常每隔几小时计算一次。

第4章简要介绍了流行的地磁指数，并制定了高斯过程模型，以产生Dst指数的一小时前概率预报。第5章在第4章Dst模型的基础上增加了长短期记忆（LSTM）网络，以实现对Dst的五小时前预报。