Date: February 24, 2025
Topics Covered: Multi-stage integration methods (Runge-Kutta schemes), linear stability analysis, Störmer-Verlet symplectic integrator
This lecture marks a critical transition in our course: we now have all the tools needed to build sophisticated physics-informed neural architectures. We've learned to derive energy-preserving ODEs from least action principles, compute Hamiltonians via Legendre transforms, integrate using discrete gradients, solve for nonlinear stencils with PyTorch, and impose conservation constraints through polynomial reproduction and Noether's theorem. With these foundational techniques in place, we're ready to tackle our ultimate objective: constructing a nonlinear wave equation solver.
The focus today shifts to the crucial question of how to integrate these systems in time. We begin by examining multi-stage integration schemes, particularly the Runge-Kutta (RK) family of methods. These explicit methods are especially important for machine learning applications where gradient evaluations drive the optimization process. We'll see how different RK schemes achieve various orders of accuracy and, critically, how to analyze their stability regions for systems with complex eigenvalues—a key consideration when solving hyperbolic PDEs like the wave equation.
Finally, we introduce the Störmer-Verlet (leapfrog) integrator, a symplectic method that exactly preserves the Hamiltonian structure of our dynamics. This conservation property makes it ideal for long-time integration of energy-conserving systems derived from variational principles. The connection between our discrete gradient methods and symplectic integration provides a beautiful unification of the variational and Hamiltonian perspectives.
Let's recall what we now know how to do:

- Derive energy-preserving ODEs from least action principles,
- Compute Hamiltonians via the Legendre transform,
- Integrate using discrete gradients,
- Solve for nonlinear stencils with PyTorch,
- Impose conservation constraints through polynomial reproduction and Noether's theorem.
With these in hand, we are ready to assume our final form: a nonlinear wave equation solver!
Consider the wave equation:
$$\begin{cases} \partial_{tt} u = c^2 \partial_{xx} u \\ u(x, t=0) = f(x), \quad \partial_t u(x, t=0) = 0 \end{cases}$$For these initial data, d'Alembert's formula gives the solution as two counter-propagating copies of the profile:
$$u(x,t) = \frac{1}{2}\left[f(x + ct) + f(x - ct)\right]$$Goal: Derive a neural architecture that can recover this simple case.
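As a quick sanity check (an illustrative sketch, assuming a Gaussian profile for $f$; not lecture code), we can verify by finite differences that a sum of left- and right-moving copies of $f$ satisfies $\partial_{tt} u = c^2 \partial_{xx} u$:

```python
import numpy as np

# Verify that u(x, t) = f(x + c t) + f(x - c t) satisfies the wave equation
# u_tt = c^2 u_xx, using central finite differences at a sample point.
c = 1.5
f = lambda x: np.exp(-x**2)                      # illustrative smooth profile
u = lambda x, t: f(x + c * t) + f(x - c * t)

x, t, eps = 0.3, 0.7, 1e-4
u_tt = (u(x, t + eps) - 2 * u(x, t) + u(x, t - eps)) / eps**2
u_xx = (u(x + eps, t) - 2 * u(x, t) + u(x - eps, t)) / eps**2
residual = abs(u_tt - c**2 * u_xx)
print(residual)  # ~0 up to finite-difference error
```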
Important Note: In a previous class, we showed that in the absence of the right-most node $(D_+ u_N^2 = D_+^2 u_N)$, the Lagrangian gives the approximation:
$$\nabla_h^2 \approx D_+ D_-$$Our goal is to be as expressive as possible while satisfying conservation constraints.
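Numerically, $D_+ D_-$ is the standard three-point second difference. A quick sketch (illustrative code, not from the lecture) confirming it approximates $\partial_{xx}$ to second order on a smooth function:

```python
import numpy as np

# D_+ D_- applied to grid values gives (u_{i-1} - 2 u_i + u_{i+1}) / h^2,
# a 2nd-order approximation of the 1D Laplacian at interior nodes.
def laplacian_dp_dm(u, h):
    du = np.diff(u) / h          # backward differences D_- u
    return np.diff(du) / h       # forward difference D_+ of those

h = 0.01
x = np.arange(-1, 1 + h / 2, h)
u = np.sin(np.pi * x)
approx = laplacian_dp_dm(u, h)                 # values at x[1:-1]
exact = -np.pi**2 * np.sin(np.pi * x[1:-1])
err = np.max(np.abs(approx - exact))
print(err)  # O(h^2), roughly 1e-3 here
```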
We hypothesize the discrete action:
$$S_h[\theta] = \sum_i \frac{1}{2} \dot{q}_i^2 h - \frac{1}{2} N(D_- q_i; \theta)^2 h$$where:

- $q_i(t) \approx u(x_i, t)$ are the nodal values,
- $D_-$ is the backward difference operator,
- $N(\cdot\,; \theta)$ is the learnable nonlinear stencil,
- $h$ is the grid spacing.
Key Properties:
Euler-Lagrange equation:
$$\boxed{\ddot{q}_i = D_+ \nabla N(D_- q; \theta)}$$This is our learnable wave equation in discrete form.
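To make the discrete form concrete, here is a sketch (illustrative names, not lecture code) of the right-hand side for the linear choice $N(v; \theta) = c\,v$, where the gradient of the potential density $\frac{1}{2}N(v)^2$ is $c^2 v$, so the equation reduces to the standard discrete wave operator $c^2 D_+ D_- q$:

```python
import numpy as np

# Sketch: qdd = D_+ [ grad of (1/2) N(D_- q)^2 ] with N(v) = c * v,
# so the gradient is c^2 v and the RHS reduces to c^2 D_+ D_- q.
def rhs(q, h, c):
    v = np.diff(q) / h           # D_- q
    grad = c**2 * v              # d/dv [ (1/2)(c v)^2 ]
    return np.diff(grad) / h     # D_+ of the gradient

h, c = 0.1, 2.0
q = np.sin(np.linspace(0, 2 * np.pi, 40))
out = rhs(q, h, c)
second_diff = (q[:-2] - 2 * q[1:-1] + q[2:]) / h**2
print(np.allclose(out, c**2 * second_diff))  # linear case = discrete wave eq.
```

A trainable $N$ would replace the linear map while keeping the same $D_+ (\cdot) \circ D_-$ structure, which is what preserves the variational form.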
We can identify the generalized momentum as:
$$p_i(t) = \frac{\partial L}{\partial \dot{q}_i} = \dot{q}_i h$$Applying the Legendre transform:
$$\boxed{H = \underbrace{\sum_i \frac{1}{2} p_i^2 h^{-1}}_{T_\theta(p) \text{ (kinetic)}} + \underbrace{\frac{1}{2} \sum_i N(D_- q; \theta)^2 h}_{V_\theta(q) \text{ (potential)}}}$$We want to solve:
$$\begin{aligned} \frac{dp}{dt} &= -\partial_q V_\theta(q) \\ \frac{dq}{dt} &= \partial_p T_\theta(p) \end{aligned}$$These are the canonical Hamiltonian equations for our learnable wave system.
Some final remarks on time integration:
To choose an integration scheme (so far we've only looked at explicit Euler), we need to understand stability.
Consider a linear system of ODEs:
$$\dot{y} = Ay$$Critical observation: Depending on the eigenvalues $\lambda_i$ of $A$, the ODE will have distinct character:

- $\text{Re}(\lambda_i) < 0$: solutions decay (dissipative dynamics),
- $\text{Re}(\lambda_i) > 0$: solutions grow without bound,
- $\text{Re}(\lambda_i) = 0$ (purely imaginary): solutions oscillate with constant amplitude (the wave-equation case).
Important Note: Why is explicit integration important for ML applications?
In machine learning contexts, we need to evaluate gradients with respect to both state variables and parameters $\theta$. Implicit methods require solving nonlinear systems at each timestep, which is computationally expensive and complicates backpropagation. Explicit methods allow straightforward gradient flow through time integration.
Consider now the nonlinear system: $\ddot{x} = F(x; \theta)$
There are two broad classes of explicit methods:

Multi-stage methods: At each stage, you can make an additional gradient evaluation using points generated in previous stages (the Runge-Kutta family).

Multi-step methods: Use information about the derivative from previous timesteps to predict the next state. Writing such a scheme generically as $\sum_{j \geq 0} a_j x_{n+1-j} = h \sum_{j \geq 0} b_j F(x_{n+1-j})$ with the normalization $a_0 = 1$, the scheme is explicit if $b_0 = 0$, since the unknown $x_{n+1}$ then appears only on the left-hand side.

For simplicity, we'll just use multi-stage methods.
The one-stage scheme, $x_{n+1} = x_n + h F(t_n, x_n)$, is our familiar explicit Euler method.
After the explicit Euler (EE) evaluation in stage 1, add a second stage:
$$\begin{aligned} k_1 &= F(t_n, x_n) \\ k_2 &= F(t_n + \alpha h, x_n + \beta h k_1) \\ x_{n+1} &= x_n + h(a k_1 + b k_2) \end{aligned}$$To choose the coefficients, expand both the scheme and the exact solution in Taylor series and match terms through $\mathcal{O}(h^2)$:
Coefficients must satisfy:
$$\begin{aligned} a + b &= 1 \\ \alpha b &= 1/2 \\ \beta b &= 1/2 \end{aligned}$$Result: Non-unique! Multiple second-order RK schemes exist.
Example: RK2 (Heun's method). Take $a = b = 1/2$; then $\alpha = \beta = 1$:
$$\begin{aligned} k_1 &= F(t_n, x_n) \\ k_2 &= F(t_n + h, x_n + h k_1) \\ x_{n+1} &= x_n + \frac{h}{2}(k_1 + k_2) \end{aligned}$$For the general $s$-stage scheme, coefficients are written compactly as a Butcher tableau:
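As a sanity check (an illustrative sketch), this RK2 scheme applied to $\dot{y} = -y$ exhibits second-order convergence: halving $h$ should reduce the error at $t = 1$ by roughly a factor of 4:

```python
import math

# Heun's RK2 from the equations above, applied to y' = -y, y(0) = 1.
def heun(F, y0, h, n_steps):
    y = y0
    for _ in range(n_steps):
        k1 = F(y)
        k2 = F(y + h * k1)
        y = y + 0.5 * h * (k1 + k2)
    return y

F = lambda y: -y
exact = math.exp(-1.0)                          # true solution at t = 1
err_h  = abs(heun(F, 1.0, 0.01, 100)  - exact)  # step size h
err_h2 = abs(heun(F, 1.0, 0.005, 200) - exact)  # step size h / 2
print(err_h / err_h2)  # ~4, confirming second order
```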
| $c_1$ | $a_{11}$ | $a_{12}$ | $\cdots$ | $a_{1s}$ |
| $c_2$ | $a_{21}$ | $a_{22}$ | $\cdots$ | $a_{2s}$ |
| $\vdots$ | $\vdots$ | $\vdots$ | $\ddots$ | $\vdots$ |
| $c_s$ | $a_{s1}$ | $a_{s2}$ | $\cdots$ | $a_{ss}$ |
| $b_1$ | $b_2$ | $\cdots$ | $b_s$ |
For explicit methods, $a_{ij} = 0$ for $j \geq i$ (strictly lower triangular, i.e., zero diagonal).
The classical fourth-order scheme (RK4) has the tableau:

| 0 | 0 | 0 | 0 | 0 |
| 1/2 | 1/2 | 0 | 0 | 0 |
| 1/2 | 0 | 1/2 | 0 | 0 |
| 1 | 0 | 0 | 1 | 0 |
| 1/6 | 1/3 | 1/3 | 1/6 |
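The tableau translates directly into code. Below is a sketch (illustrative, not lecture code) of a generic explicit RK step driven by $(A, b, c)$, instantiated with the RK4 tableau above; one step on $\dot{y} = y$ should match $e^h$ to $\mathcal{O}(h^5)$:

```python
import math

# Generic explicit Runge-Kutta step from a Butcher tableau (A, b, c).
def rk_step(F, t, x, h, A, b, c):
    s = len(b)
    k = [None] * s
    for i in range(s):
        xi = x + h * sum(A[i][j] * k[j] for j in range(i))  # explicit: j < i
        k[i] = F(t + c[i] * h, xi)
    return x + h * sum(bi * ki for bi, ki in zip(b, k))

# Classical RK4 tableau.
A = [[0, 0, 0, 0],
     [1/2, 0, 0, 0],
     [0, 1/2, 0, 0],
     [0, 0, 1, 0]]
b = [1/6, 1/3, 1/3, 1/6]
c = [0, 1/2, 1/2, 1]

F = lambda t, y: y
h = 0.1
y1 = rk_step(F, 0.0, 1.0, h, A, b, c)
print(abs(y1 - math.exp(h)))  # ~h^5 local error
```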
To understand why this is the default, we need to analyze stability. We'll see that RK4 works for problems with purely imaginary eigenvalues (hyperbolic PDEs like the wave equation).
Consider again $\dot{y} = Ay$. Let's analyze RK2:
Step 1: Write out the RK2 scheme:
$$\begin{aligned} y_{n+1} &= y_n + \frac{h}{2}(k_1 + k_2) \\ k_1 &= A y_n \\ k_2 &= A(y_n + h k_1) = A y_n + h A^2 y_n \end{aligned}$$Step 2: Substitute:
$$y_{n+1} = \left(I + hA + \frac{h^2}{2} A^2\right) y_n$$Step 3: Define amplification matrix:
$$y_{n+1} = Q y_n \quad \text{where} \quad Q = I + hA + \frac{h^2}{2} A^2$$Note: What drives the choices of coefficients in RK is that $Q \approx \exp(hA)$ (Taylor expansion of matrix exponential).
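This amplification matrix is easy to verify numerically. A short sketch (with arbitrary illustrative data) checking that one RK2 step on $\dot{y} = Ay$ equals multiplication by $Q$:

```python
import numpy as np

# One RK2 step on ydot = A y should be exactly y1 = Q y0 with
# Q = I + h A + (h^2 / 2) A^2.
rng = np.random.default_rng(0)
A = rng.standard_normal((3, 3))     # arbitrary test matrix
y0 = rng.standard_normal(3)
h = 0.05

k1 = A @ y0
k2 = A @ (y0 + h * k1)
y1_rk2 = y0 + 0.5 * h * (k1 + k2)

Q = np.eye(3) + h * A + 0.5 * h**2 * (A @ A)
print(np.allclose(y1_rk2, Q @ y0))  # True
```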
Consider an arbitrary scheme where $Q$ is diagonalizable:
$$y_n = Q^n y_0$$If $Q = S \Lambda S^{-1}$ where $\Lambda = \text{diag}(\lambda_1, \ldots, \lambda_N)$, then in the eigenbasis coordinates $w_n = S^{-1} y_n$, component-wise:
$$w_{n,i} = \lambda_i^n w_{0,i}$$Stability condition: The solution is bounded if:
$$\max_i |\lambda_i| \leq 1$$where $\lambda_i$ is the $i$-th eigenvalue of $Q$.
Recall definition: The spectral radius is:
$$\rho(A) = \max\{|\lambda_i| : \lambda_i \text{ is an eigenvalue of } A\}$$Since $Q$ is a polynomial in $A$, each eigenvalue of $Q$ is $g(h\lambda_i)$ with $\lambda_i$ an eigenvalue of $A$ and $g(z) = 1 + z + \frac{1}{2}z^2$. The condition $\rho(Q) \leq 1$ therefore becomes: for every eigenvalue, set $z = h\lambda_i \in \mathbb{C}$ and require
$$|g(z)| = \left| 1 + z + \frac{1}{2} z^2 \right| \leq 1$$The set of all such $z$ forms the stability region in the complex plane.
RK1 (Explicit Euler): Disk of radius 1 centered at $(-1, 0)$ in the complex plane.
RK2: Enlarges the stability region overall, but along the imaginary axis $|g(iy)|^2 = 1 + y^4/4 > 1$ for $y \neq 0$, so RK2 remains (weakly) unstable for purely oscillatory dynamics.
RK4: The standard RK4 stability region includes the imaginary interval $|z| \leq 2\sqrt{2} \approx 2.83$, the largest along the imaginary axis among the classical explicit methods of its order, which is why it's the default choice for Hamiltonian/wave-like systems.
Critical observation: For the wave equation (purely imaginary eigenvalues), we need stability regions that extend along the imaginary axis. RK4 provides the best balance of accuracy and stability for such problems.
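These claims can be checked by evaluating each scheme's stability polynomial $g(z)$ at points $z = h\lambda$ on the imaginary axis (a quick illustrative sketch):

```python
# Stability polynomials of RK1 (explicit Euler), RK2, and RK4.
g1 = lambda z: 1 + z
g2 = lambda z: 1 + z + z**2 / 2
g4 = lambda z: 1 + z + z**2 / 2 + z**3 / 6 + z**4 / 24

print(abs(g1(1j)))  # > 1: explicit Euler unstable on the imaginary axis
print(abs(g2(1j)))  # > 1: RK2 also (weakly) unstable there
print(abs(g4(2j)))  # < 1: RK4 stable up to |z| = 2*sqrt(2) ~ 2.83
print(abs(g4(3j)))  # > 1: beyond that, RK4 is unstable too
```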
Recall that for a canonical Hamiltonian system:
$$\begin{aligned} \dot{p} &= -\partial_q H \\ \dot{q} &= \partial_p H \end{aligned}$$with $\frac{dH}{dt} = 0$ (energy conservation).
Assuming the decomposition:
$$H = \underbrace{T(p)}_{\text{kinetic}} + \underbrace{V(q)}_{\text{potential}}$$Then we want to solve:
$$\begin{aligned} \dot{p} &= -\partial_q V \\ \dot{q} &= \partial_p T \end{aligned}$$Note that this splitting applies only to separable Hamiltonians ($H = T(p) + V(q)$) and assumes no dissipation.
The Störmer-Verlet (also called leapfrog) integrator is a second-order symplectic method that exactly preserves the symplectic structure of Hamiltonian mechanics.
Algorithm:
Step 1: Half-step momentum update:
$$p_{n+1/2} = p_n - \frac{h}{2} \partial_q V(q_n)$$Step 2: Full-step position update:
$$q_{n+1} = q_n + h \partial_p T(p_{n+1/2})$$Step 3: Half-step momentum update:
$$p_{n+1} = p_{n+1/2} - \frac{h}{2} \partial_q V(q_{n+1})$$Key Properties:

- Second-order accurate,
- Explicit for separable Hamiltonians, with effectively one gradient of $V$ per step (the final half-step gradient is reused in the next step),
- Symplectic and time-reversible: it exactly preserves phase-space volume and keeps the energy error bounded over long times.
For kinetic energy $T(p) = \frac{1}{2m} p^2$, we have $\partial_p T = p/m$, so the position update simplifies to:
$$q_{n+1} = q_n + \frac{h}{m} p_{n+1/2}$$Important Note: The leapfrog structure (alternating half-steps) is what gives the method its symplectic property. This makes it ideal for long-time integration of conservative systems like our wave equation.
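As an illustration (a minimal sketch with $m = 1$ and $V(q) = q^2/2$, not lecture code), Störmer-Verlet applied to the harmonic oscillator keeps the energy error small and bounded over many periods rather than drifting:

```python
# Störmer-Verlet for H = p^2/2 + q^2/2, following the three steps above.
dV = lambda q: q          # partial_q V for V = q^2 / 2
dT = lambda p: p          # partial_p T for T = p^2 / 2

q, p, h = 1.0, 0.0, 0.1
H0 = 0.5 * p**2 + 0.5 * q**2
max_drift = 0.0
for _ in range(10_000):
    p_half = p - 0.5 * h * dV(q)      # half-step momentum
    q = q + h * dT(p_half)            # full-step position
    p = p_half - 0.5 * h * dV(q)      # half-step momentum
    max_drift = max(max_drift, abs(0.5 * p**2 + 0.5 * q**2 - H0))
print(max_drift)  # small and bounded: no secular energy growth
```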
This lecture covered: multi-stage integration methods (the Runge-Kutta family and their order conditions), linear stability analysis via amplification matrices and stability regions, and the Störmer-Verlet symplectic integrator for separable Hamiltonian systems.
Key Takeaway: For energy-conserving systems derived from variational principles, symplectic integrators like Störmer-Verlet provide exact conservation of phase space structure and bounded long-time energy behavior. Combined with learnable nonlinear stencils, these methods enable principled construction of physics-informed neural architectures for PDEs with guaranteed geometric properties. The choice of time integrator—whether high-order RK for accuracy or symplectic methods for conservation—depends critically on the eigenvalue structure and conservation requirements of the target system.