
The universe is governed by partial differential equations, but translating these elegant laws into solutions on a computer requires approximation. While traditional methods like the Finite Element Method prize continuity, they can be rigid when faced with complex geometries or sharp physical phenomena. The Discontinuous Galerkin (DG) method offers a radically different and flexible philosophy, but this freedom introduces new challenges. This article provides a deep dive into the DG framework. The first chapter, "Principles and Mechanisms," demystifies the core concepts, from the freedom of discontinuity and the crucial role of numerical fluxes to the intricacies of stability and time integration. Following this, the "Applications and Interdisciplinary Connections" chapter showcases how these principles are applied to solve complex problems in wave propagation, fluid dynamics, and multiphysics, revealing the method's profound impact across science and engineering.
To truly appreciate the Discontinuous Galerkin (DG) method, we must think like a physicist, but with the practical mindset of an engineer. The universe is governed by elegant partial differential equations, but solving them on a computer requires us to make approximations. The question is, how do we make these approximations in a way that is flexible, powerful, and yet still respects the underlying physics?
Imagine you are trying to describe the shape of a complex mountain range. One approach, the traditional way of thinking in many numerical methods like the standard Finite Element Method (FEM), is to lay a single, enormous, continuous sheet of fabric over it. To capture every peak and valley, the fabric must be stretched and contorted in very complicated ways. If you want to add more detail in one small area, you might have to adjust the entire sheet. This is the world of continuous approximations—powerful, but often rigid and cumbersome.
The Discontinuous Galerkin method proposes a radically different, almost brazen, philosophy: Divide and Conquer, then Patch Things Up.
Instead of one giant sheet, let's cover the mountain with a mosaic of small, simple, independent tiles. Each tile is a simple shape—a triangle or a square—and on each tile, we describe the local topography with a simple mathematical function, a polynomial. A flat tile is a constant polynomial, a tilted tile is a linear one, and a tile with a bit of curvature is a quadratic. The crucial idea is that these tiles don't have to meet perfectly at the edges. One tile can end at a certain height, and its neighbor can start at a completely different one. We have embraced discontinuity.
This freedom is liberating. If we need more detail in one area, we just use smaller tiles or higher-order polynomials on the existing tiles, without disturbing the rest of the mosaic. This makes DG exceptionally well-suited for problems with complex geometries or phenomena that require sharp changes, like shock waves in air or electromagnetic waves bouncing off an object.
But this freedom comes at a price. Our beautiful, continuous mountain has become a collection of disjointed floating platforms. If a skier were on this mountain, how would they get from one tile to the next? The elements are isolated; they don't know their neighbors exist. We have solved the problem locally, but we have lost the global picture. The physics, which is all about interactions and connections, is broken.
This brings us to the second part of the DG philosophy: patching things up. The communication between these isolated element "islands" must happen at their boundaries, the interfaces. The entire magic and power of the DG method is concentrated in the rule we invent to govern this communication. This rule is called the numerical flux.
Imagine two rooms, $A$ and $B$, separated by a wall with a small opening. The temperature in room $A$ is $T_A$ and in room $B$ is $T_B$. Heat will flow through the opening. The rate and direction of this flow—the heat flux—depends on both $T_A$ and $T_B$. It is a single, well-defined physical process at the interface. In our DG world, the solution is discontinuous, so at the interface between two elements, we have two different values, say $u^-$ and $u^+$. The numerical flux, denoted $f^*(u^-, u^+)$, is a function that takes both values and provides a single, unique value for the flux between the elements, mimicking the physical exchange.
To build these fluxes, we need a language to talk about the state at an interface. The most natural words in this language are the jump and the average of the solution across the interface. For a quantity $u$, with values $u^-$ and $u^+$ on the two sides of a face, we define:

$$\{\!\{u\}\!\} = \frac{u^- + u^+}{2}, \qquad [\![u]\!] = u^- - u^+.$$
The average tells us the mean value at the interface, while the jump tells us the degree of disagreement between the two elements. The numerical flux is constructed using these simple but powerful building blocks.
But what makes a "good" numerical flux? It must satisfy two non-negotiable conditions that are fundamental to physics:

Consistency: if the two sides agree, the numerical flux must reduce to the true physical flux, $f^*(u, u) = f(u)$. Where there is no disagreement, the scheme must not invent any new physics.

Conservation: the flux must be single-valued at each interface, so that whatever leaves one element exactly enters its neighbor. Nothing is created or destroyed at the seams.
The choice of numerical flux is not merely a technical detail; it is the heart of the method's behavior. It dictates whether our simulation will be stable or blow up, and whether it will conserve fundamental quantities like energy.
Let's consider the wave equation, which describes everything from ripples on a pond to the propagation of light. We can write it as a first-order system and discretize it with DG. A simple and intuitive choice for the flux is the central flux, which is built purely from the averages of the fields. For a one-parameter family of fluxes, this corresponds to setting a "penalty" parameter $\tau = 0$. If we derive the equation for the total energy of our discrete system, we find something remarkable. With the central flux, the rate of change of the discrete energy is exactly zero. The scheme perfectly conserves energy, just as the real wave equation does.
This seems perfect, but the central flux can be delicate, like a perfectly balanced needle. For more complex problems, it can be unstable. We often need to introduce a bit of stability, which usually means a bit of numerical dissipation (energy loss). This is achieved by adding a jump term to the flux. Writing the wave equation as a first-order system for a pressure-like variable $p$ and a velocity $v$, a simple penalty flux might look like:

$$p^* = \{\!\{p\}\!\} - \frac{\tau}{2}[\![v]\!], \qquad v^* = \{\!\{v\}\!\} - \frac{\tau}{2}[\![p]\!].$$

When we re-derive the energy balance with this flux, we find that the rate of change of energy is:

$$\frac{d}{dt}E_h = -\frac{\tau}{2}\sum_{\text{faces}} \int_f \left( [\![p]\!]^2 + [\![v]\!]^2 \right) ds.$$

Look at this equation! The change in energy is proportional to $-\tau$ times a sum of squared jumps. Since the jumps are squared, the sum is never negative. So, if we choose $\tau > 0$, the energy can only decrease. Our choice of flux has introduced a mechanism for controlled energy dissipation, making the scheme more robust. This is a profound insight: the abstract algebraic form of the numerical flux has a direct, tangible consequence on a physical invariant of the system. The art of designing DG schemes is the art of designing numerical fluxes that provide just the right amount of stability without polluting the physics too much.
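This energy identity is easy to check numerically. The sketch below uses a deliberately minimal piecewise-constant (degree-zero) DG discretization of 1D advection on a periodic grid, with the one-parameter flux family $f^* = \{\!\{u\}\!\} + \frac{\tau}{2}[\![u]\!]$; the penalty parameter is called `tau` here, and all names are mine rather than from any particular library.

```python
import numpy as np

def dg_rhs(u, tau, h):
    # Semi-discrete RHS for u_t + u_x = 0 with piecewise-constant elements
    # on a periodic grid.  Flux at each right interface:
    #   f* = {{u}} + (tau/2) [[u]],  with [[u]] = u_left - u_right.
    # tau = 0 is the central flux, tau = 1 the upwind flux.
    ur = np.roll(u, -1)                               # right neighbor
    fstar = 0.5 * (u + ur) + 0.5 * tau * (u - ur)
    return -(fstar - np.roll(fstar, 1)) / h

N = 64
h = 1.0 / N
u = np.random.default_rng(0).standard_normal(N)       # arbitrary initial data

# Rate of change of the discrete energy E = (h/2) * sum(u_i^2):
dEdt_central = h * u @ dg_rhs(u, 0.0, h)
dEdt_upwind = h * u @ dg_rhs(u, 1.0, h)
sum_jumps_sq = ((u - np.roll(u, -1)) ** 2).sum()

assert abs(dEdt_central) < 1e-9                       # energy exactly conserved
assert np.isclose(dEdt_upwind, -0.5 * sum_jumps_sq)   # dE/dt = -(tau/2) * sum [[u]]^2
```

With $\tau = 0$ the energy rate vanishes to rounding error; with $\tau = 1$ it matches the squared-jump formula term by term and is strictly negative whenever any jump is nonzero.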
For equations describing wave propagation, like Maxwell's equations for electromagnetism, this reasoning leads to upwind fluxes. The idea is simple: information flows in a certain direction. The flux at an interface should therefore depend more on the state from the "upwind" side. This seemingly simple physical intuition can be cast into a precise mathematical form involving jumps and averages, and it is the key to creating stable schemes for wave phenomena.
So far, we have focused on the communication between elements. But what is happening inside each element? The solution within an element is not just a single number; it's a polynomial, a smooth function like $a_0 + a_1 x + a_2 x^2$. We have to decide how to represent this polynomial. There are two main schools of thought, or "bases":
Modal Basis: We represent the polynomial as a sum of pre-defined, "holistic" functions that span the whole element. A common choice is the family of Legendre polynomials ($P_n(x)$). These functions are mathematically beautiful because they are orthogonal with respect to the standard unweighted integral: $\int_{-1}^{1} P_m(x)\, P_n(x)\, dx = 0$ for $m \neq n$. When we formulate our DG equations, this orthogonality causes many terms to vanish, resulting in a diagonal mass matrix. Computationally, this is a massive advantage—it's like getting a set of equations that are already uncoupled and easy to solve.
Nodal Basis: We represent the polynomial by its values at a specific set of points inside the element, the nodes. This is perhaps more intuitive; it's like defining a curve by plotting points and drawing a line through them. The basis functions here are Lagrange polynomials, each of which is 1 at one node and 0 at all others.
What happens if we choose a different modal basis, like Chebyshev polynomials ($T_n(x)$)? These polynomials are not orthogonal under the standard integral, but under a weighted one, $\int_{-1}^{1} T_m(x)\, T_n(x)\, \frac{dx}{\sqrt{1 - x^2}}$. This means they produce a dense, fully-coupled mass matrix, which is computationally expensive to deal with. So why would anyone use them? Because the corresponding nodes for a nodal Chebyshev basis have a wonderful property: they are not evenly spaced but cluster near the element boundaries. This clustering is incredibly effective at resolving sharp changes or boundary layers in a solution, often leading to higher accuracy for the same number of degrees of freedom. This presents a classic engineering trade-off: the clean mathematical structure and computational efficiency of Legendre polynomials versus the superior approximation properties of Chebyshev polynomials.
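The trade-off is easy to see in a few lines of NumPy. This sketch (degree $N = 4$, chosen for illustration) builds both mass matrices under the standard unweighted integral, using a Gauss rule accurate enough to compute the entries exactly:

```python
import numpy as np
from numpy.polynomial import legendre, chebyshev

N = 4                                    # polynomial degree
x, w = legendre.leggauss(N + 1)          # Gauss rule, exact through degree 2N + 1

# Evaluate each basis function P_0..P_N (resp. T_0..T_N) at the quadrature nodes.
E = np.eye(N + 1)
P = np.array([legendre.legval(x, E[k]) for k in range(N + 1)])
T = np.array([chebyshev.chebval(x, E[k]) for k in range(N + 1)])

# Mass matrices M_ij = integral of phi_i * phi_j over [-1, 1].
M_leg = (P * w) @ P.T
M_cheb = (T * w) @ T.T

# Legendre: diagonal, with the classic entries 2/(2k+1).
assert np.allclose(M_leg, np.diag([2.0 / (2*k + 1) for k in range(N + 1)]))
# Chebyshev under the *unweighted* integral: genuinely coupled off the diagonal.
assert abs(M_cheb[0, 2]) > 0.1
```

The Legendre mass matrix comes out diagonal to machine precision, while the Chebyshev one has sizable off-diagonal entries (for instance $\int_{-1}^{1} T_0 T_2\, dx = -2/3$), exactly the coupling described above.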
In an ideal world of exact mathematics, the choice of basis is just a matter of perspective; a polynomial is a polynomial, whether you describe it by its modal coefficients or its nodal values. And indeed, if all the integrals that arise in the DG formulation are computed exactly, the nodal and modal approaches are algebraically equivalent. But computers cannot do exact integrals. They use numerical quadrature, which approximates an integral by a weighted sum of the integrand's values at a set of quadrature points. The accuracy of this process is key.
For linear problems, where nothing more complicated than multiplication by a constant happens, life is relatively simple. We can use a quadrature rule that is precise enough to integrate the required polynomials exactly, preserving the equivalence of our different formulations. But physics is often nonlinear. Consider the Burgers' equation, a simple model for fluid dynamics, where the flux is $f(u) = u^2/2$.
Now, when we build our DG equations, we encounter integrals of terms like $\int \frac{u_h^2}{2}\, \phi_h'\, dx$, where $u_h$ is a polynomial of degree $N$. The term $u_h^2$ is now a polynomial of degree $2N$. If we multiply this by the derivative of a test function (degree $N-1$), the integrand becomes a polynomial of degree up to $3N-1$. Our quadrature rule, which might only be exact for polynomials of degree $2N-1$, is now insufficient.
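A concrete sketch of the failure, with illustrative choices $N = 4$, $u_h = x^N$, and test function $\phi = x^3$: the Burgers term then has degree $3N - 2 = 10$, while the $(N+1)$-point Gauss rule used here is exact only through degree $2N + 1 = 9$.

```python
import numpy as np
from numpy.polynomial import legendre

N = 4
u = lambda x: x**N                # a degree-N "solution" (illustrative choice)
dphi = lambda x: 3.0 * x**2       # derivative of the test function phi = x^3

f = lambda x: 0.5 * u(x)**2 * dphi(x)   # Burgers term: 1.5 * x^10, degree 10

xq, wq = legendre.leggauss(N + 1)       # exact only through degree 2N + 1 = 9
x_ref, w_ref = legendre.leggauss(20)    # reference rule, exact through degree 39

I_under = wq @ f(xq)
I_exact = w_ref @ f(x_ref)              # true value: integral of 1.5*x^10 = 3/11

assert np.isclose(I_exact, 3.0 / 11.0)
assert abs(I_under - I_exact) > 1e-3    # the (N+1)-point rule gets it wrong
```

The under-resolved rule silently returns a wrong value for a perfectly smooth polynomial integrand; that misattributed content is exactly the raw material of aliasing.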
This leads to a pernicious error called aliasing. The high-degree components of the true integrand, which the quadrature cannot "see" correctly, get misinterpreted as low-degree components. It's like listening to a badly sampled audio recording where a high-pitched violin note is "aliased" and sounds like a low-pitched hum. This spurious energy in the low-frequency modes can feed on itself and cause the simulation to become wildly unstable and blow up.
How can we fight this beast? One way is brute force: use an extremely high-order quadrature rule to compute the nonlinear integrals exactly, a process called de-aliasing. Another, more elegant approach is to be clever about the algebra. It turns out that you can rewrite the nonlinear term in an algebraically equivalent way, for instance, in a "split form." While these forms are identical in continuous mathematics, they behave very differently at the discrete level in the presence of aliasing. A carefully constructed skew-symmetric split form can be made to conserve energy (or a related quantity called entropy) discretely, even with an under-resolved quadrature. This prevents the aliasing-driven instability.
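For Burgers' equation, the canonical example of this idea is Tadmor's entropy-conservative two-point flux, which is what the skew-symmetric split form amounts to in flux-differencing schemes. Its defining algebraic identity can be verified directly; this is a generic sketch, not tied to any particular code base:

```python
import numpy as np

def f_ec(u, v):
    # Tadmor's entropy-conservative two-point flux for Burgers (f(u) = u^2/2)
    return (u*u + u*v + v*v) / 6.0

psi = lambda u: u**3 / 6.0   # entropy potential for the entropy S(u) = u^2/2

u, v = 1.3, -0.7
assert np.isclose(f_ec(u, u), 0.5 * u**2)                 # consistency
assert np.isclose((v - u) * f_ec(u, v), psi(v) - psi(u))  # entropy conservation
```

The second assertion, $(v - u)\, f^{ec}(u, v) = \psi(v) - \psi(u)$, is the exact algebraic cancellation that lets the discrete entropy telescope across element faces, which is why such split forms stay stable even under an under-resolved quadrature.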
However, to prove that these clever formulations work, we need to show that a discrete version of the integration-by-parts rule holds. This property, known as the Summation-By-Parts (SBP) property, is the cornerstone of provably stable high-order methods. And to guarantee that our discrete operators have the SBP property, we are led right back to a minimum requirement on our quadrature precision: for polynomials of degree $N$, we typically need a volume quadrature exact for degree $2N-1$ and a face quadrature exact for degree $2N$. Everything is connected: the stability of nonlinear simulations depends on the algebraic form of the equations, which in turn relies on the SBP property of the operators, which is guaranteed by the precision of the numerical quadrature.
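The SBP property can be checked concretely. In a modal Legendre basis with a sufficiently accurate quadrature, the stiffness-like matrix $Q_{ij} = \int_{-1}^{1} P_i\, P_j'\, dx$ satisfies the discrete integration-by-parts identity $Q + Q^T = B$, where $B$ holds only boundary values. The sketch below (degree $N = 4$, my own variable names) verifies it:

```python
import numpy as np
from numpy.polynomial import legendre

N = 4
x, w = legendre.leggauss(N + 1)      # exact through degree 2N + 1, more than the
                                     # degree 2N - 1 the SBP identity needs

E = np.eye(N + 1)
P  = np.array([legendre.legval(x, E[k]) for k in range(N + 1)])
dP = np.array([legendre.legval(x, legendre.legder(E[k])) for k in range(N + 1)])

# Q_ij = integral of P_i * P_j'  (degree <= 2N - 1, so the quadrature is exact)
Q = (P * w) @ dP.T

# Boundary matrix: B_ij = P_i(1) P_j(1) - P_i(-1) P_j(-1) = 1 - (-1)^(i+j),
# using P_k(1) = 1 and P_k(-1) = (-1)^k.
sgn = (-1.0) ** np.arange(N + 1)
B = np.ones((N + 1, N + 1)) - np.outer(sgn, sgn)

assert np.allclose(Q + Q.T, B)       # summation-by-parts holds discretely
```

Every interior contribution cancels in $Q + Q^T$, leaving only the endpoint terms, which is precisely the discrete mimic of $\int u v' + \int u' v = [uv]_{-1}^{1}$ that the stability proofs rely on.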
We have painstakingly built a sophisticated machine, our DG spatial discretization, denoted by an operator $\mathcal{L}_h$. This machine takes the state of our system, $u_h$, at a given moment and tells us how it is changing in time: $\frac{du_h}{dt} = \mathcal{L}_h(u_h)$. This is a huge system of Ordinary Differential Equations (ODEs). The final step is to solve this system to march the solution forward in time. This approach is called the Method of Lines (MoL).
We can use standard time integrators like Runge-Kutta methods. However, we need to be careful. The spatial operator might have some very desirable properties. For instance, with a proper setup for a hyperbolic problem, it might be Total Variation Diminishing (TVD), meaning it doesn't create new spurious oscillations. This is a property we'd very much like to keep.
This is why Strong Stability Preserving (SSP) time-stepping methods were invented. An SSP method is a type of Runge-Kutta scheme that is guaranteed to preserve the TVD property (or other similar stability properties) of the spatial operator. If a simple, single Forward Euler step is stable, an SSP method allows us to take much larger, more practical time steps while ensuring the overall scheme remains stable in the same sense. It ensures that our careful work on the spatial discretization isn't undone by a clumsy time integrator.
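A minimal illustration pairs the classic three-stage Shu-Osher SSP-RK3 scheme with a first-order upwind operator (a simple stand-in for a TVD spatial discretization) and checks that the total variation never grows; the problem setup and names are mine, for illustration only.

```python
import numpy as np

def L_upwind(u, h):
    # First-order upwind operator for u_t + u_x = 0 on a periodic grid.
    # With a single forward-Euler step this is TVD for dt <= h.
    return -(u - np.roll(u, 1)) / h

def ssprk3_step(u, dt, h):
    # Shu-Osher SSP-RK3: a convex combination of forward-Euler steps,
    # so it inherits the TVD property of the spatial operator.
    u1 = u + dt * L_upwind(u, h)
    u2 = 0.75 * u + 0.25 * (u1 + dt * L_upwind(u1, h))
    return u / 3.0 + (2.0 / 3.0) * (u2 + dt * L_upwind(u2, h))

def total_variation(u):
    return np.abs(u - np.roll(u, 1)).sum()

N = 100
h = 1.0 / N
x = (np.arange(N) + 0.5) * h
u = np.where((x > 0.3) & (x < 0.6), 1.0, 0.0)   # square wave, TV = 2

tv0 = total_variation(u)
for _ in range(50):
    u = ssprk3_step(u, 0.9 * h, h)

assert total_variation(u) <= tv0 + 1e-12        # no new oscillations created
```

Because each Runge-Kutta stage is a convex combination of Euler steps, whatever bound holds for one Euler step (here, non-increasing total variation) carries over to the full high-order step.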
The Method of Lines is a pragmatic and powerful way to solve time-dependent problems. It separates the complex task of spatial discretization from the well-understood task of time integration. For the sake of completeness, it's worth knowing that a more unified approach exists: spacetime DG. In this view, time is not a special dimension but is treated on equal footing with space. One discretizes a 4D spacetime domain, with discontinuities allowed across time slabs as well as spatial elements. This is a conceptually beautiful and very powerful framework, especially for problems with moving and deforming domains, but it represents a higher level of complexity.
From the simple idea of allowing discontinuities, we have journeyed through a landscape of numerical fluxes, basis functions, quadrature, aliasing, and time integration. The Discontinuous Galerkin method is not just a single technique but a rich and flexible framework—a philosophy for turning the continuous laws of physics into robust and powerful numerical simulations.
Having journeyed through the principles and mechanisms of the Discontinuous Galerkin method, we might feel like we've just learned the grammar of a new and powerful language. But a language is not for admiring in a vacuum; it is for writing poetry, for describing the world, for building things. Now, we turn to the poetry. How does this elegant mathematical framework actually connect to the world? Where does its unique structure allow us to see things, or do things, that were difficult before?
You will see that the very features we have admired—the local independence of elements, the use of numerical fluxes at interfaces, the hierarchical polynomial spaces—are not merely abstract conveniences. They are the keys that unlock solutions to profound problems across a spectacular range of scientific and engineering disciplines.
Perhaps the most intuitive place to witness the power of DG methods is in the world of waves. Imagine a sound wave traveling through the air and hitting the surface of water. We know intuitively what happens: some of the sound reflects back, and some of it transmits into the water, but changed. The rules that govern this reflection and transmission are fundamental, determined by the properties of the two materials—their density and stiffness, which combine into a quantity called acoustic impedance.
A remarkable thing happens when we simulate this phenomenon with a DG method. If we place the boundary between our numerical elements exactly where the interface between air and water is, and we use an "upwind" numerical flux—a rule that honors the direction of wave travel—we find something beautiful. The DG method, without any special instructions about reflection or transmission, will automatically and exactly reproduce the physical laws governing the wave's behavior at the interface. The numerical flux, which we introduced as a way to glue elements together, turns out to be a perfect mathematical analogue for the physics of an interface. The method doesn't just approximate the physics; it embodies it.
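The interface physics being reproduced is compactly summarized by the textbook pressure reflection and transmission coefficients for a plane wave hitting an impedance jump. A quick check, using approximate round-number impedances for air and water (my values, not from the source):

```python
# Pressure reflection/transmission at a plane acoustic interface: the
# relations an upwind flux reproduces at a material-aligned element face.
Z_air, Z_water = 413.0, 1.48e6     # approximate acoustic impedances, kg/(m^2 s)

R = (Z_water - Z_air) / (Z_water + Z_air)   # reflected pressure fraction
T = 2.0 * Z_water / (Z_water + Z_air)       # transmitted pressure fraction

assert abs((1.0 + R) - T) < 1e-12   # continuity of pressure: 1 + R = T
assert R > 0.999                     # air -> water: almost total reflection
```

The huge impedance mismatch is why shouting at a lake barely disturbs a fish: nearly all of the acoustic pressure reflects, and a well-chosen upwind flux recovers exactly this partition without being told about it.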
This principle is not confined to sound. The same mathematics describes how light waves behave at the boundary of a glass lens, how seismic waves travel through different layers of rock in the Earth's crust, and how radar signals reflect off a target. In electromagnetism, for instance, a naive numerical method can sometimes be haunted by "spurious modes"—solutions that look like waves but are just ghosts of the mathematics, with no physical reality. They are like echoes in a poorly designed concert hall. The DG framework, particularly when applied to the elegant curl-curl form of Maxwell's equations, offers a robust way to banish these ghosts. By adding a small, physically-motivated penalty for disagreement between elements, we can design DG schemes that are guaranteed to be free of this non-physical pollution, ensuring that the computed resonant frequencies of a device, for example, correspond to reality and not to numerical artifacts.
While linear waves are elegant, the universe is often more chaotic. Consider the flow of air over a supersonic aircraft's wing. The physics is no longer a gentle back-and-forth; it's a violent, nonlinear dance involving shockwaves—discontinuities where pressure, density, and temperature change almost instantaneously. Capturing these shocks is one of the great challenges of computational fluid dynamics (CFD).
Here, the modularity of the DG method truly shines. The choice of the numerical flux at an element's boundary acts like choosing a different lens through which to view the flow. For smooth, gentle flows, we can use a simple, low-dissipation flux that gives very sharp, accurate results. But near a shockwave, such a lens might produce wild, unphysical oscillations, like a camera struggling to focus on an object moving too fast. The simulation can literally blow up.
Instead, we can switch to a more robust "lens," a different numerical flux like HLLC (Harten–Lax–van Leer–Contact) that is designed to handle shocks. These fluxes have more numerical dissipation, acting like a built-in shock absorber. They might slightly blur the fine details of the flow, but they ensure that the simulation remains stable and physically realistic through the extreme conditions of a shockwave. This flexibility to choose the right flux for the job—or even mix and match fluxes in different parts of the simulation—makes DG an exceptionally powerful tool for aerospace engineering, astrophysics (simulating supernovae), and other fields where shocks are king.
The dance of fluids gets even more intricate when we consider turbulence—the chaotic, swirling eddies that form in the wake of a car or in a river. We cannot possibly simulate every single tiny eddy. Instead, we use a technique called Large Eddy Simulation (LES), where we compute the large, energy-carrying eddies directly and model the effects of the smaller, dissipative ones. Again, DG provides a beautiful framework. The polynomial basis within each element gives us a natural hierarchy of scales. We can use the full polynomial solution to represent the "resolved" flow, and then project it onto a lower-degree polynomial space to define a "filtered" flow. The difference between these two representations gives us direct information about the unresolved scales, which can be used to dynamically compute the correct model for the small eddies on the fly. This turns the DG method into an intelligent tool for turbulence modeling, adapting itself to the local physics of the flow.
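A sketch of this scale separation within one element, using a Legendre modal transform with resolved degree $N = 8$ and filter degree $N_c = 4$ (both chosen arbitrarily for illustration):

```python
import numpy as np
from numpy.polynomial import legendre

N, Nc = 8, 4                       # resolved degree and "filter" degree
x, w = legendre.leggauss(N + 1)
u = np.exp(np.sin(np.pi * x))      # sample field at the quadrature nodes

# Modal Legendre coefficients via the discrete orthogonal projection:
#   a_k = (2k+1)/2 * sum_q w_q u(x_q) P_k(x_q)
P = np.array([legendre.legval(x, np.eye(N + 1)[k]) for k in range(N + 1)])
a = (np.arange(N + 1) + 0.5) * (P @ (w * u))

a_filtered = a.copy()
a_filtered[Nc + 1:] = 0.0          # project onto polynomials of degree <= Nc

small_scales = legendre.legval(x, a - a_filtered)   # the unresolved content

# The modal transform reproduces u at the nodes, and the filtered part plus
# the small-scale part add back up to the full field.
assert np.allclose(legendre.legval(x, a), u)
assert np.allclose(legendre.legval(x, a_filtered) + small_scales, u)
```

The array `small_scales` is exactly the difference between the resolved and filtered representations described above, the quantity a dynamic subgrid model can feed on.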
Few real-world problems involve just one type of physics. The temperature of a turbine blade is determined by the hot gas flowing over it (fluid dynamics and heat transfer), which in turn causes the blade to expand and deform (solid mechanics). This is the world of multiphysics, and a major challenge is ensuring that the different mathematical models for each physical domain can "talk" to each other in a stable and accurate way.
The DG method's focus on interfaces makes it a natural language for coupling. Imagine a simple problem where a diffusive process (like heat spreading in a solid rod) is coupled to a steady-state field (like an external potential). We can model the rod with one DG element and the external field with another domain. The partitioned time-stepping scheme—where we solve the elliptic problem for the potential, pass its flux as a boundary condition to the parabolic heat problem, and then advance the heat problem in time—is a common strategy.
But does this back-and-forth communication lead to a stable simulation? The DG framework allows us to answer this question with mathematical certainty. By analyzing how a small perturbation at the interface propagates through the coupled system, we can derive a precise stability limit for the time step. We find that the stability depends on a beautiful combination of the physical parameters of both domains and the geometry of the discretized element. The ability to perform such an analysis is critical, transforming the art of multiphysics coupling into a rigorous science.
An elegant theory is one thing, but a practical computational method must also be efficient. Some of the most profound applications of DG arise from how its structure can be exploited to design blazingly fast algorithms for the world's largest supercomputers.
A simple yet powerful idea is local time-stepping. In many problems, we need a very fine mesh in some areas (e.g., around a complex object) and a coarse mesh elsewhere. In a traditional method, the smallest element on the entire mesh dictates the maximum stable time step for the whole simulation, which can be incredibly wasteful. Because DG elements are only connected to their immediate neighbors, they can march forward in time at their own pace. We can use tiny time steps on the small elements and huge time steps on the large ones, all within a single, coherent simulation. The overall speedup can be enormous, turning an infeasible calculation into an overnight run.
A deeper connection lies in how we solve the giant systems of linear equations that DG methods produce. An advanced technique called Algebraic Multigrid (AMG) works by creating a hierarchy of coarser and coarser versions of the problem. A key insight is that standard relaxation methods (like Jacobi) are good at eliminating "rough" or "high-frequency" errors but very slow at getting rid of "smooth" or "low-frequency" errors. The magic of AMG is that it transfers these stubborn, smooth errors to a coarser grid, where they suddenly look "rough" and are easy to eliminate.
When applied to high-order DG systems for elliptic problems, a wonderful thing happens. The "algebraically smooth" errors that the solver struggles with correspond precisely to the functions that are geometrically smooth—functions with small gradients and small jumps between elements. These are exactly the functions that are well-approximated by low-degree polynomials! Thus, the very structure of the DG polynomial space is in perfect harmony with the needs of the multigrid solver. This synergy between the discretization method and the linear algebra solver is a testament to the deep mathematical elegance of the approach.
The quest for speed even leads us to parallelize in time. Instead of computing one time step after another, methods like Parareal try to compute chunks of time simultaneously. Here too, the DG structure is a gift. A common strategy is to use a cheap "coarse" predictor to get a rough draft of the solution over a large time interval, and then run expensive "fine" correctors in parallel. With DG, creating the coarse predictor is as simple as re-running the simulation with a lower polynomial degree, which is naturally faster and allows for larger time steps.
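A toy Parareal sketch on a scalar ODE, with several RK4 substeps as the "fine" propagator and a single forward-Euler step as the "coarse" predictor (stand-ins for high- and low-degree DG runs; all names and parameters are mine):

```python
import numpy as np

lam = -1.0   # toy scalar ODE y' = lam * y standing in for the big DG system

def fine(y, dt):
    # "Fine" propagator: 10 RK4 substeps per chunk.
    h = dt / 10
    for _ in range(10):
        k1 = lam * y
        k2 = lam * (y + 0.5 * h * k1)
        k3 = lam * (y + 0.5 * h * k2)
        k4 = lam * (y + h * k3)
        y = y + h / 6 * (k1 + 2*k2 + 2*k3 + k4)
    return y

def coarse(y, dt):
    # "Coarse" predictor: one forward-Euler step per chunk.
    return y + dt * lam * y

T, P = 2.0, 8                            # time window split into P chunks
dt = T / P
U = np.empty(P + 1)
U[0] = 1.0
for n in range(P):                       # iteration 0: pure coarse sweep
    U[n + 1] = coarse(U[n], dt)

for _ in range(P):                       # Parareal correction iterations
    F = np.array([fine(U[n], dt) for n in range(P)])     # parallel in time
    G = np.array([coarse(U[n], dt) for n in range(P)])
    for n in range(P):                   # cheap serial coarse sweep + correction
        U[n + 1] = coarse(U[n], dt) + F[n] - G[n]

# After P iterations, Parareal reproduces the serial fine solution exactly.
y_ref = 1.0
for _ in range(P):
    y_ref = fine(y_ref, dt)
assert abs(U[-1] - y_ref) < 1e-10
```

In practice one stops after far fewer iterations than chunks; the point of the exact-reproduction check is that the parallel-in-time iteration converges to the serial fine answer, not to something else.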
Finally, the Discontinuous Galerkin method is not just a tool for performing single, high-fidelity simulations. It is a cornerstone for building the next generation of engineering tools: reduced-order models (ROMs), or "digital twins."
Imagine you have performed one incredibly detailed and expensive DG simulation of airflow over a wing. A ROM is a method to distill the essential dynamics from that simulation into a model that is thousands or millions of times faster to run, yet still captures the dominant behavior. This is done by identifying the most important "modes" of the flow using a technique like Proper Orthogonal Decomposition (POD).
The question is, once you have these modes, how do you build a stable model with them? Once again, the DG framework provides the answer. The full DG simulation has a natural notion of energy, embodied in its mass matrix $M$. If we build our ROM using a projection that respects this energy inner product (a so-called Galerkin projection in the $M$-norm), the resulting ROM is guaranteed to inherit the dissipativity (and thus, stability) of the original high-fidelity model. This ensures our digital twin isn't just fast, but also trustworthy. This allows engineers to perform rapid design optimization, test control strategies in real-time, and explore vast parameter spaces in a way that would be impossible with the full DG model alone.
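One way to realize such an $M$-orthogonal projection is to factor the mass matrix $M$ and compute the POD modes in the transformed variables. The sketch below uses a hypothetical diagonal mass matrix and random snapshots purely for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)
n_dof, n_snap, r = 50, 20, 3

M = np.diag(rng.uniform(0.5, 2.0, n_dof))   # hypothetical SPD (here diagonal) mass matrix
X = rng.standard_normal((n_dof, n_snap))    # snapshot matrix, one column per saved time

# POD in the M-inner product: factor M = L L^T, do an ordinary SVD in the
# transformed variables, then map back so the modes come out M-orthonormal.
Lc = np.linalg.cholesky(M)
U, s, _ = np.linalg.svd(Lc.T @ X, full_matrices=False)
Phi = np.linalg.solve(Lc.T, U[:, :r])       # reduced basis with Phi^T M Phi = I

assert np.allclose(Phi.T @ M @ Phi, np.eye(r))
```

Because the modes are orthonormal in the $M$-inner product, the Galerkin-projected reduced operator inherits the same energy estimate as the full model, which is the mechanism behind the stability guarantee described above.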
From the simple bounce of a sound wave to the complex dance of turbulence, from the coupling of disparate physical worlds to the design of futuristic computational algorithms, the Discontinuous Galerkin method reveals itself to be more than a clever numerical scheme. It is a unifying philosophy, a versatile language that brings mathematical elegance into direct, powerful contact with the physics of the real world.