
Three-Address Code

Key Takeaways
  • Three-address code (TAC) deconstructs complex source code statements into a sequence of simple instructions, each with at most one operator.
  • This intermediate representation is the foundation for crucial compiler optimizations like constant folding, common subexpression elimination, and dead code elimination.
  • TAC models program control flow, such as loops and conditionals, through a simple system of conditional and unconditional jumps to labeled instructions.
  • The structure of TAC makes abstract concepts like array indexing and simultaneous updates explicit, revealing the underlying memory address calculations and data dependencies.
  • By enabling liveness analysis of variables, TAC plays a key role in efficient register allocation, a critical step for generating high-performance machine code.

Introduction

In the world of computer science, the journey from human-readable source code to the raw binary instructions a processor executes is a complex process of translation. At the heart of this process lies a pivotal, machine-independent abstraction known as ​​three-address code (TAC)​​. Acting as the compiler's universal language, or lingua franca, TAC bridges the gap between the expressive, high-level languages used by programmers and the spartan, primitive operations understood by hardware. It addresses the fundamental problem of how to systematically and unambiguously represent complex computations in a way that is ripe for analysis and optimization. This article delves into the world of three-address code, exploring its role as the unseen architect of program performance.

First, in ​​Principles and Mechanisms​​, we will deconstruct how TAC breaks down expressions, manages temporary variables, and handles control flow and memory access, revealing the deterministic logic beneath high-level abstractions. Then, in ​​Applications and Interdisciplinary Connections​​, we will explore how this representation serves as the compiler's workbench for powerful optimizations, examine its real-world impact in fields from digital signal processing to machine learning, and see how it guides the final transformation of code into efficient machine instructions.

Principles and Mechanisms

To truly appreciate the role of three-address code, we must think like a compiler. When a human programmer writes a line of code, they see a single, high-level intention. But the computer's central processing unit (CPU) is a far simpler beast. It operates on a sequence of primitive commands: load a value from memory, add two numbers, store a value back to memory. The journey from a rich, expressive programming language to these spartan machine instructions is a fascinating process of translation, and three-address code (TAC) is the most important waypoint on that journey. It is the compiler's lingua franca, a universal language that bridges the gap between human-readable source code and machine-executable binary.

The Art of Deconstruction

Let's start with something that looks utterly trivial: an assignment statement like $x := y + z$. What does this really mean? To a CPU, it’s not a single action. It’s a multi-step recipe. First, the value of $y$ must be fetched from memory. Then, the value of $z$ must be fetched. Only then can the two values be added together. Finally, the resulting sum must be written to the memory location designated by $x$.

This decomposition is the very soul of three-address code. We can write this sequence down in a more formal way:

  1. $t_1 := \text{load}(y)$
  2. $t_2 := \text{load}(z)$
  3. $t_3 := t_1 + t_2$
  4. $\text{store}(x, t_3)$

Notice what has happened. The single, complex statement has been broken down into a series of simple, unambiguous instructions. Each instruction performs at most one operation. We've also introduced temporary variables, $t_1$, $t_2$, and $t_3$, which act as the compiler's scratchpad, holding intermediate results. This methodical breakdown is crucial for several reasons. For one, it helps manage complexities like ​​aliasing​​, where variables like $x$ and $y$ might unexpectedly refer to the same memory location. By loading $y$ and $z$ into temporaries first, we ensure we have their original values before any of them can be overwritten by the final store to $x$. Furthermore, this explicit sequence gives the compiler a list of discrete tasks that it can schedule to run efficiently on the actual hardware, respecting the latencies and capabilities of the processor's memory and arithmetic units.

A Universal Language for Computation

The beauty of this approach is its uniformity. We can establish a canonical form for most of our instructions:

result := operand1 op operand2

This is the origin of the name ​​three-address code​​: each instruction involves at most three "addresses" (variables, constants, or temporaries). This simple, consistent structure is a delight for a compiler. It's easy to parse, easy to analyze, and easy to manipulate. An entire ecosystem of optimizations can be built on top of this representation. For example, a common way to store these instructions is in a structure called a ​​quadruple​​, which is simply a bundle of four fields: the operator, the two operands, and the result.
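The quadruple idea is easy to make concrete. Below is a minimal sketch in Python (the field names and the `load`/`store` pseudo-operators are illustrative, not from any particular compiler), encoding the $x := y + z$ sequence from the previous section:

```python
from collections import namedtuple

# A quadruple: one operator, up to two operands, and a result field.
Quad = namedtuple("Quad", ["op", "arg1", "arg2", "result"])

# x := y + z, broken into explicit loads, one addition, and a store.
tac = [
    Quad("load",  "y",  None, "t1"),
    Quad("load",  "z",  None, "t2"),
    Quad("+",     "t1", "t2", "t3"),
    Quad("store", "t3", None, "x"),
]

for q in tac:
    print(q)
```

Because every instruction has the same four-field shape, later passes can scan, rewrite, or reorder this list without parsing anything.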

But how does a compiler systematically generate this code from a potentially convoluted expression like $-a + b^2 - c \times (-d)$? It doesn't guess. It follows the structure of the language's grammar, as captured by an ​​Abstract Syntax Tree (AST)​​. Think of the AST as the skeleton of the expression. Every time the compiler encounters an operator node in this tree—like +, *, or a unary -—it emits exactly one three-address instruction.

For the expression $-a + b^2 - c \times (-d)$, the compiler would follow operator precedence rules to generate a sequence of simple steps. It would first handle the highest-precedence operations: the two unary minuses and the exponentiation (which itself is broken down into $b \times b$). Then it would handle the multiplication, and finally, the addition and subtraction. Each step produces a result that is stored in a new temporary variable, which then becomes an operand for a subsequent instruction. The complex, nested expression is flattened into a straight-line sequence of primitive calculations.
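A minimal generator makes this flattening concrete. In the sketch below, the tuple-based AST encoding and the temporary-naming scheme are inventions for illustration; the walk emits exactly one instruction per operator node of $-a + b \times b - c \times (-d)$:

```python
# Hedged sketch: recursive TAC generation from a tiny tuple-encoded AST.
counter = 0
code = []

def new_temp():
    """Return a fresh temporary name t1, t2, ..."""
    global counter
    counter += 1
    return f"t{counter}"

def gen(node):
    """Emit TAC for an expression node; return the name holding its value."""
    if isinstance(node, str):              # a plain variable reference
        return node
    op, *kids = node
    if op == "neg":                        # unary minus
        t = new_temp()
        code.append(f"{t} := -{gen(kids[0])}")
        return t
    left, right = (gen(k) for k in kids)   # evaluate children left to right
    t = new_temp()
    code.append(f"{t} := {left} {op} {right}")
    return t

# -a + b*b - c*(-d), with b^2 already lowered to b * b:
expr = ("-", ("+", ("neg", "a"), ("*", "b", "b")),
             ("*", "c", ("neg", "d")))
gen(expr)
print("\n".join(code))
```

Running it yields the straight-line sequence `t1 := -a`, `t2 := b * b`, `t3 := t1 + t2`, `t4 := -d`, `t5 := c * t4`, `t6 := t3 - t5`.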

The Life of a Temporary

The temporary variables we introduced are more than just placeholders; their management is at the heart of a crucial optimization process called ​​register allocation​​. A CPU has a small number of very fast storage locations called registers. To make a program run quickly, the compiler wants to keep frequently used variables and temporaries in these registers as much as possible, avoiding slow trips to main memory.

The way an expression is translated to TAC can have a significant impact on this. Consider the simple sum $a + b + c + d$. We could evaluate this in a strictly left-to-right fashion, corresponding to the tree $(((a + b) + c) + d)$. This produces a "deep" or "chain-like" sequence of TAC:

  1. $t_1 := a + b$
  2. $t_2 := t_1 + c$
  3. $t_3 := t_2 + d$

Alternatively, we could evaluate it in a balanced way, corresponding to $((a + b) + (c + d))$. This produces a "bushier" TAC sequence:

  1. $t_1 := a + b$
  2. $t_2 := c + d$
  3. $t_3 := t_1 + t_2$

In the first case, the temporary $t_1$ is born at instruction 1 and "dies" at instruction 2, living for a very short time. The same is true for $t_2$. In the second case, $t_1$ is born at instruction 1 but must be kept alive until instruction 3, while $t_2$ is also being computed. This second approach requires more temporaries to be "live" at the same time, potentially putting more pressure on the limited number of available registers. By analyzing the ​​lifetimes​​ of temporaries in the TAC, the compiler can make intelligent choices about how to schedule instructions and allocate registers, all in service of generating faster code.
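The difference in register pressure can be measured mechanically. In this toy analysis, the instruction encoding and the convention that names starting with `t` are temporaries are inventions, and a final result with no later use is treated as immediately consumed:

```python
def max_live_temps(instrs):
    """instrs: list of (result, op1, op2). Returns peak count of live temporaries."""
    is_temp = lambda v: isinstance(v, str) and v.startswith("t")
    # Record the index of each temporary's last use as an operand.
    last_use = {}
    for i, (_, a, b) in enumerate(instrs):
        for v in (a, b):
            if is_temp(v):
                last_use[v] = i
    live, peak = set(), 0
    for i, (res, a, b) in enumerate(instrs):
        # Operands whose last use is here die before the result is born.
        for v in (a, b):
            if is_temp(v) and last_use.get(v) == i:
                live.discard(v)
        if is_temp(res) and last_use.get(res, -1) > i:
            live.add(res)
        peak = max(peak, len(live))
    return peak

chain = [("t1", "a", "b"), ("t2", "t1", "c"), ("t3", "t2", "d")]
bushy = [("t1", "a", "b"), ("t2", "c", "d"), ("t3", "t1", "t2")]
print(max_live_temps(chain), max_live_temps(bushy))  # chain needs fewer
```

The chain-like schedule never keeps more than one temporary alive, while the bushy one holds two at once, exactly the pressure difference described above.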

Weaving the Flow of Control

So far, we have only discussed straight-line code. But real programs are full of decisions and loops. How does TAC represent control flow? The answer is elegantly simple: with jumps. TAC includes two new kinds of instructions:

  • ​​Unconditional jump:​​ goto L (where $L$ is a label representing an instruction address)
  • ​​Conditional jump:​​ if x relop y goto L

With these two tools, we can construct any control flow structure imaginable. An if-else statement is a perfect example. Consider if (B) S1 else S2. The compiler generates code that first evaluates the boolean condition $B$. If $B$ is true, it jumps to the block of code for $S_1$. If $B$ is false, it falls through to the code for $S_2$. To ensure only one branch is executed, an unconditional jump is placed at the end of the $S_1$ block to hop over the $S_2$ block.

This mechanism becomes even more powerful when handling logical operators like && (AND) and || (OR) with short-circuit evaluation. When we see $B_1 \lor B_2$, we don't need to generate any special "OR" instruction. We translate it directly into control flow: evaluate $B_1$. If it's true, we know the whole expression is true, so we can jump directly to the "true" outcome. If $B_1$ is false, only then do we proceed to evaluate $B_2$. A logical operator is transformed into a pattern of conditional jumps.

And what is a while loop? It's just a condition check followed by a jump that goes backward. For while (B) S, the compiler generates code to evaluate $B$. If $B$ is false, it jumps to the instruction after the loop. If $B$ is true, it executes the code for the loop body $S$, and the very last instruction of the body is an unconditional jump right back to the beginning to re-evaluate $B$. In the world of three-address code, the dynamic, looping behavior of a program is revealed to be a static sequence of instructions with a simple backward jump.
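As a concrete illustration, here is the TAC skeleton for a loop like `while (i < n) i := i + 1`, written out as a small Python list of instruction strings (the label names `L1` and `L2` are arbitrary):

```python
# The condition is tested at L1; a false test jumps past the body,
# and the last body instruction jumps back to re-test the condition.
tac = [
    "L1: if i >= n goto L2",    # negated test: exit when the loop is done
    "    i := i + 1",           # loop body
    "    goto L1",              # backward jump to the condition
    "L2:",                      # first label after the loop
]
print("\n".join(tac))
```

Note how the only backward-pointing instruction is the unconditional `goto L1`; everything else reads top to bottom.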

Talking to Memory

Modern programs don't just manipulate single variables; they operate on vast data structures like arrays and records (structs). Accessing an element like $A[i]$ seems instantaneous in a high-level language, but TAC reveals the mechanical calculation happening underneath. To find the location of $A[i]$, the machine must compute its memory address:

$\text{address}(A[i]) = \text{base\_address}(A) + i \times \text{element\_size}$

This address calculation is itself translated into a sequence of three-address instructions. For a statement like $A[i] = B[j] + C[k] \times D[l]$, the compiler will first generate TAC to compute the addresses of $B[j]$, $C[k]$, and $D[l]$, load their values into temporaries, perform the arithmetic, then compute the address of $A[i]$, and finally store the result. The abstraction of the array melts away to reveal the raw arithmetic of memory addressing.

This perspective provides profound insights. Consider accessing a field within a struct that is inside an array, like $arr[i].y$. The way we organize data in memory—a choice made at a high level of abstraction—has direct consequences on the generated TAC. If we use an ​​array-of-structs (AoS)​​ layout, where each element of arr is a complete struct, the address calculation might look like $ \text{base} + i \times 24 + 8 $, involving a multiplication and two additions. But if we use a ​​struct-of-arrays (SoA)​​ layout, where each field (x, y, z) is its own separate, contiguous array, the address for $arr[i].y$ might simply be $ \text{base\_of\_y\_array} + i \times 8 $. The multiplication by $8$ (a power of two) can be replaced by a much faster bit-shift instruction ($i \ll 3$). The TAC for the SoA layout is simpler and more efficient. This beautiful link shows how decisions about data organization ripple all the way down to the fundamental instructions the machine will execute, a connection made perfectly clear through the lens of three-address code.
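The two layouts can be compared directly. In this sketch the struct is assumed to hold three 8-byte fields `x`, `y`, `z` (24 bytes per struct, with `y` at offset 8), matching the numbers in the text; the base addresses are placeholders:

```python
def aos_offset_y(base, i):
    """Array-of-structs: one multiply plus two adds per access."""
    return base + i * 24 + 8

def soa_offset_y(base_of_y, i):
    """Struct-of-arrays: i * 8 becomes a shift, plus one add."""
    return base_of_y + (i << 3)

print(hex(aos_offset_y(0x1000, 5)))
print(hex(soa_offset_y(0x2000, 5)))
```

Both functions compute valid addresses, but the SoA version lowers to fewer and cheaper TAC instructions.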

Imposing Order on Ambiguity

Perhaps the most crucial role of three-address code is to be the ultimate arbiter of meaning. High-level languages sometimes contain ambiguities or behaviors that are "undefined." A classic example from languages like C is the statement $x = x++;$. Does $x$ get assigned its old value, or its new, incremented value? The language standard might not say, leaving it as ​​undefined behavior​​.

A compiler, however, cannot produce "undefined" machine code. It must make a choice. The process of translating to TAC forces this choice to be made explicit. To respect the "post-increment" semantics (use the value, then increment), a compiler might generate the following TAC sequence:

  1. $t_1 := x$ (Save the original value of $x$ into a temporary)
  2. $x := x + 1$ (Apply the side effect: increment $x$)
  3. $x := t_1$ (Perform the assignment using the saved original value)

In this sequence, the final value of $x$ will be its original value, as the increment is overwritten by the final assignment. Another compiler might choose a different order. The key point is that the TAC representation is completely unambiguous. It is a precise, formal contract describing exactly what will happen and in what order. It is the point where the beautiful, sometimes messy, abstractions of a high-level language are forged into the cold, hard, deterministic logic of a machine.
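The three TAC instructions above can be replayed directly in Python to confirm what this particular compiler's choice computes (the starting value 5 is arbitrary):

```python
x = 5
t1 = x         # 1. save the original value of x into a temporary
x = x + 1      # 2. apply the side effect: increment x
x = t1         # 3. assign the saved original value, overwriting the increment
print(x)       # x ends up with its original value, 5
```

A compiler that ordered the steps differently would print 6; the point is that each TAC sequence pins down exactly one answer.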

Applications and Interdisciplinary Connections

We have seen that three-address code (TAC) is a marvel of structured simplicity, a language that a machine can understand. But its true power is not merely in its role as a translator. TAC is the compiler's private workbench, a place where it can pause, reflect, and reason about our programs. It is on this workbench that raw, literal translations are transformed into elegant, efficient machine instructions. This intermediate world is where the true artistry of a modern compiler unfolds, bridging the expressive domains of human thought with the stark realities of silicon. Let's explore this world and see how TAC acts as the unseen architect behind much of modern computing.

The Art of Optimization: Making Code Faster and Smarter

A compiler's first duty is correctness, but its second, and perhaps more celebrated, duty is performance. Three-address code provides the perfect representation for a compiler to become more than a rote translator—it becomes an optimizer.

Imagine you write a piece of code containing the expression $a + 0 \times b - 2 \times (c - c)$. A naive compiler might dutifully translate every operation. But a smarter compiler, working with TAC, can see this expression for what it is. It breaks the calculation into simple, atomic steps. In doing so, it immediately recognizes that $c - c$ is always zero, and so is $2 \times 0$. It sees that $0 \times b$ is also zero, regardless of what $b$ is. The entire elaborate expression, when viewed through the clarifying lens of TAC, collapses into a single, trivial assignment: the result is simply $a$. This process, known as constant folding and dead code elimination, is a fundamental optimization where the compiler performs arithmetic at compile-time, sparing the processor from doing pointless work at run-time.
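A toy simplifier shows how little machinery this takes. The sketch below covers only the algebraic identities the example uses ($y - y = 0$, $0 \times y = 0$, $y + 0 = y$, $y - 0 = y$); the function name and the step-by-step driving are inventions for illustration:

```python
def simplify(op, a, b):
    """Fold one TAC instruction if a known identity applies."""
    if op == "-" and a == b:
        return 0                    # y - y  ->  0
    if op == "*" and (a == 0 or b == 0):
        return 0                    # 0 * y  ->  0
    if op in ("+", "-") and b == 0:
        return a                    # y + 0, y - 0  ->  y
    return (op, a, b)               # no rule applied

# a + 0*b - 2*(c - c), folded step by step:
t1 = simplify("*", 0, "b")          # 0 * b    -> 0
t2 = simplify("-", "c", "c")        # c - c    -> 0
t3 = simplify("*", 2, t2)           # 2 * 0    -> 0
t4 = simplify("+", "a", t1)         # a + 0    -> a
t5 = simplify("-", t4, t3)          # a - 0    -> a
print(t5)                           # the whole expression collapses to a
```

Each fold feeds the next, which is why the TAC form, where every instruction is this small, makes the cascade easy to drive.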

The compiler's intelligence doesn't stop at simple arithmetic. Consider a program that needs to calculate $m + n$ and then, a few lines later, needs to calculate it again. Should it perform the same addition twice? A human wouldn't, and neither does a good compiler. By analyzing the TAC, the compiler can identify these "common subexpressions." Using a technique called Global Value Numbering, it can see that the value for $m+n$ has already been computed and is available. Instead of re-computing the sum, it simply reuses the previous result. It's a simple idea, but it's the basis for how compilers avoid redundant work on a grand scale.
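A local version of this idea fits in a few lines. In this sketch (the quadruple encoding and the `copy` pseudo-instruction are inventions), identical `(op, operand, operand)` keys reuse the temporary that first computed them:

```python
def number_values(instrs):
    """Local common-subexpression elimination over (result, op, a, b) quads."""
    table, out = {}, []
    for res, op, a, b in instrs:
        key = (op, a, b)
        if key in table:
            # Already computed: emit a cheap copy instead of redoing the work.
            out.append((res, "copy", table[key], None))
        else:
            table[key] = res
            out.append((res, op, a, b))
    return out

tac = [("t1", "+", "m", "n"),
       ("t2", "+", "m", "n")]       # the same sum, computed twice
print(number_values(tac))
```

A full Global Value Numbering pass extends this table across basic blocks and handles commutativity, but the core bookkeeping is the same.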

This optimization can even extend to a form of algebraic reasoning. Some operations are much more "expensive" for a processor than others—multiplication, for instance, can take more time and energy than addition. What if you asked a compiler to compute $m \cdot n + n \cdot p + p \cdot m$? A direct translation would require three expensive multiplications. But a clever compiler can leverage the properties of algebra. By factoring the expression into $n \cdot (m + p) + p \cdot m$, it can achieve the same result with only two multiplications. Three-address code provides a structured format where such algebraic transformations, governed by rules like commutativity and distributivity, can be safely applied, turning the compiler into a tiny, tireless mathematician that refines our code for peak efficiency.
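A quick numeric check confirms the rewrite is value-preserving (the sample values are arbitrary):

```python
def direct(m, n, p):
    return m * n + n * p + p * m        # three multiplications

def factored(m, n, p):
    return n * (m + p) + p * m          # two multiplications, one extra add

print(direct(3, 5, 7) == factored(3, 5, 7))
```

Trading a multiplication for an addition is usually a win, though a real compiler must also verify the rewrite is safe under the target's arithmetic (for floating point, reassociation can change results).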

Bridging Worlds: Code in Science and Engineering

These optimization principles are not mere academic exercises; they have profound impacts on real-world applications across science and engineering.

In ​​digital signal processing (DSP)​​, the field that powers our audio and video, simple-looking equations are executed billions of times. A common first-order filter, used to smooth signals, might be described by the equation $y = \alpha \cdot x + (1 - \alpha) \cdot y_{prev}$. Here, $x$ is the current input, $y_{prev}$ is the previous output, and $\alpha$ is a constant blending factor. If $\alpha = \frac{13}{37}$, the compiler can use constant folding to pre-calculate the value of $1-\alpha$ (which is $\frac{24}{37}$). This saves one subtraction for every single sample being processed. For a standard audio stream at 44.1 kHz, that's over 44,000 operations saved every second. For high-frequency radio signals, the savings are astronomical. This tiny optimization, enabled by TAC, directly translates to higher performance and lower energy consumption in countless devices.
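In code, the folding amounts to replacing a per-sample subtraction with a constant. This sketch writes the folded constant out by hand to mirror what the compiler would do automatically (the function and constant names are invented):

```python
ALPHA = 13 / 37
ONE_MINUS_ALPHA = 24 / 37   # 1 - alpha, pre-computed: no per-sample subtraction

def smooth(x, y_prev):
    """First-order filter: y = alpha*x + (1 - alpha)*y_prev."""
    return ALPHA * x + ONE_MINUS_ALPHA * y_prev

print(smooth(1.0, 0.0))     # just alpha when the history is zero
```

One folded subtraction looks trivial, but inside a loop that runs 44,100 times per second it is anything but.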

The same principles of correctness and efficiency are paramount in ​​computational science​​. Consider the SIR model, a set of equations used to simulate the spread of an epidemic:

$S' = S - \beta S I, \quad I' = I + \beta S I - \gamma I, \quad R' = R + \gamma I$

The notation here implies a simultaneous update: the new populations of susceptible ($S'$), infectious ($I'$), and removed ($R'$) individuals must all be calculated using the current state ($S, I, R$). If a compiler were to update $S$ first and then use that new value to calculate $I'$, the simulation would be wrong. Three-address code provides the solution. It allows the compiler to first compute the intermediate terms that appear in multiple equations, like the number of new infections $\beta S I$ and new removals $\gamma I$, storing them in temporary variables. Only after all these terms are calculated based on the old state does it perform the final updates to $S$, $I$, and $R$.
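One correct SIR step, written the way the TAC would order it, computes the shared terms into temporaries from the old state before any variable is updated (the sample parameter values are arbitrary):

```python
def sir_step(S, I, R, beta, gamma):
    """One simultaneous SIR update via temporaries, as TAC would sequence it."""
    new_inf = beta * S * I          # t1: new infections, from the OLD S and I
    new_rem = gamma * I             # t2: new removals, from the OLD I
    # Only now are the state variables overwritten.
    return S - new_inf, I + new_inf - new_rem, R + new_rem

print(sir_step(0.99, 0.01, 0.0, beta=0.5, gamma=0.1))
```

Because every update reads only the temporaries and the old values, the total population $S + I + R$ is exactly conserved, which an in-place, sequential update would violate.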

This challenge is even more intuitive in ​​image processing​​. Imagine applying a blur filter to an image, where each pixel's new value is the average of its neighbors. If you update the pixels "in-place," the calculation for a pixel at location $(i, j)$ might use the old value of its neighbor to the left but the already-new, blurred value of its neighbor above, leading to a skewed, incorrect result. The standard solution is a "double buffer" strategy: a temporary copy of the entire image is created. The blur calculations read only from the original image and write their results to the temporary buffer. Once every pixel has been computed, the temporary buffer is copied back to the original. TAC is the mechanism that makes this strategy straightforward to implement, ensuring that the high-level idea of a simultaneous update is translated correctly into a sequence of low-level loads and stores.
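A one-dimensional version of the double-buffer strategy fits in a few lines: every output element reads only the original buffer, so the "simultaneous" update stays correct (the boundary handling, which leaves the endpoints untouched, is a simplification):

```python
def blur(signal):
    """3-point moving average using a double buffer; endpoints are copied."""
    out = list(signal)              # the temporary buffer
    for i in range(1, len(signal) - 1):
        # Reads come only from the ORIGINAL signal, never from out.
        out[i] = (signal[i - 1] + signal[i] + signal[i + 1]) / 3
    return out                      # "copy back" by returning the new buffer

print(blur([0, 0, 3, 0, 0]))
```

Updating `signal` in place instead would let already-blurred values leak into later averages, producing the skewed result the text describes.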

Finally, think of the world of ​​machine learning and AI​​, which is built on the manipulation of massive multi-dimensional arrays, or "tensors." A simple line in a Python program like value = data[5, 10, 15] hides a significant computation. To find that element, the computer must calculate a memory offset: $\text{offset} = 5 \times \text{stride}_0 + 10 \times \text{stride}_1 + 15 \times \text{stride}_2$. The compiler is responsible for translating that high-level, convenient array access into a sequence of low-level multiplications and additions in three-address code. This address calculation is the fundamental operation that underpins the performance of every major scientific computing and machine learning library, from NumPy to TensorFlow and PyTorch.
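The stride arithmetic can be spelled out directly. In this sketch, a row-major $20 \times 30 \times 40$ tensor of 8-byte elements is assumed (the shape and element size are invented for illustration):

```python
shape = (20, 30, 40)
itemsize = 8                                             # bytes per element
# Row-major strides: how many bytes one step along each axis moves.
strides = (shape[1] * shape[2] * itemsize,               # axis 0
           shape[2] * itemsize,                          # axis 1
           itemsize)                                     # axis 2

index = (5, 10, 15)                                      # data[5, 10, 15]
offset = sum(i * s for i, s in zip(index, strides))
print(offset)                                            # byte offset into the buffer
```

The TAC for this access is exactly the multiply-and-add sequence that `sum(i * s ...)` spells out, one instruction per axis.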

The Grand Finale: From Code to Silicon

After the code has been optimized and its logic correctly structured, TAC has one final role: to serve as the blueprint for the final machine instructions that will run on the physical processor. This is the crucial step of "code generation," and here too, TAC's structure is invaluable.

Processors have different "instruction set architectures" (ISAs)—different vocabularies for performing operations. A "three-address" ISA has instructions like $ADD(r_1, r_2, r_3)$, which means $r_1 \leftarrow r_2 + r_3$. This maps almost directly from a line of TAC. However, many common architectures, like x86 and ARM, use a "two-address" form like $ADD(r_1, r_2)$, which means $r_1 \leftarrow r_1 + r_2$. Notice the difference: one of the source registers is overwritten with the result. If the original value in $r_1$ was still needed for a later calculation, this would be a disaster. TAC, as a machine-independent representation, allows the compiler to navigate these differences intelligently. When targeting a two-address machine, it can analyze the liveness of variables. If the value in $r_1$ is "dead" (no longer needed), it can be safely overwritten. If it is "live," the compiler can either use an extra MOV instruction to save the value first, or it can cleverly exploit the commutativity of addition ($x+y = y+x$) to rewrite the operation so that it overwrites a register whose value is no longer needed.
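A toy version of that decision can be sketched as follows. The register names and the dead-set interface are inventions, and real compilers fold this into instruction selection and renaming; here, if a source register is dead it is clobbered directly (swapping the operands of the commutative add when needed), and otherwise a MOV preserves the live value:

```python
def lower_add(dst, src1, src2, dead):
    """Lower TAC `dst := src1 + src2` to two-address ADD(d, s), meaning d := d + s."""
    if src1 in dead:                       # src1's old value is unused: clobber it
        return [f"ADD {src1}, {src2}"]     # result lives in src1 (renamed to dst)
    if src2 in dead:                       # addition commutes: clobber src2 instead
        return [f"ADD {src2}, {src1}"]
    return [f"MOV {dst}, {src1}",          # both sources live: copy first,
            f"ADD {dst}, {src2}"]          # then add in place

print(lower_add("r1", "r2", "r3", dead={"r3"}))   # one instruction suffices
print(lower_add("r1", "r2", "r3", dead=set()))    # MOV + ADD needed
```

Liveness information, recovered from the TAC, is what makes the one-instruction case safe to emit.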

The final piece of the puzzle is managing the processor's most precious resource: registers. A CPU has a very small number of these ultra-fast memory locations. The compiler's most critical job is to orchestrate which variables reside in these registers at any given moment. This "register allocation" problem has a beautiful and surprising connection to an abstract field of mathematics: graph theory.

Starting from the TAC, the compiler performs liveness analysis to determine, at every point in the program, which variables hold values that will be used in the future. With this information, it constructs an "interference graph." Each variable is a node, and an edge is drawn between any two variables that are simultaneously live. The problem is now to assign a register to each variable such that no two connected variables are assigned the same register. This is precisely the famous "graph coloring" problem, where the registers are the colors. The compiler, guided by TAC, transforms a question of code efficiency into a question of coloring a mathematical graph, a stunning example of the deep unity of computer science and pure mathematics.
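A greedy coloring of a small interference graph illustrates the final step. The graph below is invented: `a`, `b`, and `c` are pairwise simultaneously live, while `d` never overlaps with any of them, so it can share a register. (Production allocators use more careful orderings and spill to memory when colors run out.)

```python
def color(graph):
    """Greedily assign each node the lowest color unused by its neighbors."""
    assignment = {}
    for node in sorted(graph):                 # a fixed, simple visit order
        taken = {assignment[n] for n in graph[node] if n in assignment}
        assignment[node] = next(c for c in range(len(graph)) if c not in taken)
    return assignment

interference = {
    "a": {"b", "c"},
    "b": {"a", "c"},
    "c": {"a", "b"},
    "d": set(),          # d is never live alongside the others
}
print(color(interference))
```

The triangle `a`-`b`-`c` forces three distinct colors (registers), while `d` happily reuses the first one, exactly the reuse register allocation is after.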

The Elegance of the Intermediate

From enabling simple arithmetic shortcuts to ensuring the correctness of complex scientific simulations and bridging the gap to pure mathematics, three-address code is far more than a technical implementation detail. It is a powerful, flexible abstraction. It decouples the programmer's intent from the machine's constraints, creating a stable, structured world where logic and algebra can be applied to transform our code into something not just correct, but efficient and elegant. It is the quiet, unseen architect shaping the performance of the digital world.