In 2009, a vulnerability was discovered in the Linux kernel that allowed privilege escalation. The code looked perfectly reasonable—a null pointer check designed to prevent crashes. But when compiled with optimization enabled, the check simply vanished. The compiler had every right to delete it. The code contained undefined behavior, and undefined behavior means the compiler can do whatever it wants.

This wasn’t a compiler bug. It was the compiler doing exactly what the C standard allows it to do. Understanding this distinction is crucial for anyone writing systems code in C or C++.

What Undefined Behavior Actually Means

The C and C++ standards define three categories of implementation-specific behavior, each with different guarantees:

Undefined behavior (UB) places no requirements on the implementation. If a program executes undefined behavior, the standard imposes no constraints on what happens. The compiler is free to generate code that crashes, produces incorrect results, or—most insidiously—appears to work correctly until it doesn’t.

Implementation-defined behavior requires the implementation to document and consistently choose one of several possible behaviors. The result may vary between compilers, but it will be consistent within a given compiler.

Unspecified behavior allows the implementation to choose among several valid options, but it need not document which one it picks or even be consistent about it.

Consider signed integer overflow. In C, the result of INT_MAX + 1 is undefined behavior. The C standard doesn’t specify what should happen because different CPU architectures handled overflow differently when C was standardized. Some wrapped around using two’s complement arithmetic. Others trapped. Still others produced unpredictable results.

This historical accident gave compilers a powerful optimization tool. If signed overflow is undefined, the compiler can assume it never happens. This assumption unlocks transformations that would otherwise be impossible.

Why Undefined Behavior Exists

The C language was designed in the 1970s for the PDP-11, a 16-bit minicomputer. Its creators wanted C to be “portable assembly”—efficient enough to replace hand-written assembly while working across different hardware architectures.

Different CPUs had different semantics for the same operations. A left shift by 32 bits on a 32-bit value produces 0 on ARM but preserves bits on x86. Division by zero traps on x86 but silently returns undefined results on PowerPC. Signed overflow wraps on x86 but traps on some DSPs.

Rather than mandating one behavior (which would require expensive software emulation on some architectures), the C standard left these cases undefined. The compiler could generate a single instruction matching the hardware’s behavior, and programmers who wanted portable code would simply avoid these edge cases.

This design choice made C efficient, but it also created a fundamental tension. As compilers became more sophisticated, they began exploiting undefined behavior in ways the original language designers never anticipated.

How Compilers Exploit Undefined Behavior

Modern compilers don’t just passively allow undefined behavior—they actively exploit it for optimization. The reasoning goes: since undefined behavior cannot occur in a correct program, the compiler can assume it never occurs and optimize accordingly.

Null Pointer Dereference Optimization

When the compiler sees a pointer dereference, it can infer that the pointer is non-null:

unsigned int process(struct tun_struct *tun) {
    struct sock *sk = tun->sk;  // Dereference implies tun != NULL
    if (!tun)  // This check is now "provably false"
        return POLLERR;
    // ... rest of function
}

This exact pattern existed in the Linux kernel (CVE-2009-1897). The compiler, seeing that tun was dereferenced on line 2, concluded that tun must be non-null. The check on line 3 was optimized away as dead code. If tun was actually null, the code would crash at the dereference—but if page zero was mapped, execution would continue past the eliminated check, creating an exploitable vulnerability.

Signed Integer Overflow Assumptions

Because signed integer overflow is undefined, compilers assume it never happens:

int check_overflow(int x) {
    return x + 100 < x;  // "Obviously" false if no overflow
}

At -O2, GCC and Clang both optimize this to simply return 0. The compiler reasons: if x + 100 overflowed, that would be undefined behavior, which can’t happen; therefore x + 100 >= x is always true.

This optimization once caused a heated debate on the GCC mailing list. Developers argued that the compiler was “broken” for removing their overflow checks. Compiler writers responded that the code was broken for relying on undefined behavior.

Pointer Overflow Checks

A common pattern for bounds checking:

if (buf + len < buf)  // Check for pointer wraparound
    return -1;  // len is too large

This check assumes pointers wrap around on overflow. But the C standard says pointer overflow is undefined behavior. Modern compilers will optimize this check to if (len < 0) for signed len, or eliminate it entirely for unsigned len.

The Chromium browser, Python interpreter, and Linux kernel all contained variants of this pattern. The checks worked when compiled without optimization, then silently failed when optimization was enabled.

Loop Optimization via UB Assumptions

Undefined behavior enables aggressive loop optimizations:

for (int i = 0; i <= n; i++) {
    a[i] = 42;
}

When n is INT_MAX, i would eventually overflow. Since signed overflow is undefined, the compiler may assume it never happens, which implies n < INT_MAX and that the loop runs exactly n + 1 times. A known trip count unlocks transformations such as vectorization or rewriting the loop bounds entirely.

The Performance Question: Is UB Worth It?

A comprehensive study published at PLDI 2025 by Popescu and Lopes finally answered the question that had long been debated: does exploiting undefined behavior actually improve performance?

The researchers modified LLVM to selectively disable each category of UB exploitation, then benchmarked 24 real-world applications including LLVM itself, Z3, SQLite, OpenSSL, and various encoding libraries.

The results were surprising: the performance gains from exploiting UB are minimal. In the cases evaluated, end-to-end performance impacts were generally within noise margins. When performance did regress, it could often be recovered through link-time optimization or small compiler improvements.

Some specific findings:

  • Disabling null pointer check removal caused at most 3.2% slowdown (pbzip2)
  • Disabling signed overflow assumptions caused at most 2.1% slowdown (zstd)
  • Some benchmarks actually improved when UB exploitation was disabled

The study concluded that while UB enables certain optimizations, the performance benefits are modest and often achievable through other means. This challenges the conventional wisdom that UB exploitation is essential for C’s performance.

Security Implications: Optimization-Unstable Code

Researchers at MIT coined the term “optimization-unstable code” to describe code that works without optimization but breaks when optimization is enabled. Using their STACK tool, they analyzed 8,575 Debian packages and found that 40% contained optimization-unstable code.

The security implications are severe:

  1. Safety checks disappear: Bounds checks, null checks, and overflow checks can be silently eliminated
  2. Testing becomes unreliable: Code that passes tests in debug builds fails in release builds
  3. Compiler evolution creates time bombs: As compilers become more aggressive, previously safe code becomes vulnerable

A notable example from FFmpeg:

unsigned int size = bytestream_get_be16(&data);
if (data + size >= data_end || data + size < data)
    return -1;  // Bounds check
data += size;

The check data + size < data was intended to catch pointer wraparound. GCC optimizes this to size < 0, which never triggers for unsigned size. A malicious input with large size could bypass the bounds check entirely.

The Three Categories of UB Bugs

Understanding how UB manifests helps identify and fix problematic code:

Non-optimization Bugs

These bugs cause problems regardless of optimization level. Example: dereferencing null pointers. Even if the compiler doesn’t optimize away your checks, dereferencing null is already a crash.

Urgent Optimization Bugs

These bugs only manifest when optimization is enabled. The code works at -O0 but breaks at -O2. The Linux kernel null pointer vulnerability falls into this category.

Time Bombs

Code that currently works but will break as compilers evolve. As compilers implement more aggressive optimizations based on UB assumptions, previously safe code becomes vulnerable. The STACK researchers found such time bombs in PostgreSQL: overflow checks that production compilers currently preserve but that more aggressive research compilers already eliminate.

LLVM’s Approach to Undefined Behavior

LLVM, the compiler infrastructure behind Clang, has evolved a sophisticated model for undefined behavior. It distinguishes between:

  • Deferred UB (undef and poison values): Operations that produce undefined results but can still be executed. This enables speculative execution of potentially undefined operations.
  • Immediate UB: Operations whose execution has no defined meaning at all, like division by zero or dereferencing a null pointer. Once one executes, the standard places no constraints on anything that follows.

LLVM’s optimizer uses these distinctions to enable aggressive optimization while providing some safety guarantees. For example, stores to null pointers are converted to trap instructions rather than being silently deleted, making debugging easier.

The compiler also provides flags to disable specific UB exploitations:

  • -fwrapv: Define signed overflow as wrapping
  • -fno-delete-null-pointer-checks: Keep null checks even if they appear redundant
  • -fno-strict-aliasing: Disable type-based alias analysis
  • -fno-strict-overflow: Assume signed overflow is possible

These flags trade performance for predictability, though they don’t provide complete protection against all UB-related issues.

Writing Safer Code

Several practices can help avoid UB-related bugs:

Use Explicit Overflow Checking

Instead of:

if (x + 100 < x)  // UB if overflow occurs

Use:

if (x > INT_MAX - 100)  // Well-defined comparison

Or use compiler builtins:

if (__builtin_add_overflow(x, 100, &result))
    // Handle overflow

Check Before Dereference

Always check pointers before dereferencing:

if (ptr) {
    int value = ptr->field;  // Safe
}

Not after:

int value = ptr->field;  // UB if null
if (!ptr)  // Already too late
    return;

Use Size-Aware Types

When loop bounds matter:

for (size_t i = 0; i <= n; i++)  // unsigned wraparound is well-defined

Instead of:

for (int i = 0; i <= n; i++)  // signed overflow is UB if n is large

Enable Sanitizers

AddressSanitizer (ASan), UndefinedBehaviorSanitizer (UBSan), and MemorySanitizer can catch many UB-related bugs at runtime. They impose a performance penalty but are invaluable during testing and fuzzing.

The Future of Undefined Behavior

The programming language community is grappling with undefined behavior’s costs and benefits. C++ is actively considering proposals to reduce UB:

  • P2723: Zero-initialize automatic storage duration objects
  • P2809: Make trivial infinite loops well-defined
  • P2795: Introduce “erroneous behavior” as a middle ground between undefined and well-defined

Meanwhile, languages like Rust demonstrate that systems programming can be memory-safe without relying on undefined behavior. Rust’s ownership system prevents null pointer dereferences, buffer overflows, and data races at compile time—eliminating entire classes of UB at the cost of a steeper learning curve.

The Fundamental Trade-off

Undefined behavior represents a fundamental trade-off in language design: performance versus predictability. C chose performance, trusting programmers to avoid undefined constructs. This choice enabled C’s success as a systems language but created a perpetual source of subtle bugs and security vulnerabilities.

The PLDI 2025 study’s finding that UB exploitation provides minimal performance benefits suggests this trade-off may not be as necessary as once thought. Modern CPUs with out-of-order execution and deep pipelines can often absorb the cost of extra instructions that avoid UB.

For new projects, languages with stronger safety guarantees may offer a better trade-off. For existing C/C++ codebases, understanding undefined behavior is essential for writing correct and secure code.

The next time you write a bounds check that “obviously” works, remember: the compiler’s definition of “obviously” may differ from yours. What you see as a safety check, the compiler may see as dead code. In the world of undefined behavior, intuition is not a reliable guide—only a deep understanding of the language specification can save you from your own code.


References

  1. Popescu, L. & Lopes, N. (2025). “Exploiting Undefined Behavior in C/C++ Programs for Optimization: A Study on the Performance Impact.” PLDI 2025.
  2. Wang, X., Zeldovich, N., Kaashoek, M.F. & Solar-Lezama, A. (2013). “Towards Optimization-Safe Systems: Analyzing the Impact of Undefined Behavior.” SOSP 2013.
  3. Lattner, C. (2011). “What Every C Programmer Should Know About Undefined Behavior.” LLVM Blog.
  4. Regehr, J. (2012). “It’s Time to Get Serious About Exploiting Undefined Behavior.”
  5. ISO/IEC 9899:2011. “Programming Languages - C.”
  6. Krebbers, R. & Wiedijk, F. (2015). “A Typed C11 Semantics for Interactive Theorem Proving.” CPP 2015.
  7. Lee, J. et al. (2018). “Reconciling High-Level Optimizations and Low-Level Code in LLVM.” OOPSLA 2018.
  8. Dietz, W. et al. (2015). “Understanding Integer Overflow in C/C++.” ACM TOSEM.