CWE-409: Improper Handling of Highly Compressed Data (Data Amplification)

Definition

What is CWE-409?

This vulnerability occurs when software fails to safely process highly compressed data, where a small input file can trigger the creation of an extremely large amount of data during decompression, overwhelming system resources.

Often called a 'decompression bomb' or 'zip bomb,' this attack exploits the extreme compression ratios possible with formats like ZIP, XML, or PDF. A malicious actor can craft a tiny, harmless-looking file that, when processed by your application, expands to consume gigabytes of memory or disk space, leading to denial of service, crashes, or performance degradation. To prevent this, developers must implement security controls before decompression. This includes setting strict limits on the compression ratio, checking the uncompressed size from file headers before allocating memory, and using streaming decompression with quotas instead of loading entire outputs into memory at once. Treating all compressed input as untrusted and validating its potential impact is a critical step in secure file handling.

Real-world impact

Real-world CVEs caused by CWE-409

CVE-2009-1955

XML bomb in web server module
CVE-2003-1564

Parsing library allows XML bomb

How attackers exploit it

Step-by-step attacker path

1
Identify a code path that handles untrusted input without validation.
2
Craft a payload that exercises the unsafe behavior — injection, traversal, overflow, or logic abuse.
3
Deliver the payload through a normal request and observe the application's reaction.
4
Iterate until the response leaks data, executes attacker code, or escalates privileges.

Vulnerable code example

Vulnerable XML

The DTD and the very brief XML below illustrate what is meant by an XML bomb. The ZERO entity contains one character, the letter A. The choice of entity name ZERO is being used to indicate length equivalent to that exponent on two, that is, the length of ZERO is 2^0. Similarly, ONE refers to ZERO twice, therefore the XML parser will expand ONE to a length of 2, or 2^1. Ultimately, we reach entity THIRTYTWO, which will expand to 2^32 characters in length, or 4 GB, probably consuming far more data than expected.

Vulnerable XML

<?xml version="1.0"?>
  <!DOCTYPE MaliciousDTD [
  <!ENTITY ZERO "A">
  <!ENTITY ONE "&ZERO;&ZERO;">
  <!ENTITY TWO "&ONE;&ONE;">
  ...
  <!ENTITY THIRTYTWO "&THIRTYONE;&THIRTYONE;">
  ]>
  <data>&THIRTYTWO;</data>

Attacker payload

The DTD and the very brief XML below illustrate what is meant by an XML bomb. The ZERO entity contains one character, the letter A. The choice of entity name ZERO is being used to indicate length equivalent to that exponent on two, that is, the length of ZERO is 2^0. Similarly, ONE refers to ZERO twice, therefore the XML parser will expand ONE to a length of 2, or 2^1. Ultimately, we reach entity THIRTYTWO, which will expand to 2^32 characters in length, or 4 GB, probably consuming far more data than expected.

Attacker payload XML

<?xml version="1.0"?>
  <!DOCTYPE MaliciousDTD [
  <!ENTITY ZERO "A">
  <!ENTITY ONE "&ZERO;&ZERO;">
  <!ENTITY TWO "&ONE;&ONE;">
  ...
  <!ENTITY THIRTYTWO "&THIRTYONE;&THIRTYONE;">
  ]>
  <data>&THIRTYTWO;</data>

Secure code example

Secure pseudo

// Validate, sanitize, or use a safe API before reaching the sink.
function handleRequest(input) {
  const safe = validateAndEscape(input);
  return executeWithGuards(safe);
}

What changed: the unsafe sink is replaced (or the input is validated/escaped) so the same payload no longer triggers the weakness.

Prevention checklist

How to prevent CWE-409

Architecture Use safe-by-default frameworks and APIs that prevent the unsafe pattern from being expressible.
Implementation Validate input at trust boundaries; use allowlists, not denylists.
Implementation Apply the principle of least privilege to credentials, file paths, and runtime permissions.
Testing Cover this weakness in CI: SAST rules + targeted unit tests for the data flow.
Operation Monitor logs for the runtime signals listed in the next section.

Detection signals

How to detect CWE-409

SAST High

Run static analysis (SAST) on the codebase looking for the unsafe pattern in the data flow.

DAST Moderate

Run dynamic application security testing against the live endpoint.

Runtime Moderate

Watch runtime logs for unusual exception traces, malformed input, or authorization bypass attempts.

Code review Moderate

Code review: flag any new code that handles input from this surface without using the validated framework helpers.

Frequently asked questions

What is CWE-409?

This vulnerability occurs when software fails to safely process highly compressed data, where a small input file can trigger the creation of an extremely large amount of data during decompression, overwhelming system resources.

How serious is CWE-409?

MITRE has not published a likelihood-of-exploit rating for this weakness. Treat it as medium-impact until your threat model proves otherwise.

What languages or platforms are affected by CWE-409?

MITRE has not specified affected platforms for this CWE — it can apply across most application stacks.

How can I prevent CWE-409?

Use safe-by-default frameworks, validate untrusted input at trust boundaries, and apply the principle of least privilege. Cover the data-flow signature in CI with SAST.

How does Plexicus detect and fix CWE-409?

Plexicus's SAST engine matches the data-flow signature for CWE-409 on every commit. When a match is found, our Codex Remedium agent opens a fix PR with the corrected code, tests, and a one-line summary for the reviewer.

Where can I learn more about CWE-409?

MITRE publishes the canonical definition at https://cwe.mitre.org/data/definitions/409.html. You can also reference OWASP and NIST documentation for adjacent guidance.

Related weaknesses

Weaknesses related to CWE-409

CWE-405 Parent

Don't Let Security
Weigh You Down.

Stop choosing between AI velocity and security debt. Plexicus is the only platform that runs Vibe Coding Security and ASPM in parallel — one workflow, every codebase.

Get started free Book a demo

Improper Handling of Highly Compressed Data (Data Amplification)

What is CWE-409?

Real-world CVEs caused by CWE-409

Step-by-step attacker path

Vulnerable XML

Secure pseudo

How to prevent CWE-409

How to detect CWE-409

Plexicus auto-detects CWE-409 and opens a fix PR in under 60 seconds.

Frequently asked questions

Weaknesses related to CWE-409

Asymmetric Resource Consumption (Amplification)

Excessive Platform Resource Consumption within a Loop

Data Resource Access without Use of Connection Pooling

Non-SQL Invokable Control Element with Excessive Number of Data Resource Accesses

Invokable Control Element with Excessive File or Data Access Operations

Large Data Table with Excessive Number of Indices

Excessive Index Range Scan for a Data Resource

Inefficient CPU Computation

Insufficient Control of Network Message Volume (Network Amplification)

Further reading

Don't Let Security
Weigh You Down.

Improper Handling of Highly Compressed Data (Data Amplification)

What is CWE-409?

Real-world CVEs caused by CWE-409

Step-by-step attacker path

Vulnerable XML

Secure pseudo

How to prevent CWE-409

How to detect CWE-409

Plexicus auto-detects CWE-409 and opens a fix PR in under 60 seconds.

Frequently asked questions

Weaknesses related to CWE-409

Asymmetric Resource Consumption (Amplification)

Excessive Platform Resource Consumption within a Loop

Data Resource Access without Use of Connection Pooling

Non-SQL Invokable Control Element with Excessive Number of Data Resource Accesses

Invokable Control Element with Excessive File or Data Access Operations

Large Data Table with Excessive Number of Indices

Excessive Index Range Scan for a Data Resource

Inefficient CPU Computation

Insufficient Control of Network Message Volume (Network Amplification)

Further reading

Don't Let SecurityWeigh You Down.

Don't Let Security
Weigh You Down.