CWE-502: Deserialization of Untrusted Data

Definition

What is CWE-502?

This vulnerability occurs when an application accepts and processes serialized data from an untrusted source without proper validation, allowing an attacker to manipulate the data to execute malicious code or cause unexpected behavior.

Serialization is a common technique for converting complex objects into a storable or transmittable format, like JSON or binary streams. The danger arises when an application blindly trusts this serialized data from external sources—such as user inputs, cookies, or network packets—and reconstructs (deserializes) it back into live objects. Without strict controls, an attacker can craft malicious payloads that, when deserialized, trigger dangerous actions like remote code execution, file system access, or denial-of-service attacks. To prevent this, developers should treat all serialized data from outside the application's trust boundary as inherently hostile. Effective defenses include using safe, data-only serialization formats (like plain JSON without code execution features), implementing integrity checks with digital signatures, employing strict type constraints during deserialization, and utilizing security tools that monitor and validate deserialization processes. Never rely on obscurity or client-side checks, as attackers can easily bypass them.

Real-world impact

Real-world CVEs caused by CWE-502

CVE-2024-37052

insecure deserialization in platform for managing AI/ML applications and models allows code execution via a crafted pickled object in a model file
CVE-2024-37288

deserialization of untrusted YAML data in dashboard for data query and visualization of Elasticsearch data
CVE-2024-9314

PHP object injection in WordPress plugin for AI-based SEO
CVE-2019-12799

chain: bypass of untrusted deserialization issue (CWE-502) by using an assumed-trusted class (CWE-183)
CVE-2015-8103

Deserialization issue in commonly-used Java library allows remote execution.
CVE-2015-4852

Deserialization issue in commonly-used Java library allows remote execution.
CVE-2013-1465

Use of PHP unserialize function on untrusted input allows attacker to modify application configuration.
CVE-2012-3527

Use of PHP unserialize function on untrusted input in content management system might allow code execution.

How attackers exploit it

Step-by-step attacker path

1
This code snippet deserializes an object from a file and uses it as a UI button:
2
This code does not attempt to verify the source or contents of the file before deserializing it. An attacker may be able to replace the intended file with a file that contains arbitrary malicious code which will be executed when the button is pressed.
3
To mitigate this, explicitly define final readObject() to prevent deserialization. An example of this is:
4
In Python, the Pickle library handles the serialization and deserialization processes. In this example derived from [REF-467], the code receives and parses data, and afterwards tries to authenticate a user based on validating a token.
5
Unfortunately, the code does not verify that the incoming data is legitimate. An attacker can construct a illegitimate, serialized object "AuthToken" that instantiates one of Python's subprocesses to execute arbitrary commands. For instance,the attacker could construct a pickle that leverages Python's subprocess module, which spawns new processes and includes a number of arguments for various uses. Since Pickle allows objects to define the process for how they should be unpickled, the attacker can direct the unpickle process to call Popen in the subprocess module and execute /bin/sh.

Vulnerable code example

Vulnerable Java

This code snippet deserializes an object from a file and uses it as a UI button:

Vulnerable Java

try {
  	File file = new File("object.obj");
  	ObjectInputStream in = new ObjectInputStream(new FileInputStream(file));
  	javax.swing.JButton button = (javax.swing.JButton) in.readObject();
  	in.close();
  }

Secure code example

Secure Java

To mitigate this, explicitly define final readObject() to prevent deserialization. An example of this is:

Secure Java

private final void readObject(ObjectInputStream in) throws java.io.IOException {
  throw new java.io.IOException("Cannot be deserialized"); }

What changed: the unsafe sink is replaced (or the input is validated/escaped) so the same payload no longer triggers the weakness.

Prevention checklist

How to prevent CWE-502

Architecture and Design / Implementation If available, use the signing/sealing features of the programming language to assure that deserialized data has not been tainted. For example, a hash-based message authentication code (HMAC) could be used to ensure that data has not been modified.
Implementation When deserializing data, populate a new object rather than just deserializing. The result is that the data flows through safe input validation and that the functions are safe.
Implementation Explicitly define a final object() to prevent deserialization.
Architecture and Design / Implementation Make fields transient to protect them from deserialization. An attempt to serialize and then deserialize a class containing transient fields will result in NULLs where the transient data should be. This is an excellent way to prevent time, environment-based, or sensitive variables from being carried over and used improperly.
Implementation Avoid having unnecessary types or gadgets (a sequence of instances and method invocations that can self-execute during the deserialization process, often found in libraries) available that can be leveraged for malicious ends. This limits the potential for unintended or unauthorized types and gadgets to be leveraged by the attacker. Add only acceptable classes to an allowlist. Note: new gadgets are constantly being discovered, so this alone is not a sufficient mitigation.
Architecture and Design / Implementation Employ cryptography of the data or code for protection. However, it's important to note that it would still be client-side security. This is risky because if the client is compromised then the security implemented on the client (the cryptography) can be bypassed.
Operation Use an application firewall that can detect attacks against this weakness. It can be beneficial in cases in which the code cannot be fixed (because it is controlled by a third party), as an emergency prevention measure while more comprehensive software assurance measures are applied, or to provide defense in depth [REF-1481].

Detection signals

How to detect CWE-502

Automated Static Analysis High

Automated static analysis, commonly referred to as Static Application Security Testing (SAST), can find some instances of this weakness by analyzing source code (or binary/compiled code) without having to execute it. Typically, this is done by building a model of data flow and control flow, then searching for potentially-vulnerable patterns that connect "sources" (origins of input) with "sinks" (destinations where the data interacts with external components, a lower layer such as the OS, etc.)

Frequently asked questions

What is CWE-502?

This vulnerability occurs when an application accepts and processes serialized data from an untrusted source without proper validation, allowing an attacker to manipulate the data to execute malicious code or cause unexpected behavior.

How serious is CWE-502?

MITRE rates the likelihood of exploit as Medium — exploitation is realistic but typically requires specific conditions.

What languages or platforms are affected by CWE-502?

MITRE lists the following affected platforms: Java, Ruby, PHP, Python, JavaScript, ICS/OT.

How can I prevent CWE-502?

If available, use the signing/sealing features of the programming language to assure that deserialized data has not been tainted. For example, a hash-based message authentication code (HMAC) could be used to ensure that data has not been modified. When deserializing data, populate a new object rather than just deserializing. The result is that the data flows through safe input validation and that the functions are safe.

How does Plexicus detect and fix CWE-502?

Plexicus's SAST engine matches the data-flow signature for CWE-502 on every commit. When a match is found, our Codex Remedium agent opens a fix PR with the corrected code, tests, and a one-line summary for the reviewer.

Where can I learn more about CWE-502?

MITRE publishes the canonical definition at https://cwe.mitre.org/data/definitions/502.html. You can also reference OWASP and NIST documentation for adjacent guidance.

Related weaknesses

Weaknesses related to CWE-502

CWE-913 Parent

Don't Let Security
Weigh You Down.

Stop choosing between AI velocity and security debt. Plexicus is the only platform that runs Vibe Coding Security and ASPM in parallel — one workflow, every codebase.

Deserialization of Untrusted Data

What is CWE-502?

Real-world CVEs caused by CWE-502

Step-by-step attacker path

Vulnerable Java

Secure Java

How to prevent CWE-502

How to detect CWE-502

Plexicus auto-detects CWE-502 and opens a fix PR in under 60 seconds.

Frequently asked questions

Weaknesses related to CWE-502

Improper Control of Dynamically-Managed Code Resources

Improperly Controlled Modification of Object Prototype Attributes ('Prototype Pollution')

Use of Externally-Controlled Input to Select Classes or Code ('Unsafe Reflection')

Improper Control of Dynamically-Identified Variables

Improperly Controlled Modification of Dynamically-Determined Object Attributes

Improper Control of Generation of Code ('Code Injection')

Further reading

Don't Let Security
Weigh You Down.

Deserialization of Untrusted Data

What is CWE-502?

Real-world CVEs caused by CWE-502

Step-by-step attacker path

Vulnerable Java

Secure Java

How to prevent CWE-502

How to detect CWE-502

Plexicus auto-detects CWE-502 and opens a fix PR in under 60 seconds.

Frequently asked questions

Weaknesses related to CWE-502

Improper Control of Dynamically-Managed Code Resources

Improperly Controlled Modification of Object Prototype Attributes ('Prototype Pollution')

Use of Externally-Controlled Input to Select Classes or Code ('Unsafe Reflection')

Improper Control of Dynamically-Identified Variables

Improperly Controlled Modification of Dynamically-Determined Object Attributes

Improper Control of Generation of Code ('Code Injection')

Further reading

Don't Let SecurityWeigh You Down.

Don't Let Security
Weigh You Down.