CVE-2024-14021: CWE-502 Deserialization of Untrusted Data in run-llama llama_index

Severity: high

Type: vulnerability

CVE: CVE-2024-14021

LlamaIndex (run-llama/llama_index) versions up to and including 0.11.6 contain an unsafe deserialization vulnerability in BGEM3Index.load_from_disk() in llama_index/indices/managed/bge_m3/base.py. The function uses pickle.load() to deserialize multi_embed_store.pkl from a user-supplied persist_dir without validation. An attacker who can provide a crafted persist directory containing a malicious pickle file can trigger arbitrary code execution when the victim loads the index from disk.

AI Analysis

Technical Summary

CVE-2024-14021 is a deserialization of untrusted data vulnerability (CWE-502) in the run-llama project's llama_index library, affecting versions up to and including 0.11.6. The flaw is in BGEM3Index.load_from_disk(), defined in llama_index/indices/managed/bge_m3/base.py, which calls Python's pickle.load() on a file named multi_embed_store.pkl inside a user-supplied persist_dir. Because unpickling can invoke arbitrary callables named in the pickle stream, an attacker who controls the contents of persist_dir can place a crafted pickle file there and gain arbitrary code execution on the victim's system when the index is loaded. Exploitation requires no prior authentication, but the attacker must be able to supply or influence the persist directory and the victim must load the index, so some user interaction is needed. The CVSS 4.0 score is 8.4 (high severity), reflecting high potential impact on confidentiality, integrity, and availability. No patches or fixes are currently linked, and no exploits have been reported in the wild. The vulnerability matters most where llama_index loads persisted embeddings or indexes from disk, as in AI and machine learning workflows.
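To make the mechanism concrete, the following minimal sketch shows why unpickling attacker-controlled bytes amounts to running attacker-chosen code. It does not use llama_index at all; the class name Marker and the harmless eval("6 * 7") payload are illustrative assumptions standing in for whatever a real malicious multi_embed_store.pkl might contain.

```python
import pickle


class Marker:
    """Illustrative stand-in for attacker-controlled data (harmless payload)."""

    def __reduce__(self):
        # Tells pickle: "to rebuild this object, call eval('6 * 7')".
        # A real attacker would name a far more dangerous callable here.
        return (eval, ("6 * 7",))


# The attacker prepares the bytes ahead of time...
payload = pickle.dumps(Marker())

# ...and the victim merely loads them. The callable runs inside loads(),
# before the caller has any chance to inspect what was returned.
result = pickle.loads(payload)
```

After loading, result holds the value produced by the embedded call rather than a Marker instance, which shows that the pickle stream, not the loading code, decided what was executed.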

Potential Impact

For European organizations, this vulnerability poses a significant risk if they use the vulnerable versions of llama_index in their AI, data indexing, or machine learning pipelines. Successful exploitation could lead to arbitrary code execution, allowing attackers to compromise confidentiality by accessing sensitive data, integrity by modifying or corrupting data, and availability by disrupting services or deleting data. This could result in data breaches, operational disruptions, or further lateral movement within networks. Organizations in sectors such as finance, healthcare, research, and critical infrastructure that rely on AI tools and data indexing are particularly at risk. The requirement for supplying a malicious persist directory limits remote exploitation but does not eliminate risk in environments where untrusted data sources or shared storage are used. The absence of known exploits suggests a window for proactive mitigation before active attacks occur.
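Where persisted indexes arrive over shared storage or from other untrusted sources, a pickle file can be triaged without being loaded. The sketch below uses the standard library's pickletools.genops() to list opcodes that import or call objects. The helper name risky_opcodes and the opcode set are assumptions chosen for illustration. This is a triage aid only: a legitimate store holding non-primitive objects would likely trigger the same opcodes, so a hit means "review before loading", not "malicious".

```python
import pickle
import pickletools

# Opcodes that import globals or construct/call objects during unpickling.
# The exact set is an illustrative assumption, not an authoritative list.
CALL_OR_IMPORT_OPCODES = {
    "GLOBAL", "STACK_GLOBAL", "INST", "OBJ",
    "NEWOBJ", "NEWOBJ_EX", "REDUCE", "BUILD",
}


def risky_opcodes(data):
    """Return the names of import/call opcodes found in a pickle stream.

    The stream is only parsed, never executed, so this is safe to run on
    untrusted bytes.
    """
    return {
        opcode.name
        for opcode, _arg, _pos in pickletools.genops(data)
        if opcode.name in CALL_OR_IMPORT_OPCODES
    }


class _Demo:
    """Harmless example of an object whose pickle calls a function on load."""

    def __reduce__(self):
        return (eval, ("1 + 1",))


plain = risky_opcodes(pickle.dumps({"vectors": [0.1, 0.2, 0.3]}))
crafted = risky_opcodes(pickle.dumps(_Demo()))
```

Here plain comes back empty because dictionaries, lists, strings, and floats need no imports, while crafted contains REDUCE, the opcode that performs the call.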

Mitigation Recommendations

European organizations should immediately audit their use of the llama_index library and identify any deployments running versions up to and including 0.11.6. Until a patch is available, they should avoid loading indexes from untrusted or user-supplied persist directories. Where a directory path comes from user input, validate it strictly, for example by resolving it and confirming it lies inside a trusted location; note that a pickle file cannot be made safe by inspecting it after it has been loaded. Consider replacing pickle-based deserialization with a data-only format such as JSON, or another serialization format that cannot trigger code execution on load. Employ application-level controls to restrict which users or processes can supply or modify persist directories. Use endpoint protection and monitoring to detect anomalous file modifications or suspicious process executions related to llama_index. Maintain network segmentation to limit the impact of potential exploitation. Monitor vendor advisories for patches or updates addressing this vulnerability and apply them promptly once released.
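Two of the recommendations above can be sketched with the standard library alone: confining persist directories to a trusted root, and restricting which globals an unpickler may resolve (the pattern described in the Python pickle documentation). Everything named here (resolve_trusted_dir, ALLOWED_GLOBALS, RestrictedUnpickler, restricted_loads) is hypothetical and is not part of llama_index or any official fix. Whether a real multi_embed_store.pkl would load under such a narrow allowlist is unknown, and widening the allowlist needs careful review.

```python
import io
import pickle
from pathlib import Path

# Hypothetical allowlist: the only (module, name) pairs that may be imported.
ALLOWED_GLOBALS = {
    ("builtins", "set"),
    ("builtins", "frozenset"),
    ("builtins", "complex"),
}


class RestrictedUnpickler(pickle.Unpickler):
    """Unpickler that refuses any global outside ALLOWED_GLOBALS."""

    def find_class(self, module, name):
        if (module, name) in ALLOWED_GLOBALS:
            return super().find_class(module, name)
        raise pickle.UnpicklingError(
            f"global '{module}.{name}' is forbidden"
        )


def restricted_loads(data):
    """Deserialize bytes, rejecting pickles that import non-allowlisted names."""
    return RestrictedUnpickler(io.BytesIO(data)).load()


def resolve_trusted_dir(persist_dir, trusted_root):
    """Return the resolved persist_dir, or raise if it escapes trusted_root.

    Resolving first collapses '..' segments and follows symlinks, so the
    containment check is made on the real location.
    """
    root = Path(trusted_root).resolve()
    candidate = Path(persist_dir).resolve()
    try:
        candidate.relative_to(root)
    except ValueError:
        raise ValueError(
            f"{candidate} is outside trusted root {root}"
        ) from None
    return candidate
```

The path check only narrows who can supply the file; it does not make the file's contents safe, which is why it is paired with the restricted unpickler. Where the stored data allows it, a data-only format removes the need for either.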

Affected Countries

Germany, France, United Kingdom, Netherlands, Sweden, Finland

Published: Mon Jan 12 2026 (01/12/2026, 23:04:43 UTC)

Source: CVE Database V5

Vendor/Project: run-llama

Product: llama_index

Technical Details

Data Version: 5.2
Assigner Short Name: VulnCheck
Date Reserved: 2026-01-09T20:42:56.495Z
CVSS Version: 4.0
State: PUBLISHED

Threat ID: 69658281da2266e838450d16

Added to database: 1/12/2026, 11:23:45 PM

Last enriched: 1/12/2026, 11:38:32 PM

Last updated: 2/6/2026, 10:48:51 AM
