Skip to main content
Press slash or control plus K to focus the search. Use the arrow keys to navigate results and press enter to open a threat.
Reconnecting to live updates…

CVE-2025-14009: CWE-94 Improper Control of Generation of Code in nltk nltk/nltk

0
Critical
VulnerabilityCVE-2025-14009cvecve-2025-14009cwe-94
Published: Wed Feb 18 2026 (02/18/2026, 17:45:17 UTC)
Source: CVE Database V5
Vendor/Project: nltk
Product: nltk/nltk

Description

A critical vulnerability exists in the NLTK downloader component of nltk/nltk, affecting all versions. The _unzip_iter function in nltk/downloader.py uses zipfile.extractall() without performing path validation or security checks. This allows attackers to craft malicious zip packages that, when downloaded and extracted by NLTK, can execute arbitrary code. The vulnerability arises because NLTK assumes all downloaded packages are trusted and extracts them without validation. If a malicious package contains Python files, such as __init__.py, these files are executed automatically upon import, leading to remote code execution. This issue can result in full system compromise, including file system access, network access, and potential persistence mechanisms.

AI-Powered Analysis

Machine-generated threat intelligence

AILast updated: 02/27/2026, 08:12:06 UTC

Technical Analysis

CVE-2025-14009 is a critical vulnerability in the Natural Language Toolkit (NLTK), a widely used Python library for natural language processing. The flaw exists in the downloader component, specifically in the _unzip_iter function within nltk/downloader.py. This function uses Python's zipfile.extractall() method to extract downloaded packages without performing any path validation or security checks. As a result, attackers can craft malicious zip archives containing specially named files that exploit path traversal or overwrite critical files. More importantly, if the malicious package includes Python files such as __init__.py, these files are automatically executed when imported by NLTK, leading to remote code execution (RCE). This vulnerability arises because NLTK assumes all downloaded packages are trusted and does not verify their integrity or origin. The impact of this vulnerability is severe, allowing attackers to execute arbitrary code remotely without requiring authentication or user interaction. Exploitation can lead to full system compromise, including unauthorized file system access, network communications, and the ability to establish persistence mechanisms. The vulnerability has been assigned a CVSS v3.0 score of 10.0, indicating critical severity with network attack vector, no privileges required, no user interaction, and complete impact on confidentiality, integrity, and availability. Although no patches or fixes are currently available, the vulnerability disclosure urges immediate attention. Given NLTK's widespread use in academia, industry, and AI research, this vulnerability poses a significant risk to organizations relying on Python-based NLP workflows.

Potential Impact

The impact of CVE-2025-14009 is critical and far-reaching. Successful exploitation allows remote attackers to execute arbitrary code on affected systems without authentication or user interaction. This can lead to full system compromise, including unauthorized access to sensitive data, modification or destruction of files, disruption of services, and lateral movement within networks. Attackers can also establish persistence mechanisms, making remediation difficult. Organizations using NLTK in production environments, especially those processing untrusted data or automating package downloads, face high risk of compromise. The vulnerability undermines the confidentiality, integrity, and availability of affected systems. Given NLTK's popularity in natural language processing tasks across various sectors, including academia, technology companies, and government research, the potential for widespread exploitation exists. The lack of current patches increases exposure, and attackers may develop exploits rapidly once details are public. This threat could facilitate espionage, data theft, sabotage, or deployment of ransomware and other malware.

Mitigation Recommendations

To mitigate CVE-2025-14009, organizations should immediately disable automatic downloading and extraction of NLTK packages until a secure patch is released. Instead, manually download and verify packages from trusted sources before installation. Implement strict network controls to restrict outbound connections from systems running NLTK to prevent unauthorized downloads. Use sandboxing or containerization to isolate environments where NLTK is used, limiting potential damage from exploitation. Monitor logs and network traffic for unusual activity related to NLTK downloader operations. Employ file integrity monitoring to detect unauthorized changes to Python package files. Consider applying custom patches or monkey patches to the _unzip_iter function to add path validation and restrict extraction paths. Educate developers and data scientists about the risks of using untrusted packages and enforce code review policies for dependencies. Stay updated with NLTK project communications for official patches and apply them promptly once available. Finally, conduct vulnerability scanning and penetration testing focused on this issue to assess exposure.

Pro Console: star threats, build custom feeds, automate alerts via Slack, email & webhooks.Upgrade to Pro

Technical Details

Data Version
5.2
Assigner Short Name
@huntr_ai
Date Reserved
2025-12-04T09:27:21.716Z
Cvss Version
3.0
State
PUBLISHED

Threat ID: 69969ef76aea4a407a3d9a78

Added to database: 2/19/2026, 5:26:15 AM

Last enriched: 2/27/2026, 8:12:06 AM

Last updated: 4/8/2026, 8:27:50 PM

Views: 385

Community Reviews

0 reviews

Crowdsource mitigation strategies, share intel context, and vote on the most helpful responses. Sign in to add your voice and help keep defenders ahead.

Sort by
Loading community insights…

Want to contribute mitigation steps or threat intel context? Sign in or create an account to join the community discussion.

Actions

PRO

Updates to AI analysis require Pro Console access. Upgrade inside Console → Billing.

Please log in to the Console to use AI analysis features.

Need more coverage?

Upgrade to Pro Console for AI refresh and higher limits.

For incident response and remediation, OffSeq services can help resolve threats faster.

Latest Threats

Breach by OffSeqOFFSEQFRIENDS — 25% OFF

Check if your credentials are on the dark web

Instant breach scanning across billions of leaked records. Free tier available.

Scan now
OffSeq TrainingCredly Certified

Lead Pen Test Professional

Technical5-day eLearningPECB Accredited
View courses