CVE-2025-33238: CWE-362 Concurrent Execution using Shared Resource with Improper Synchronization ('Race Condition') in NVIDIA Triton Inference Server

Severity: high

Type: vulnerability

CVE: CVE-2025-33238

NVIDIA Triton Inference Server Sagemaker HTTP server contains a vulnerability where an attacker may cause an exception. A successful exploit of this vulnerability may lead to denial of service.

AI Analysis

Technical Summary

CVE-2025-33238 is a race condition (CWE-362) in the Sagemaker HTTP server component of NVIDIA Triton Inference Server. Concurrent access to a shared resource is not properly synchronized, so an attacker can cause an exception in the server; a successful exploit may crash the server or leave it unresponsive, resulting in a denial of service (DoS). All versions of Triton Inference Server prior to 26.01 are affected. According to the CVSS v3.1 vector, the flaw is exploitable over the network with low attack complexity and requires neither privileges nor user interaction. No exploits have been observed in the wild to date. The CVSS v3.1 base score of 7.5 (high) reflects the availability impact and the ease of exploitation; confidentiality and integrity are not affected. The record carries no patch link, but because the affected range ends at 26.01, the fix is presumably included in that release. Organizations running Triton for AI inference workloads, especially on endpoints reachable from untrusted networks, are at risk of service interruptions. The issue underlines the need for correct concurrency control in multi-threaded servers that handle inference requests.
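The advisory does not disclose which shared resource is involved, so the following is only a generic Python illustration of the CWE-362 pattern, not Triton's code. `SessionRegistry`, `close_unsafe`, `close_safe`, and `close_concurrently` are hypothetical names invented for this sketch. It contrasts an unsynchronized check-then-act sequence, which can raise an exception under concurrent requests, with a variant that performs the check and the removal under one lock.

```python
import threading


class SessionRegistry:
    """Hypothetical shared table of in-flight requests (not a Triton type)."""

    def __init__(self):
        self._sessions = {}
        self._lock = threading.Lock()

    def open(self, request_id, payload):
        with self._lock:
            self._sessions[request_id] = payload

    def close_unsafe(self, request_id):
        # Check-then-act without a lock: another thread may delete the key
        # between the membership test and the `del`, raising KeyError.
        if request_id in self._sessions:
            del self._sessions[request_id]
            return True
        return False

    def close_safe(self, request_id):
        # The lock makes the check and the removal a single atomic step.
        with self._lock:
            return self._sessions.pop(request_id, None) is not None


def close_concurrently(registry, request_id, workers=8):
    """Let several threads race to close one request; report winners and errors."""
    results = []
    errors = []
    results_lock = threading.Lock()

    def worker():
        try:
            closed = registry.close_safe(request_id)
            with results_lock:
                results.append(closed)
        except Exception as exc:  # would indicate a synchronization bug
            with results_lock:
                errors.append(exc)

    threads = [threading.Thread(target=worker) for _ in range(workers)]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return results.count(True), errors
```

With the synchronized variant, exactly one of the racing threads closes the request and none of them raises, which is the property the unsynchronized variant cannot guarantee.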

Potential Impact

The primary impact of CVE-2025-33238 is denial of service against AI inference services that rely on NVIDIA Triton Inference Server. An outage can affect business-critical applications such as real-time analytics, autonomous systems, and cloud-based AI services, causing downtime and potential loss of revenue or operational capability. Because the vulnerability does not compromise data confidentiality or integrity, the risk is confined to availability. Even so, sectors such as finance, healthcare, manufacturing, and autonomous vehicles increasingly depend on inference servers, so a temporary outage can cascade to dependent systems and processes. Organizations with large-scale deployments, or those that offer AI inference as a service, are the most exposed to operational disruption. Remote exploitation without authentication makes publicly accessible Triton servers the most likely targets. The flaw could also be used as one step in a broader attack intended to degrade AI capabilities or cause reputational damage.

Mitigation Recommendations

To mitigate CVE-2025-33238, organizations should prioritize upgrading NVIDIA Triton Inference Server to version 26.01 or later, the first release outside the affected range. Until the upgrade is in place, restrict network access to the Sagemaker HTTP server component: use network segmentation and firewall rules so that only trusted clients can reach it. Where possible, place authentication and authorization in front of the endpoint to reduce the attack surface. Monitor server logs and performance metrics for unexpected exceptions, crashes, or restarts that may indicate exploitation attempts. Consider rate limiting or request throttling to reduce the bursts of concurrent requests that could trigger the race condition; this lowers the likelihood of the race but does not remove the underlying flaw. Review concurrency and resource limits in the deployment environment, for example by capping simultaneous connections at a reverse proxy or load balancer. For cloud deployments, use provider security groups and virtual private clouds to isolate Triton servers. Finally, maintain an incident response plan so that any service disruption caused by exploitation can be handled quickly.
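As one way to picture the request-throttling recommendation, here is a minimal token-bucket sketch in Python. It is an assumption about how a front-end proxy or gateway might throttle traffic before it reaches the server; `TokenBucket`, `rate`, `capacity`, and `clock` are illustrative names, not Triton settings, and throttling reduces exposure to the race without fixing it.

```python
import threading
import time


class TokenBucket:
    """Thread-safe token bucket: at most `capacity` requests in a burst,
    refilled at `rate` tokens per second. Hypothetical helper for a proxy."""

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = float(rate)
        self.capacity = float(capacity)
        self._clock = clock          # injectable so the logic can be tested
        self._tokens = float(capacity)
        self._last = clock()
        self._lock = threading.Lock()

    def allow(self):
        """Return True if a request may proceed, False if it should be rejected."""
        with self._lock:
            now = self._clock()
            elapsed = now - self._last
            self._last = now
            # Refill according to elapsed time, never above capacity.
            self._tokens = min(self.capacity, self._tokens + elapsed * self.rate)
            if self._tokens >= 1.0:
                self._tokens -= 1.0
                return True
            return False
```

A caller would typically create one bucket per client address and answer with HTTP 429 whenever `allow()` returns False.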

Affected Countries

United States, China, Germany, Japan, South Korea, United Kingdom, Canada, France, India, Australia

Source: CVE Database V5

Published: 03/24/2026

EPSS: 0.3% (top 75%)

Published (exact time): 03/24/2026, 20:25:57 UTC

Vendor/Project: NVIDIA

Product: Triton Inference Server

CVSS v3.1

Score: 7.5 (High)

Attack Vector: Network
Attack Complexity: Low
Privileges Required: None
User Interaction: None
Scope: Unchanged
Confidentiality: None
Integrity: None
Availability: High

Vector: CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:N/I:N/A:H
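To show how the 7.5 score follows from the vector, the sketch below applies the CVSS v3.1 base-score equations in Python. The metric weights and the rounding rule are taken from the CVSS v3.1 specification as best understood here; the function names `roundup` and `base_score` are this example's own, and temporal and environmental metrics are not handled.

```python
import math

# Numeric weights for CVSS v3.1 base metrics.
WEIGHTS = {
    "AV": {"N": 0.85, "A": 0.62, "L": 0.55, "P": 0.2},
    "AC": {"L": 0.77, "H": 0.44},
    "UI": {"N": 0.85, "R": 0.62},
    "C": {"H": 0.56, "L": 0.22, "N": 0.0},
    "I": {"H": 0.56, "L": 0.22, "N": 0.0},
    "A": {"H": 0.56, "L": 0.22, "N": 0.0},
}
# Privileges Required depends on whether Scope is Unchanged (U) or Changed (C).
PR_WEIGHTS = {
    "U": {"N": 0.85, "L": 0.62, "H": 0.27},
    "C": {"N": 0.85, "L": 0.68, "H": 0.5},
}


def roundup(value):
    """Round up to one decimal place, avoiding floating-point artifacts."""
    scaled = round(value * 100000)
    if scaled % 10000 == 0:
        return scaled / 100000.0
    return (math.floor(scaled / 10000) + 1) / 10.0


def base_score(vector):
    """Compute the base score for a 'CVSS:3.1/...' vector string."""
    prefix, *parts = vector.split("/")
    if prefix != "CVSS:3.1":
        raise ValueError("expected a CVSS:3.1 vector")
    m = dict(part.split(":") for part in parts)
    scope = m["S"]

    iss = 1 - ((1 - WEIGHTS["C"][m["C"]])
               * (1 - WEIGHTS["I"][m["I"]])
               * (1 - WEIGHTS["A"][m["A"]]))
    if scope == "U":
        impact = 6.42 * iss
    else:
        impact = 7.52 * (iss - 0.029) - 3.25 * (iss - 0.02) ** 15

    exploitability = (8.22 * WEIGHTS["AV"][m["AV"]] * WEIGHTS["AC"][m["AC"]]
                      * PR_WEIGHTS[scope][m["PR"]] * WEIGHTS["UI"][m["UI"]])

    if impact <= 0:
        return 0.0
    if scope == "U":
        return roundup(min(impact + exploitability, 10))
    return roundup(min(1.08 * (impact + exploitability), 10))


print(base_score("CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:N/I:N/A:H"))  # 7.5
```

For this vector the impact sub-score is 6.42 × 0.56 ≈ 3.60 and the exploitability sub-score is 8.22 × 0.85 × 0.77 × 0.85 × 0.85 ≈ 3.89; their sum, about 7.48, rounds up to 7.5.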

Affected software

nvidia/triton-inference-server

pkg:github/nvidia/triton-inference-server

Affected versions

<26.01
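A deployment inventory can be checked against this range with a small helper. The Python sketch below assumes release tags follow a `YY.MM` pattern, optionally with a suffix such as `-py3`; the names `FIXED_RELEASE`, `parse_release`, and `is_affected` are invented for this example, and the tag format should be confirmed against the tags actually in use.

```python
import re

# First release outside the affected range (<26.01), as a (year, month) tuple.
FIXED_RELEASE = (26, 1)

# Assumed tag shape: "YY.MM" with an optional "-suffix" (e.g. "25.12-py3").
_TAG_PATTERN = re.compile(r"v?(\d+)\.(\d+)(?:-[\w.-]+)?")


def parse_release(tag):
    """Turn a release tag into a comparable (year, month) tuple."""
    match = _TAG_PATTERN.fullmatch(tag.strip())
    if match is None:
        raise ValueError(f"unrecognized release tag: {tag!r}")
    return int(match.group(1)), int(match.group(2))


def is_affected(tag):
    """Return True if the release is earlier than 26.01."""
    return parse_release(tag) < FIXED_RELEASE


for tag in ("25.12-py3", "26.01", "26.02-py3"):
    print(tag, "affected" if is_affected(tag) else "not affected")
```

Comparing integer tuples rather than raw strings keeps the ordering numeric, so a tag such as `9.12` would not sort after `26.01`.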

Weaknesses

CWE-362: Race Condition

Technical Details

Data Version: 5.2
Assigner Short Name: nvidia
Date Reserved: 2025-04-15T18:51:08.191Z
CVSS Version: 3.1
State: PUBLISHED

Threat ID: 69c2f481f4197a8e3b7561d1

Added to database: 03/24/2026, 20:30:57 UTC

Last enriched: 03/24/2026, 20:51:20 UTC

Last updated: 07/31/2026, 19:22:53 UTC
