Skip to main content
Press slash or control plus K to focus the search. Use the arrow keys to navigate results and press enter to open a threat.
Reconnecting to live updates…

CVE-2026-34756: CWE-770: Allocation of Resources Without Limits or Throttling in vllm-project vllm

0
Medium
VulnerabilityCVE-2026-34756cvecve-2026-34756cwe-770
Published: Mon Apr 06 2026 (04/06/2026, 15:40:03 UTC)
Source: CVE Database V5
Vendor/Project: vllm-project
Product: vllm

Description

A Denial of Service (DoS) vulnerability exists in vllm versions from 0. 1. 0 up to but not including 0. 19. 0. The issue arises because the OpenAI-compatible API server does not limit the 'n' parameter in certain request models, allowing an unauthenticated attacker to send a request with an extremely large 'n' value. This causes excessive memory allocation and blocks the Python asyncio event loop, leading to immediate Out-Of-Memory crashes. The vulnerability is fixed in version 0. 19. 0.

AI-Powered Analysis

Machine-generated threat intelligence

AILast updated: 04/14/2026, 16:05:00 UTC

Technical Analysis

CVE-2026-34756 is a resource exhaustion vulnerability (CWE-770) in the vllm-project's vllm inference engine for large language models. Specifically, the OpenAI-compatible API server does not enforce an upper bound on the 'n' parameter in ChatCompletionRequest and CompletionRequest Pydantic models. An attacker can exploit this by sending a single HTTP request with an astronomically large 'n' value, causing the server to allocate millions of request object copies in memory before scheduling, which blocks the asyncio event loop and results in an immediate Out-Of-Memory crash. This affects all vllm versions from 0.1.0 up to but not including 0.19.0. The vulnerability has a CVSS 3.1 base score of 6.5 (medium severity).

Potential Impact

Successful exploitation results in a Denial of Service condition by exhausting server memory and blocking the event loop, causing the vllm service to crash. There is no impact on confidentiality or integrity reported. The vulnerability can be triggered by an unauthenticated attacker via a single HTTP request.

Mitigation Recommendations

This vulnerability is fixed in vllm version 0.19.0. Users should upgrade to version 0.19.0 or later to remediate this issue. Patch status is not explicitly stated in the vendor advisory, but the fix is included in the specified version. Until upgraded, users should consider implementing request size limits or input validation on the 'n' parameter at a proxy or API gateway level to mitigate exploitation.

Pro Console: star threats, build custom feeds, automate alerts via Slack, email & webhooks.Upgrade to Pro

Technical Details

Data Version
5.2
Assigner Short Name
GitHub_M
Date Reserved
2026-03-30T19:17:10.225Z
Cvss Version
3.1
State
PUBLISHED
Remediation Level
null

Threat ID: 69d49831aaed68159aca0f94

Added to database: 4/7/2026, 5:37:53 AM

Last enriched: 4/14/2026, 4:05:00 PM

Last updated: 5/22/2026, 8:45:56 AM

Views: 112

Community Reviews

0 reviews

Crowdsource mitigation strategies, share intel context, and vote on the most helpful responses. Sign in to add your voice and help keep defenders ahead.

Sort by
Loading community insights…

Want to contribute mitigation steps or threat intel context? Sign in or create an account to join the community discussion.

Actions

PRO

Updates to AI analysis require Pro Console access. Upgrade inside Console → Billing.

Please log in to the Console to use AI analysis features.

Need more coverage?

Upgrade to Pro Console for AI refresh and higher limits.

For incident response and remediation, OffSeq services can help resolve threats faster.

Latest Threats

Breach by OffSeqOFFSEQFRIENDS — 25% OFF

Check if your credentials are on the dark web

Instant breach scanning across billions of leaked records. Free tier available.

Scan now
OffSeq TrainingCredly Certified

Lead Pen Test Professional

Technical5-day eLearningPECB Accredited
View courses