CVE-2026-34756: CWE-770: Allocation of Resources Without Limits or Throttling in vllm-project vllm
A Denial of Service (DoS) vulnerability exists in vllm versions from 0. 1. 0 up to but not including 0. 19. 0. The issue arises because the OpenAI-compatible API server does not limit the 'n' parameter in certain request models, allowing an unauthenticated attacker to send a request with an extremely large 'n' value. This causes excessive memory allocation and blocks the Python asyncio event loop, leading to immediate Out-Of-Memory crashes. The vulnerability is fixed in version 0. 19. 0.
AI Analysis
Technical Summary
CVE-2026-34756 is a resource exhaustion vulnerability (CWE-770) in the vllm-project's vllm inference engine for large language models. Specifically, the OpenAI-compatible API server does not enforce an upper bound on the 'n' parameter in ChatCompletionRequest and CompletionRequest Pydantic models. An attacker can exploit this by sending a single HTTP request with an astronomically large 'n' value, causing the server to allocate millions of request object copies in memory before scheduling, which blocks the asyncio event loop and results in an immediate Out-Of-Memory crash. This affects all vllm versions from 0.1.0 up to but not including 0.19.0. The vulnerability has a CVSS 3.1 base score of 6.5 (medium severity).
Potential Impact
Successful exploitation results in a Denial of Service condition by exhausting server memory and blocking the event loop, causing the vllm service to crash. There is no impact on confidentiality or integrity reported. The vulnerability can be triggered by an unauthenticated attacker via a single HTTP request.
Mitigation Recommendations
This vulnerability is fixed in vllm version 0.19.0. Users should upgrade to version 0.19.0 or later to remediate this issue. Patch status is not explicitly stated in the vendor advisory, but the fix is included in the specified version. Until upgraded, users should consider implementing request size limits or input validation on the 'n' parameter at a proxy or API gateway level to mitigate exploitation.
CVE-2026-34756: CWE-770: Allocation of Resources Without Limits or Throttling in vllm-project vllm
Description
A Denial of Service (DoS) vulnerability exists in vllm versions from 0. 1. 0 up to but not including 0. 19. 0. The issue arises because the OpenAI-compatible API server does not limit the 'n' parameter in certain request models, allowing an unauthenticated attacker to send a request with an extremely large 'n' value. This causes excessive memory allocation and blocks the Python asyncio event loop, leading to immediate Out-Of-Memory crashes. The vulnerability is fixed in version 0. 19. 0.
AI-Powered Analysis
Machine-generated threat intelligence
Technical Analysis
CVE-2026-34756 is a resource exhaustion vulnerability (CWE-770) in the vllm-project's vllm inference engine for large language models. Specifically, the OpenAI-compatible API server does not enforce an upper bound on the 'n' parameter in ChatCompletionRequest and CompletionRequest Pydantic models. An attacker can exploit this by sending a single HTTP request with an astronomically large 'n' value, causing the server to allocate millions of request object copies in memory before scheduling, which blocks the asyncio event loop and results in an immediate Out-Of-Memory crash. This affects all vllm versions from 0.1.0 up to but not including 0.19.0. The vulnerability has a CVSS 3.1 base score of 6.5 (medium severity).
Potential Impact
Successful exploitation results in a Denial of Service condition by exhausting server memory and blocking the event loop, causing the vllm service to crash. There is no impact on confidentiality or integrity reported. The vulnerability can be triggered by an unauthenticated attacker via a single HTTP request.
Mitigation Recommendations
This vulnerability is fixed in vllm version 0.19.0. Users should upgrade to version 0.19.0 or later to remediate this issue. Patch status is not explicitly stated in the vendor advisory, but the fix is included in the specified version. Until upgraded, users should consider implementing request size limits or input validation on the 'n' parameter at a proxy or API gateway level to mitigate exploitation.
Technical Details
- Data Version
- 5.2
- Assigner Short Name
- GitHub_M
- Date Reserved
- 2026-03-30T19:17:10.225Z
- Cvss Version
- 3.1
- State
- PUBLISHED
- Remediation Level
- null
Threat ID: 69d49831aaed68159aca0f94
Added to database: 4/7/2026, 5:37:53 AM
Last enriched: 4/14/2026, 4:05:00 PM
Last updated: 5/22/2026, 8:45:56 AM
Views: 112
Community Reviews
0 reviewsCrowdsource mitigation strategies, share intel context, and vote on the most helpful responses. Sign in to add your voice and help keep defenders ahead.
Want to contribute mitigation steps or threat intel context? Sign in or create an account to join the community discussion.
Actions
Updates to AI analysis require Pro Console access. Upgrade inside Console → Billing.
Need more coverage?
Upgrade to Pro Console for AI refresh and higher limits.
For incident response and remediation, OffSeq services can help resolve threats faster.
Latest Threats
Check if your credentials are on the dark web
Instant breach scanning across billions of leaked records. Free tier available.