Skip to main content
Press slash or control plus K to focus the search. Use the arrow keys to navigate results and press enter to open a threat.
Reconnecting to live updates…

CVE-2025-54988: CWE-611 Improper Restriction of XML External Entity Reference in Apache Software Foundation Apache Tika PDF parser module

0
High
VulnerabilityCVE-2025-54988cvecve-2025-54988cwe-611
Published: Wed Aug 20 2025 (08/20/2025, 20:08:49 UTC)
Source: CVE Database V5
Vendor/Project: Apache Software Foundation
Product: Apache Tika PDF parser module

Description

Critical XXE in Apache Tika (tika-parser-pdf-module) in Apache Tika 1.13 through and including 3.2.1 on all platforms allows an attacker to carry out XML External Entity injection via a crafted XFA file inside of a PDF. An attacker may be able to read sensitive data or trigger malicious requests to internal resources or third-party servers. Note that the tika-parser-pdf-module is used as a dependency in several Tika packages including at least: tika-parsers-standard-modules, tika-parsers-standard-package, tika-app, tika-grpc and tika-server-standard. Users are recommended to upgrade to version 3.2.2, which fixes this issue.

AI-Powered Analysis

Machine-generated threat intelligence

AILast updated: 02/27/2026, 03:44:54 UTC

Technical Analysis

CVE-2025-54988 is an XML External Entity (XXE) vulnerability classified under CWE-611 that affects the Apache Tika PDF parser module from versions 1.13 through 3.2.1. Apache Tika is a widely used content analysis toolkit that extracts metadata and text from various file formats, including PDFs. The vulnerability arises from improper restriction of XML external entity references within the XFA (XML Forms Architecture) files embedded inside PDFs. An attacker can craft a malicious PDF containing a specially designed XFA file that triggers the XXE flaw during parsing. This can lead to disclosure of sensitive files on the host system, server-side request forgery (SSRF) to internal or third-party resources, or denial of service conditions. The vulnerable tika-parser-pdf-module is a dependency in several Apache Tika packages, amplifying the potential attack surface. Exploitation does not require authentication or user interaction but does require the application to parse the malicious PDF. The vulnerability has a CVSS v3.1 base score of 8.4 (AV:L/AC:L/PR:N/UI:N/S:U/C:H/I:H/A:H), indicating high severity with local attack vector, low attack complexity, no privileges or user interaction needed, and impacts on confidentiality, integrity, and availability. The Apache Software Foundation has addressed the issue in version 3.2.2 by properly restricting external entity processing in the PDF parser module. While no active exploits have been reported, the vulnerability poses a significant risk to any organization using affected Apache Tika versions for document processing.

Potential Impact

The impact of CVE-2025-54988 is substantial for organizations relying on Apache Tika for document parsing and content extraction, especially where PDFs with embedded XFA forms are processed. Successful exploitation can lead to unauthorized disclosure of sensitive internal files, potentially exposing confidential business information or credentials. Additionally, attackers may leverage the vulnerability to perform SSRF attacks, accessing internal network resources or third-party services, which could facilitate further lateral movement or data exfiltration. The integrity of the processing system can be compromised by injecting malicious requests or causing denial of service, disrupting critical document workflows. Given Apache Tika’s integration in many enterprise content management systems, search engines, and data ingestion pipelines, the vulnerability could affect a broad range of industries including finance, healthcare, government, and technology. The lack of required authentication and user interaction lowers the barrier for exploitation, increasing the risk. Although no known exploits are currently in the wild, the widespread deployment of vulnerable versions and the critical nature of the flaw necessitate urgent remediation to prevent potential attacks.

Mitigation Recommendations

To mitigate CVE-2025-54988, organizations should immediately upgrade all Apache Tika deployments to version 3.2.2 or later, where the XXE vulnerability has been fixed by properly restricting XML external entity processing in the PDF parser module. For environments where immediate upgrade is not feasible, consider disabling or restricting processing of XFA forms within PDFs if configurable. Employ strict input validation and sandboxing around document parsing components to limit the impact of malicious files. Monitor logs for unusual requests or errors related to PDF parsing that could indicate exploitation attempts. Network segmentation and firewall rules should be used to restrict outbound requests from document processing servers to prevent SSRF exploitation. Additionally, implement least privilege principles for services running Apache Tika to minimize access to sensitive files and internal resources. Regularly review and update dependency versions in all applications that bundle Apache Tika components. Finally, maintain awareness of vendor advisories and threat intelligence for any emerging exploits targeting this vulnerability.

Pro Console: star threats, build custom feeds, automate alerts via Slack, email & webhooks.Upgrade to Pro

Technical Details

Data Version
5.1
Assigner Short Name
apache
Date Reserved
2025-08-04T16:04:26.626Z
Cvss Version
null
State
PUBLISHED

Threat ID: 68a62d6bad5a09ad0008befd

Added to database: 8/20/2025, 8:17:47 PM

Last enriched: 2/27/2026, 3:44:54 AM

Last updated: 3/26/2026, 8:43:11 AM

Views: 266

Community Reviews

0 reviews

Crowdsource mitigation strategies, share intel context, and vote on the most helpful responses. Sign in to add your voice and help keep defenders ahead.

Sort by
Loading community insights…

Want to contribute mitigation steps or threat intel context? Sign in or create an account to join the community discussion.

Actions

PRO

Updates to AI analysis require Pro Console access. Upgrade inside Console → Billing.

Please log in to the Console to use AI analysis features.

Need more coverage?

Upgrade to Pro Console for AI refresh and higher limits.

For incident response and remediation, OffSeq services can help resolve threats faster.

Latest Threats

Breach by OffSeqOFFSEQFRIENDS — 25% OFF

Check if your credentials are on the dark web

Instant breach scanning across billions of leaked records. Free tier available.

Scan now
OffSeq TrainingCredly Certified

Lead Pen Test Professional

Technical5-day eLearningPECB Accredited
View courses