CVE-2026-35375: CWE-176: Improper Handling of Unicode Encoding in Uutils coreutils
A logic error in the split utility of uutils coreutils causes the corruption of output filenames when provided with non-UTF-8 prefix or suffix inputs. The implementation utilizes to_string_lossy() when constructing chunk filenames, which automatically rewrites invalid byte sequences into the UTF-8 replacement character (U+FFFD). This behavior diverges from GNU split, which preserves raw pathname bytes intact. In environments utilizing non-UTF-8 encodings, this vulnerability leads to the creation of files with incorrect names, potentially causing filename collisions, broken automation, or the misdirection of output data.
AI Analysis
Technical Summary
The vulnerability in uutils coreutils split utility involves a logic error related to Unicode encoding handling. When non-UTF-8 prefix or suffix inputs are provided, the utility uses to_string_lossy() to generate chunk filenames, which replaces invalid byte sequences with the UTF-8 replacement character (U+FFFD). This behavior differs from GNU split, which preserves raw pathname bytes. As a result, in non-UTF-8 environments, output filenames can be corrupted, potentially causing filename collisions and automation failures.
Potential Impact
The impact is limited to the integrity of output filenames generated by the split utility in uutils coreutils when handling non-UTF-8 encoded inputs. This can cause filename collisions, broken automation workflows, or misdirection of output data. There is no direct confidentiality, availability, or system compromise impact reported.
Mitigation Recommendations
Patch status is not yet confirmed — check the vendor advisory for current remediation guidance. Until a fix is available, users should be cautious when using the split utility with non-UTF-8 encoded prefixes or suffixes and consider using GNU split as an alternative if filename integrity is critical.
CVE-2026-35375: CWE-176: Improper Handling of Unicode Encoding in Uutils coreutils
Description
A logic error in the split utility of uutils coreutils causes the corruption of output filenames when provided with non-UTF-8 prefix or suffix inputs. The implementation utilizes to_string_lossy() when constructing chunk filenames, which automatically rewrites invalid byte sequences into the UTF-8 replacement character (U+FFFD). This behavior diverges from GNU split, which preserves raw pathname bytes intact. In environments utilizing non-UTF-8 encodings, this vulnerability leads to the creation of files with incorrect names, potentially causing filename collisions, broken automation, or the misdirection of output data.
AI-Powered Analysis
Machine-generated threat intelligence
Technical Analysis
The vulnerability in uutils coreutils split utility involves a logic error related to Unicode encoding handling. When non-UTF-8 prefix or suffix inputs are provided, the utility uses to_string_lossy() to generate chunk filenames, which replaces invalid byte sequences with the UTF-8 replacement character (U+FFFD). This behavior differs from GNU split, which preserves raw pathname bytes. As a result, in non-UTF-8 environments, output filenames can be corrupted, potentially causing filename collisions and automation failures.
Potential Impact
The impact is limited to the integrity of output filenames generated by the split utility in uutils coreutils when handling non-UTF-8 encoded inputs. This can cause filename collisions, broken automation workflows, or misdirection of output data. There is no direct confidentiality, availability, or system compromise impact reported.
Mitigation Recommendations
Patch status is not yet confirmed — check the vendor advisory for current remediation guidance. Until a fix is available, users should be cautious when using the split utility with non-UTF-8 encoded prefixes or suffixes and consider using GNU split as an alternative if filename integrity is critical.
Technical Details
- Data Version
- 5.2
- Assigner Short Name
- canonical
- Date Reserved
- 2026-04-02T12:58:56.088Z
- Cvss Version
- 3.1
- State
- PUBLISHED
- Remediation Level
- null
Threat ID: 69e8f7d519fe3cd2cdd00d94
Added to database: 4/22/2026, 4:31:17 PM
Last enriched: 4/22/2026, 4:47:28 PM
Last updated: 4/23/2026, 3:26:41 AM
Views: 6
Community Reviews
0 reviewsCrowdsource mitigation strategies, share intel context, and vote on the most helpful responses. Sign in to add your voice and help keep defenders ahead.
Want to contribute mitigation steps or threat intel context? Sign in or create an account to join the community discussion.
Actions
Updates to AI analysis require Pro Console access. Upgrade inside Console → Billing.
Need more coverage?
Upgrade to Pro Console for AI refresh and higher limits.
For incident response and remediation, OffSeq services can help resolve threats faster.
Latest Threats
Check if your credentials are on the dark web
Instant breach scanning across billions of leaked records. Free tier available.