NA

CVE-2024-34078

Published: 06/05/2024 Updated: 06/05/2024

Vulnerability Summary

html-sanitizer is an allowlist-based HTML cleaner. If using `keep_typographic_whitespace=False` (which is the default), the sanitizer normalizes unicode to the NFKC form at the end. Some unicode characters normalize to chevrons; this allows specially crafted HTML to escape sanitization. The problem has been fixed in 2.4.2.

Vendor Advisories

Debian Bug report logs - #1070710 python-html-sanitizer: CVE-2024-34078: Arbitrary HTML present after sanitization because of unicode normalization Package: src:python-html-sanitizer; Maintainer for src:python-html-sanitizer is Jonas Smedegaard <dr@jonesdk>; Reported by: Salvatore Bonaccorso <carnil@debianorg> Date: ...