ladybird/HTMLParser.cpp at db2ba5f1d9234b51a547ce01c3da8ecd5006ee87

mirror of https://github.com/fergalmoran/ladybird.git synced 2025-12-23 01:39:55 +00:00

Files

Timothy Flynn f5f1a5228e LibWeb: Escape HTML text fragments with multi-byte code point awareness

The UTF-8 encoding of U+00A0 (NBSP) is the bytes 0xc2 0xa0. By looping
over the string to escape byte-by-byte, we replace the second byte with
"&nbsp;", but leave the first byte in the resulting text. This creates
an invalid UTF-8 string, with a lone leading byte.

2023-03-13 07:29:40 +00:00

166 KiB

Raw Blame History

View Raw

166 KiB Raw Blame History

166 KiB

Raw Blame History