From Newsgroup: comp.misc
hrtuybxi@outlook.com wrote:
> On 24/04/2025 02:16, Lawrence D'Oliveiro wrote:
>> I was slightly disillusioned when I found that there were certain
>> characters that were not allowed in XML files, even when entity-encoded.
> Not even in CDATA section? I wonder what characters those might be.
https://www.w3.org/TR/REC-xml/#charsets
Character Range
[2] Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] |
[#xE000-#xFFFD] | [#x10000-#x10FFFF]
/* any Unicode character, excluding the surrogate blocks, FFFE, and FFFF. */
Most of the control characters are explicitly excluded (everything
below #x20 except tab, LF and CR), plus some Unicode code points: the
surrogates #xD800-#xDFFF, and #xFFFE and #xFFFF.
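
If you want to check a string against that production before writing
it out, here is a rough sketch in Python. The function names
is_xml_char and first_invalid are my own, not from the spec or any
library, and this only covers XML 1.0 (XML 1.1 has a different Char
production).

```python
def is_xml_char(cp: int) -> bool:
    """True if code point cp matches production [2] Char of XML 1.0."""
    return (
        cp in (0x9, 0xA, 0xD)            # tab, LF, CR
        or 0x20 <= cp <= 0xD7FF          # stops just before the surrogates
        or 0xE000 <= cp <= 0xFFFD        # skips #xFFFE and #xFFFF
        or 0x10000 <= cp <= 0x10FFFF     # supplementary planes
    )


def first_invalid(text: str) -> int:
    """Index of the first character that is not a Char, or -1 if none."""
    for i, ch in enumerate(text):
        if not is_xml_char(ord(ch)):
            return i
    return -1
```

Something like first_invalid(payload) before serialising tells you
where the offending character sits, so you can decide whether to strip
it or encode the data some other way (base64, for instance).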
And, yes, even in CDATA sections, because the CData production is
built from the same "Char" definition above:
[20] CData ::= (Char* - (Char* ']]>' Char*))
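
You can see a conforming parser enforce this. A small sketch, assuming
CPython's xml.etree.ElementTree (which sits on top of expat); the
helper name parses is mine. Other parsers may word their errors
differently, but an XML 1.0 parser should reject the same inputs.

```python
import xml.etree.ElementTree as ET


def parses(doc: str) -> bool:
    """True if doc is accepted as well-formed XML by ElementTree."""
    try:
        ET.fromstring(doc)
        return True
    except ET.ParseError:
        return False


ok_reference = parses("<a>&#9;</a>")                 # tab is a Char
bad_reference = parses("<a>&#1;</a>")                # #x1 as a character reference
bad_cdata = parses("<a><![CDATA[\x01]]></a>")        # raw #x1 inside CDATA
ok_cdata = parses("<a><![CDATA[plain text]]></a>")   # ordinary CDATA content
```

So neither the character reference nor the CDATA wrapper gets #x1
through: both fail the well-formedness check.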