Unicode code points in the following ranges are valid in XML 1.0 documents [10]:
U+0009, U+000A, U+000D: these are the only C0 controls permitted in XML 1.0;
U+0020–U+D7FF, U+E000–U+FFFD: this excludes some (not all) non-characters in the BMP (all surrogates, U+FFFE and U+FFFF are forbidden);
U+10000–U+10FFFF: this includes all code points in the supplementary planes, including non-characters.
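As a rough illustration, the ranges listed above can be turned into a simple membership test. This is a minimal sketch rather than a full XML validator; the function names `is_xml10_char` and `first_invalid_xml10` are made up for this example.

```python
def is_xml10_char(cp: int) -> bool:
    """Return True if the code point falls in one of the XML 1.0 ranges above."""
    return (
        cp in (0x9, 0xA, 0xD)          # the only C0 controls permitted
        or 0x20 <= cp <= 0xD7FF        # BMP below the surrogate block
        or 0xE000 <= cp <= 0xFFFD      # BMP above the surrogates, minus U+FFFE/U+FFFF
        or 0x10000 <= cp <= 0x10FFFF   # supplementary planes
    )


def first_invalid_xml10(text: str):
    """Return the index of the first character not valid in XML 1.0, or None."""
    for index, ch in enumerate(text):
        if not is_xml10_char(ord(ch)):
            return index
    return None
```

For instance, `first_invalid_xml10("a\x01b")` reports index 1, because U+0001 is a C0 control outside the permitted three. A BMP non-character such as U+FDD0 still passes the test, which matches the "some (not all)" remark above.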
XML 1.1 [11] extends the set of permitted characters to include all of the above, plus the remaining characters in the range U+0001–U+001F. At the same time, however, it restricts the use of C0 and C1 control characters other than U+0009, U+000A, U+000D, and U+0085 by requiring them to be written in escaped form (for example, U+0001 must be written as &#x01; or its equivalent). In the case of C1 characters, this restriction is a backwards incompatibility; it was introduced to allow common encoding errors to be detected.
The code point U+0000 is not permitted in any XML 1.0 or 1.1 document, even in escaped form; apart from the surrogates, U+FFFE, and U+FFFF, it is the only code point excluded by both versions.
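The XML 1.1 rules can be sketched in the same way. The example below is only an illustration with made-up function names. It assumes the restricted set is the one given by the RestrictedChar production of the XML 1.1 specification, which also covers U+007F, and it escapes only those restricted characters: markup characters such as & and < are left untouched.

```python
def is_xml11_char(cp: int) -> bool:
    """Return True if the code point may appear in an XML 1.1 document at all."""
    return (
        0x1 <= cp <= 0xD7FF            # everything below the surrogates except U+0000
        or 0xE000 <= cp <= 0xFFFD
        or 0x10000 <= cp <= 0x10FFFF
    )


def is_xml11_restricted(cp: int) -> bool:
    """Return True if the code point must be written as a character reference."""
    return (
        0x1 <= cp <= 0x8
        or cp in (0xB, 0xC)
        or 0xE <= cp <= 0x1F
        or 0x7F <= cp <= 0x84          # U+0085 is deliberately left out
        or 0x86 <= cp <= 0x9F
    )


def escape_restricted_xml11(text: str) -> str:
    """Replace restricted characters with hexadecimal character references."""
    pieces = []
    for ch in text:
        cp = ord(ch)
        if not is_xml11_char(cp):
            raise ValueError(f"U+{cp:04X} is not allowed in XML 1.1")
        pieces.append(f"&#x{cp:02X};" if is_xml11_restricted(cp) else ch)
    return "".join(pieces)
```

With this sketch, `escape_restricted_xml11("a\x01b")` yields `a&#x01;b`, while a string containing U+0000 raises `ValueError`, since that code point cannot be represented even in escaped form.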