The characters in the XML document can be encoded in different formats. XML uses the Unicode (UTF-8 or UTF-16) character set by default, but other encodings can be used if they are declared in the XML declaration at the beginning of the document.
Related categories 3
Chinese XML FAQ (in English)
FAQ about Chinese document encodings, and how they relate to XML. Also includes links and software.
Entry-Level Unicode for XML
A "just enough education to perform" guide to Unicode and ISO 10646 for authors of XML parsers and other software that processes XML.
Expressing Simple Dublin Core in RDF/XML
Explains how to encode the Dublin Core Metadata Element Set V1.1 (DCMES) in XML, provides a DTD to validate the documents and describes a method to link them from web pages.
ISO 8859 Alphabet Soup
A commented graphical overview of the ISO 8859 character sets by Roman Czyborra.
MSDN: How to Encode XML Data
Article explaining how character encoding works and specifically how it works in XML and the MSXML DOM.
The Unicode Consortium
The official site with information and resources for programmers, implementers and others involved in computer globalization work.
What ASN.1 can offer to XML
XML Encoding Rules for Abstract Syntax Notation One (ASN.1) and the ASN.1 Markup Language [ITU-T Rec. X.693 | ISO/IEC 8825-4].
XER - XML Encoding Rules
Idea to standardize rules that would allow for information described in ASN.1 (Abstract Syntax Notation) to be carried in XML. [pdf]
Last update:July 1, 2016 at 9:35:07 UTC