Characters and Character Sets
This page links to several documents that list the special characters or entities supported by HTML, SGML, and XML. All links open in a new browser window.
- Earliest Uses of Various Mathematical Symbols -- created by Jeff Miller, this site includes an illustrated history of symbols.
- i18n/l10n: Character Sets -- created by the W3C, this site discusses character sets on the Internet and provides links to character set resources for internationalization (that is, i18n) and localization (that is, l10n).
- ISO-8859 Briefing and Resources -- created by Alan J. Flavell at Glasgow University, this site is a 13-page document that discusses the ISO-8859-1 character codes, which represent the characters most commonly used in HTML and are also supported by SGML and XML.
- Mathematica 3.0 Characters from Assigned Unicode Space -- created by the American Mathematical Society, this site is a table with the following columns: Unicode hex, Mathematica name, character glyph, SGML aliases, TeX aliases, and Mathematica aliases.
- Minimum European Subset of ISO/IEC 10646-1 -- created by Michael Everson, this site lists the characters and scripts of Europe.
- RFC 2044: UTF-8, a Transformation Format of Unicode and ISO 10646 -- presents information and examples about the Unicode standard and related character sets and encodings: UTF-8, US-ASCII, UCS-2, and UCS-4.
- RFC 2130: The Report of the IAB Character Set Workshop -- provides an overview of character sets on the Internet, the problems they pose, and how characters are currently handled, as well as a list of acronyms and a glossary.
- The Unicode Consortium -- created by the group responsible for the Unicode standard, this site includes compiled charts and names of special characters as well as links to resources, conferences, and the official standard.
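
The resources above refer to several ways of representing the same character: a Unicode code point, an HTML/SGML/XML numeric character reference, an ISO-8859-1 byte, and UTF-8, UCS-2, or UCS-4 bytes. As a rough illustration only, the following Python sketch prints those forms for a few characters. The function names `hex_bytes` and `describe` are hypothetical and are not taken from any of the linked documents. The sketch also assumes that UCS-2 can be approximated by big-endian UTF-16 for code points up to U+FFFF, and UCS-4 by big-endian UTF-32.

```python
def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated lowercase hex pairs."""
    return " ".join(f"{b:02x}" for b in data)


def describe(ch: str) -> dict:
    """Return several representations of a single character.

    Assumption: UCS-2 is approximated by UTF-16 (big-endian), which is
    only valid for code points up to U+FFFF; UCS-4 is approximated by
    UTF-32 (big-endian).
    """
    if len(ch) != 1:
        raise ValueError("expected exactly one character")
    cp = ord(ch)

    # ISO-8859-1 covers only code points 0x00-0xFF.
    try:
        latin1 = hex_bytes(ch.encode("iso-8859-1"))
    except UnicodeEncodeError:
        latin1 = None

    # UCS-2 cannot represent characters above U+FFFF.
    ucs2 = hex_bytes(ch.encode("utf-16-be")) if cp <= 0xFFFF else None

    return {
        "code_point": f"U+{cp:04X}",
        "numeric_reference": f"&#{cp};",
        "iso_8859_1": latin1,
        "utf8": hex_bytes(ch.encode("utf-8")),
        "ucs2": ucs2,
        "ucs4": hex_bytes(ch.encode("utf-32-be")),
    }


if __name__ == "__main__":
    for ch in ("A", "\u00e9", "\u20ac"):  # A, e with acute accent, euro sign
        print(describe(ch))
```

In this sketch, a character outside ISO-8859-1 (such as the euro sign) reports `None` for the `iso_8859_1` field, which shows why the wider encodings covered by the RFCs and the Unicode Consortium are needed.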