XML Character Entities Version 0.2

OASIS DocBook Technical Committee

Working Draft 19 Mar 2002

This version:: Working Draft: 19 Mar 2002

Previous versions:: Working Draft: 19 Nov 2001

Editor:: Norman Walsh <Norman.Walsh@Sun.COM>

This document and translations of it may be copied and furnished to others, and derivative works that comment on or otherwise explain it or assist in its implementation may be prepared, copied, published and distributed, in whole or in part, without restriction of any kind, provided that the above copyright notice and this paragraph are included on all such copies and derivative works. However, this document itself may not be modified in any way, such as by removing the copyright notice or references to OASIS, except as needed for the purpose of developing OASIS specifications, in which case the procedures for copyrights defined in the OASIS Intellectual Property Rights document must be followed, or as required to translate it into languages other than English.

The limited permissions granted above are perpetual and will not be revoked by OASIS or its successors or assigns.

This document and the information contained herein is provided on an "AS IS" basis and OASIS DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.

Abstract

This Standard defines XML encodings of the 19 standard character entity sets defined in Non-normative Annex D of [ISO 8879:1986].

Status of this Document

This Working Draft was approved for publication by the OASIS DocBook Technical Committee. Comments on this document may be sent to docbook@lists.oasis-open.org.

This Standard defines XML encodings of the standard SGML character entity sets.

Non-normative Annex D of [ISO 8879:1986] defines 19 standard SGML character entity sets: Added Latin 1, Added Latin 2, Greek Letters, Monotoniko Greek, Russian Cyrillic, Non-Russian Cyrillic, Numeric and Special Graphic, Diacritical Marks, Publishing, Box and Line Drawing, General Technical, Greek Symbols, Alternative Greek Symbols, Added Math Symbols: Ordinary, Added Math Symbols: Binary Operators, Added Math Symbols: Relations, Added Math Symbols: Negated Relations, Added Math Symbols: Arrow Relations, Added Math Symbols: Delimiters. The SGML declarations for these entities use the specific character data (SDATA) entity type that is not supported in XML, so alternative XML declarations are necessary.

In XML, the specific character data of most entities can be expressed as a [Unicode] character.

1. XML Character Entity Sets

The character entity sets defined by this Standard are summarized in Appendix A through Appendix S.

In order to use these entities in a document, they must be declared. Entities can be declared in the external subset or the internal subset, as described in [XML 1.0]. An example document, with the declaration in the internal subset, is shown in Example 1.

Example 1. Declaring and Using the ISO Latin 1 Character Entity Set

<!DOCTYPE doc [
<!ENTITY % iso-lat1 PUBLIC "ISO 8879:1986//ENTITIES Added Latin 1//EN//XML"
                    "http://www.oasis-open.org/docbook/xmlcharent/0.2/isolat1.ent">
%iso-lat1;
]>
<doc>
<p>This document declares the ISO Latin 1 Character Entity Set, providing
access to the ISO Latin 1 entities, such as "&eacute;" and "&copy;".</p>
</doc>

Note

Non-validating XML Parsers may choose not to process externally declared entities. This Standard does not alter the semantics of XML processors. If a processor does not see the declaration for an entity, it will not be able to report the correct replacement text for that entity.

1.1. Multi-Character Replacements

The replacement text of some entities includes more than a single Unicode character. Some characters are composed with the "combining reverse solidus overlay" (20E5) and some are composed with a variation selector (FE00, FE01, …).

1.2. Duplicate Entities

Historically, the inodot entity is multiply defined in iso-lat2.ent and iso-amso.ent. If both entity sets are included, some parsers will warn about redefinition of this entity. The warning can be ignored.

1.3. Entities with no Mapping

There are a small number of entities that have no [Unicode] representation. These entities are all mapped to the Unicode character "FFFD", the "replacement character".

Entity Name	Entity Set	Description
fjlig	iso-pub.ent	Small fj ligature
gnap	iso-amsn.ent	Greater, not approximate
jnodot	iso-amso.ent	Small j, no dot
lnap	iso-amsn.ent	Less, not approximate
lpargt	iso-amsc.ent	Greater than, left arc
nsmid	iso-amsn.ent	Negated short mid
prnE	iso-amsn.ent	Precedes, not double equals
rpargt	iso-amsc.ent	Right paren, greater than
scnE	iso-amsn.ent	Succeeds, not double equals
smid	iso-amsr.ent	shortmid r
vsubnE	iso-amsn.ent	Subset not double equals, variant

Users needing these characters will have to rely on the private use area or other non-portable mechanisms to access them.

1.4. Entities with Substituted Mappings

There are a few more for which there is no specific [Unicode] representation but where a reasonable substitution has been used:

Entity Name	Entity Set	Substitution	Description
bepsi	iso-amsr.ent	220D	Back epsilon: such that
ges	iso-amsr.ent	2265	Greater-or-equal, slanted
gvnE	iso-amsn.ent	2269	Gt, vert, not double equals
iff	iso-tech.ent	21D4	If and only if
les	iso-amsr.ent	2264	Less-than-or-equal, slanted
lozf	iso-pub.ent	2726	Lozenge, filled
lvnE	iso-amsn.ent	2268	Less, vert, not double equals
nge	iso-amsn.ent	2271	Neither greater-than nor equal to
nle	iso-amsn.ent	2270	Not less-than-or-equal
npre	iso-amsn.ent	22E0	Not precedes, equals
nsce	iso-amsn.ent	22E1	Not succeeds, equals
nspar	iso-amsn.ent	2226	Not short parallel
pre	iso-amsr.ent	227C	Precedes, equals
spar	iso-amsr.ent	2225	Short parallel
ssetmn	iso-amsb.ent	2216	Small set minus (reverse solidus)
star	iso-pub.ent	22C6	Star operator
starf	iso-pub.ent	2605	Black star
thkap	iso-amsr.ent	2248	Thick approximate
thksim	iso-amsr.ent	223C	Thick similar
vsubne	iso-amsn.ent	228A	Subset, not equals, variant
vsupnE	iso-amsn.ent	228B	Subset not double equals, variant
vsupne	iso-amsn.ent	228B	Superset, not equals, variant
xhArr	iso-amsa.ent	2194	Long left and right double arr
xharr	iso-amsa.ent	2194	Long left and right arr
xlArr	iso-amsa.ent	21D0	Long left double arrow
xrArr	iso-amsa.ent	21D2	Long right double arr
ssmile	iso-amsr.ent	2323	Small smile
sfrown	iso-amsr.ent	2322	Small frown

Users needing alternate glyphs for these characters will have to rely on redefining them to use the private use area or other non-portable mechanisms to access them.

2. XML Character Elements

Named XML entities (except for the five predefined entities) cannot be used if they are not declared. Entity declaration requires either an external or an internal subset. Some classes of applications forbid the occurrence of markup declarations in documents. For these documents, named character entities are inaccessible.

In this section, we introduce an XML vocabulary with the semantics of character entity reference. This Standard defines the semantics of elements and attributes declared in the "http://www.oasis-open.org/docbook/xmlcharent/names" namespace.

This namespace contains exactly one element, char. The char element has two attributes, entity and name. They are mutually exclusive.

The entity attribute identifies characters by their character entity names. (The set of valid names is the closed set of names associated with character entity sets defined by this Standard.) Case is significant in entity names.

The name attribute identifies characters by their Unicode character names. (The set of valid names is the set of character names published in the [Unicode] specification, or any later version of that specification.) Case is insignificant in character names.

The [RELAX NG] definition of this namespace is shown in figure Figure 1.

Figure 1. The RELAX NG Definition of the http://www.oasis-open.org/docbook/xmlcharent/names Namespace

<?xml version="1.0"?>
<grammar xmlns="http://relaxng.org/ns/structure/0.9"
         ns="http://www.oasis-open.org/docbook/xmlcharent/names">

<start>
  <element name="char">
    <choice>
      <attribute name="entity">
        <ref name="EntityNames"/>
      </attribute>
      <attribute name="name">
        <ref name="UnicodeNames"/>
      </attribute>
    </choice>
  </element>
</start>

<define name="EntityNames">
  <!-- Logically, this is the list of ISO 9573 Character Entity Names -->
  <!-- For now, just text. -->
  <text/>
</define>

<define name="UnicodeNames">
  <!-- Logically, this is the list of Unicode Character Names -->
  <!-- For now, just text. -->
  <text/>
</define>

</grammar>

Example 2 shows a sample document using this mechanism.

Example 2. Declaring and Using the ISO Latin 1 Character Entity Set

<doc xmlns:e="http://www.oasis-open.org/docbook/xmlcharent/names">
<p>This document uses the character names element to access
character entities, such as "<e:char name="eacute"/>" and
"<e:char name="COPYRIGHT SIGN"/>".</p>
</doc>

The character names element is limited to contexts where elements may occur. In particular, elements may not occur in XML attribute values. Note, however, that internationalization requirements such as bidirectional language support and Ruby already require structure in arbitrary contexts. It is probably an error to use attributes for human-readable content.

A. Added Latin 1

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Added Latin 1//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isolat1.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
aacute	00E1	=small a, acute accent
Aacute	00C1	=capital A, acute accent
acirc	00E2	=small a, circumflex accent
Acirc	00C2	=capital A, circumflex accent
agrave	00E0	=small a, grave accent
Agrave	00C0	=capital A, grave accent
aring	00E5	=small a, ring
Aring	00C5	=capital A, ring
atilde	00E3	=small a, tilde
Atilde	00C3	=capital A, tilde
auml	00E4	=small a, dieresis or umlaut mark
Auml	00C4	=capital A, dieresis or umlaut mark
aelig	00E6	=small ae diphthong (ligature)
AElig	00C6	=capital AE diphthong (ligature)
ccedil	00E7	=small c, cedilla
Ccedil	00C7	=capital C, cedilla
eth	00F0	=small eth, Icelandic
ETH	00D0	=capital Eth, Icelandic
eacute	00E9	=small e, acute accent
Eacute	00C9	=capital E, acute accent
ecirc	00EA	=small e, circumflex accent
Ecirc	00CA	=capital E, circumflex accent
egrave	00E8	=small e, grave accent
Egrave	00C8	=capital E, grave accent
euml	00EB	=small e, dieresis or umlaut mark
Euml	00CB	=capital E, dieresis or umlaut mark
iacute	00ED	=small i, acute accent
Iacute	00CD	=capital I, acute accent
icirc	00EE	=small i, circumflex accent
Icirc	00CE	=capital I, circumflex accent
igrave	00EC	=small i, grave accent
Igrave	00CC	=capital I, grave accent
iuml	00EF	=small i, dieresis or umlaut mark
Iuml	00CF	=capital I, dieresis or umlaut mark
ntilde	00F1	=small n, tilde
Ntilde	00D1	=capital N, tilde
oacute	00F3	=small o, acute accent
Oacute	00D3	=capital O, acute accent
ocirc	00F4	=small o, circumflex accent
Ocirc	00D4	=capital O, circumflex accent
ograve	00F2	=small o, grave accent
Ograve	00D2	=capital O, grave accent
oslash	00F8	=small o, slash
Oslash	00D8	=capital O, slash
otilde	00F5	=small o, tilde
Otilde	00D5	=capital O, tilde
ouml	00F6	=small o, dieresis or umlaut mark
Ouml	00D6	=capital O, dieresis or umlaut mark
szlig	00DF	=small sharp s, German (sz ligature)
thorn	00FE	=small thorn, Icelandic
THORN	00DE	=capital THORN, Icelandic
uacute	00FA	=small u, acute accent
Uacute	00DA	=capital U, acute accent
ucirc	00FB	=small u, circumflex accent
Ucirc	00DB	=capital U, circumflex accent
ugrave	00F9	=small u, grave accent
Ugrave	00D9	=capital U, grave accent
uuml	00FC	=small u, dieresis or umlaut mark
Uuml	00DC	=capital U, dieresis or umlaut mark
yacute	00FD	=small y, acute accent
Yacute	00DD	=capital Y, acute accent
yuml	00FF	=small y, dieresis or umlaut mark

B. Added Latin 2

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Added Latin 2//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isolat2.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
abreve	0103	=small a, breve
Abreve	0102	=capital A, breve
amacr	0101	=small a, macron
Amacr	0100	=capital A, macron
aogon	0105	=small a, ogonek
Aogon	0104	=capital A, ogonek
cacute	0107	=small c, acute accent
Cacute	0106	=capital C, acute accent
ccaron	010D	=small c, caron
Ccaron	010C	=capital C, caron
ccirc	0109	=small c, circumflex accent
Ccirc	0108	=capital C, circumflex accent
cdot	010B	=small c, dot above
Cdot	010A	=capital C, dot above
dcaron	010F	=small d, caron
Dcaron	010E	=capital D, caron
dstrok	0111	=small d, stroke
Dstrok	0110	=capital D, stroke
ecaron	011B	=small e, caron
Ecaron	011A	=capital E, caron
edot	0117	=small e, dot above
Edot	0116	=capital E, dot above
emacr	0113	=small e, macron
Emacr	0112	=capital E, macron
eogon	0119	=small e, ogonek
Eogon	0118	=capital E, ogonek
gacute	01F5	=small g, acute accent
gbreve	011F	=small g, breve
Gbreve	011E	=capital G, breve
Gcedil	0122	=capital G, cedilla
gcirc	011D	=small g, circumflex accent
Gcirc	011C	=capital G, circumflex accent
gdot	0121	=small g, dot above
Gdot	0120	=capital G, dot above
hcirc	0125	=small h, circumflex accent
Hcirc	0124	=capital H, circumflex accent
hstrok	0127	=small h, stroke
Hstrok	0126	=capital H, stroke
Idot	0130	=capital I, dot above
Imacr	012A	=capital I, macron
imacr	012B	=small i, macron
ijlig	0133	=small ij ligature
IJlig	0132	=capital IJ ligature
inodot	0131	/imath =small i, no dot
iogon	012F	=small i, ogonek
Iogon	012E	=capital I, ogonek
itilde	0129	=small i, tilde
Itilde	0128	=capital I, tilde
jcirc	0135	=small j, circumflex accent
Jcirc	0134	=capital J, circumflex accent
kcedil	0137	=small k, cedilla
Kcedil	0136	=capital K, cedilla
kgreen	0138	=small k, Greenlandic
lacute	013A	=small l, acute accent
Lacute	0139	=capital L, acute accent
lcaron	013E	=small l, caron
Lcaron	013D	=capital L, caron
lcedil	013C	=small l, cedilla
Lcedil	013B	=capital L, cedilla
lmidot	0140	=small l, middle dot
Lmidot	013F	=capital L, middle dot
lstrok	0142	=small l, stroke
Lstrok	0141	=capital L, stroke
nacute	0144	=small n, acute accent
Nacute	0143	=capital N, acute accent
eng	014B	=small eng, Lapp
ENG	014A	=capital ENG, Lapp
napos	0149	=small n, apostrophe
ncaron	0148	=small n, caron
Ncaron	0147	=capital N, caron
ncedil	0146	=small n, cedilla
Ncedil	0145	=capital N, cedilla
odblac	0151	=small o, double acute accent
Odblac	0150	=capital O, double acute accent
Omacr	014C	=capital O, macron
omacr	014D	=small o, macron
oelig	0153	=small oe ligature
OElig	0152	=capital OE ligature
racute	0155	=small r, acute accent
Racute	0154	=capital R, acute accent
rcaron	0159	=small r, caron
Rcaron	0158	=capital R, caron
rcedil	0157	=small r, cedilla
Rcedil	0156	=capital R, cedilla
sacute	015B	=small s, acute accent
Sacute	015A	=capital S, acute accent
scaron	0161	=small s, caron
Scaron	0160	=capital S, caron
scedil	015F	=small s, cedilla
Scedil	015E	=capital S, cedilla
scirc	015D	=small s, circumflex accent
Scirc	015C	=capital S, circumflex accent
tcaron	0165	=small t, caron
Tcaron	0164	=capital T, caron
tcedil	0163	=small t, cedilla
Tcedil	0162	=capital T, cedilla
tstrok	0167	=small t, stroke
Tstrok	0166	=capital T, stroke
ubreve	016D	=small u, breve
Ubreve	016C	=capital U, breve
udblac	0171	=small u, double acute accent
Udblac	0170	=capital U, double acute accent
umacr	016B	=small u, macron
Umacr	016A	=capital U, macron
uogon	0173	=small u, ogonek
Uogon	0172	=capital U, ogonek
uring	016F	=small u, ring
Uring	016E	=capital U, ring
utilde	0169	=small u, tilde
Utilde	0168	=capital U, tilde
wcirc	0175	=small w, circumflex accent
Wcirc	0174	=capital W, circumflex accent
ycirc	0177	=small y, circumflex accent
Ycirc	0176	=capital Y, circumflex accent
Yuml	0178	=capital Y, dieresis or umlaut mark
zacute	017A	=small z, acute accent
Zacute	0179	=capital Z, acute accent
zcaron	017E	=small z, caron
Zcaron	017D	=capital Z, caron
zdot	017C	=small z, dot above
Zdot	017B	=capital Z, dot above

C. Greek Letters

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Greek Letters//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isogrk1.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
agr	03B1	=small alpha, Greek
Agr	0391	=capital Alpha, Greek
bgr	03B2	=small beta, Greek
Bgr	0392	=capital Beta, Greek
ggr	03B3	=small gamma, Greek
Ggr	0393	=capital Gamma, Greek
dgr	03B4	=small delta, Greek
Dgr	0394	=capital Delta, Greek
egr	03B5	=small epsilon, Greek
Egr	0395	=capital Epsilon, Greek
zgr	03B6	=small zeta, Greek
Zgr	0396	=capital Zeta, Greek
eegr	03B7	=small eta, Greek
EEgr	0397	=capital Eta, Greek
thgr	03B8	=small theta, Greek
THgr	0398	=capital Theta, Greek
igr	03B9	=small iota, Greek
Igr	0399	=capital Iota, Greek
kgr	03BA	=small kappa, Greek
Kgr	039A	=capital Kappa, Greek
lgr	03BB	=small lambda, Greek
Lgr	039B	=capital Lambda, Greek
mgr	03BC	=small mu, Greek
Mgr	039C	=capital Mu, Greek
ngr	03BD	=small nu, Greek
Ngr	039D	=capital Nu, Greek
xgr	03BE	=small xi, Greek
Xgr	039E	=capital Xi, Greek
ogr	03BF	=small omicron, Greek
Ogr	039F	=capital Omicron, Greek
pgr	03C0	=small pi, Greek
Pgr	03A0	=capital Pi, Greek
rgr	03C1	=small rho, Greek
Rgr	03A1	=capital Rho, Greek
sgr	03C3	=small sigma, Greek
Sgr	03A3	=capital Sigma, Greek
sfgr	03C2	=final small sigma, Greek
tgr	03C4	=small tau, Greek
Tgr	03A4	=capital Tau, Greek
ugr	03C5	=small upsilon, Greek
Ugr	03A5	=capital Upsilon, Greek
phgr	03C6	=small phi, Greek
PHgr	03A6	=capital Phi, Greek
khgr	03C7	=small chi, Greek
KHgr	03A7	=capital Chi, Greek
psgr	03C8	=small psi, Greek
PSgr	03A8	=capital Psi, Greek
ohgr	03C9	=small omega, Greek
OHgr	03A9	=capital Omega, Greek

D. Monotoniko Greek

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Monotoniko Greek//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isogrk2.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
aacgr	03AC	=small alpha, accent, Greek
Aacgr	0386	=capital Alpha, accent, Greek
eacgr	03AD	=small epsilon, accent, Greek
Eacgr	0388	=capital Epsilon, accent, Greek
eeacgr	03AE	=small eta, accent, Greek
EEacgr	0389	=capital Eta, accent, Greek
idigr	03CA	=small iota, dieresis, Greek
Idigr	03AA	=capital Iota, dieresis, Greek
iacgr	03AF	=small iota, accent, Greek
Iacgr	038A	=capital Iota, accent, Greek
idiagr	0390	=small iota, dieresis, accent, Greek
oacgr	03CC	=small omicron, accent, Greek
Oacgr	038C	=capital Omicron, accent, Greek
udigr	03CB	=small upsilon, dieresis, Greek
Udigr	03AB	=capital Upsilon, dieresis, Greek
uacgr	03CD	=small upsilon, accent, Greek
Uacgr	038E	=capital Upsilon, accent, Greek
udiagr	03B0	=small upsilon, dieresis, accent, Greek
ohacgr	03CE	=small omega, accent, Greek
OHacgr	038F	=capital Omega, accent, Greek

E. Russian Cyrillic

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Russian Cyrillic//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isocyr1.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
acy	0430	=small a, Cyrillic
Acy	0410	=capital A, Cyrillic
bcy	0431	=small be, Cyrillic
Bcy	0411	=capital BE, Cyrillic
vcy	0432	=small ve, Cyrillic
Vcy	0412	=capital VE, Cyrillic
gcy	0433	=small ghe, Cyrillic
Gcy	0413	=capital GHE, Cyrillic
dcy	0434	=small de, Cyrillic
Dcy	0414	=capital DE, Cyrillic
iecy	0435	=small ie, Cyrillic
IEcy	0415	=capital IE, Cyrillic
iocy	0451	=small io, Russian
IOcy	0401	=capital IO, Russian
zhcy	0436	=small zhe, Cyrillic
ZHcy	0416	=capital ZHE, Cyrillic
zcy	0437	=small ze, Cyrillic
Zcy	0417	=capital ZE, Cyrillic
icy	0438	=small i, Cyrillic
Icy	0418	=capital I, Cyrillic
jcy	0439	=small short i, Cyrillic
Jcy	0419	=capital short I, Cyrillic
kcy	043A	=small ka, Cyrillic
Kcy	041A	=capital KA, Cyrillic
lcy	043B	=small el, Cyrillic
Lcy	041B	=capital EL, Cyrillic
mcy	043C	=small em, Cyrillic
Mcy	041C	=capital EM, Cyrillic
ncy	043D	=small en, Cyrillic
Ncy	041D	=capital EN, Cyrillic
ocy	043E	=small o, Cyrillic
Ocy	041E	=capital O, Cyrillic
pcy	043F	=small pe, Cyrillic
Pcy	041F	=capital PE, Cyrillic
rcy	0440	=small er, Cyrillic
Rcy	0420	=capital ER, Cyrillic
scy	0441	=small es, Cyrillic
Scy	0421	=capital ES, Cyrillic
tcy	0442	=small te, Cyrillic
Tcy	0422	=capital TE, Cyrillic
ucy	0443	=small u, Cyrillic
Ucy	0423	=capital U, Cyrillic
fcy	0444	=small ef, Cyrillic
Fcy	0424	=capital EF, Cyrillic
khcy	0445	=small ha, Cyrillic
KHcy	0425	=capital HA, Cyrillic
tscy	0446	=small tse, Cyrillic
TScy	0426	=capital TSE, Cyrillic
chcy	0447	=small che, Cyrillic
CHcy	0427	=capital CHE, Cyrillic
shcy	0448	=small sha, Cyrillic
SHcy	0428	=capital SHA, Cyrillic
shchcy	0449	=small shcha, Cyrillic
SHCHcy	0429	=capital SHCHA, Cyrillic
hardcy	044A	=small hard sign, Cyrillic
HARDcy	042A	=capital HARD sign, Cyrillic
ycy	044B	=small yeru, Cyrillic
Ycy	042B	=capital YERU, Cyrillic
softcy	044C	=small soft sign, Cyrillic
SOFTcy	042C	=capital SOFT sign, Cyrillic
ecy	044D	=small e, Cyrillic
Ecy	042D	=capital E, Cyrillic
yucy	044E	=small yu, Cyrillic
YUcy	042E	=capital YU, Cyrillic
yacy	044F	=small ya, Cyrillic
YAcy	042F	=capital YA, Cyrillic
numero	2116	=numero sign

F. Non-Russian Cyrillic

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Non-Russian Cyrillic//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isocyr2.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
djcy	0452	=small dje, Serbian
DJcy	0402	=capital DJE, Serbian
gjcy	0453	=small gje, Macedonian
GJcy	0403	=capital GJE Macedonian
jukcy	0454	=small je, Ukrainian
Jukcy	0404	=capital JE, Ukrainian
dscy	0455	=small dse, Macedonian
DScy	0405	=capital DSE, Macedonian
iukcy	0456	=small i, Ukrainian
Iukcy	0406	=capital I, Ukrainian
yicy	0457	=small yi, Ukrainian
YIcy	0407	=capital YI, Ukrainian
jsercy	0458	=small je, Serbian
Jsercy	0408	=capital JE, Serbian
ljcy	0459	=small lje, Serbian
LJcy	0409	=capital LJE, Serbian
njcy	045A	=small nje, Serbian
NJcy	040A	=capital NJE, Serbian
tshcy	045B	=small tshe, Serbian
TSHcy	040B	=capital TSHE, Serbian
kjcy	045C	=small kje Macedonian
KJcy	040C	=capital KJE, Macedonian
ubrcy	045E	=small u, Byelorussian
Ubrcy	040E	=capital U, Byelorussian
dzcy	045F	=small dze, Serbian
DZcy	040F	=capital dze, Serbian

G. Numeric and Special Graphic

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Numeric and Special Graphic//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isonum.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
half	00BD	=fraction one-half
frac12	00BD	=fraction one-half
frac14	00BC	=fraction one-quarter
frac34	00BE	=fraction three-quarters
frac18	215B	=fraction one-eighth
frac38	215C	=fraction three-eighths
frac58	215D	=fraction five-eighths
frac78	215E	=fraction seven-eighths
sup1	00B9	=superscript one
sup2	00B2	=superscript two
sup3	00B3	=superscript three
plus	002B	=plus sign B:
plusmn	00B1	/pm B: =plus-or-minus sign
lt	003C	=less-than sign R:
equals	003D	=equals sign R:
gt	003E	=greater-than sign R:
divide	00F7	/div B: =divide sign
times	00D7	/times B: =multiply sign
curren	00A4	=general currency sign
pound	00A3	=pound sign
dollar	0024	=dollar sign
cent	00A2	=cent sign
yen	00A5	/yen =yen sign
num	0023	=number sign
percnt	0025	=percent sign
amp	0026	=ampersand
ast	002A	/ast B: =asterisk
commat	0040	=commercial at
lsqb	005B	/lbrack O: =left square bracket
bsol	005C	/backslash =reverse solidus
rsqb	005D	/rbrack C: =right square bracket
lcub	007B	/lbrace O: =left curly bracket
horbar	2015	=horizontal bar
verbar	007C	/vert =vertical bar
rcub	007D	/rbrace C: =right curly bracket
micro	00B5	=micro sign
ohm	2126	=ohm sign
deg	00B0	=degree sign
ordm	00BA	=ordinal indicator, masculine
ordf	00AA	=ordinal indicator, feminine
sect	00A7	=section sign
para	00B6	=pilcrow (paragraph sign)
middot	00B7	/centerdot B: =middle dot
larr	2190	/leftarrow /gets A: =leftward arrow
rarr	2192	/rightarrow /to A: =rightward arrow
uarr	2191	/uparrow A: =upward arrow
darr	2193	/downarrow A: =downward arrow
copy	00A9	=copyright sign
reg	00AE	/circledR =registered sign
trade	2122	=trade mark sign
brvbar	00A6	=broken (vertical) bar
not	00AC	/neg /lnot =not sign
sung		=music note (sung text sign)
excl	0021	=exclamation mark
iexcl	00A1	=inverted exclamation mark
quot	0022	=quotation mark
apos	0027	=apostrophe
lpar	0028	O: =left parenthesis
rpar	0029	C: =right parenthesis
comma	002C	P: =comma
lowbar	005F	=low line
hyphen	002D	=hyphen
period	002E	=full stop, period
sol	002F	=solidus
colon	003A	/colon P:
semi	003B	=semicolon P:
quest	003F	=question mark
iquest	00BF	=inverted question mark
laquo	00AB	=angle quotation mark, left
raquo	00BB	=angle quotation mark, right
lsquo	2018	=single quotation mark, left
rsquo	2019	=single quotation mark, right
ldquo	201C	=double quotation mark, left
rdquo	201D	=double quotation mark, right
nbsp	00A0	=no break (required) space
shy	00AD	=soft hyphen

H. Diacritical Marks

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Diacritical Marks//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isodia.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
acute	00B4	=acute accent
breve	02D8	=breve
caron	02C7	=caron
cedil	00B8	=cedilla
circ	005E	=circumflex accent
dblac	02DD	=double acute accent
die	00A8	=dieresis
dot	02D9	=dot above
grave	0060	=grave accent
macr	00AF	=macron
ogon	02DB	=ogonek
ring	02DA	=ring
tilde	02DC	=tilde
uml	00A8	=umlaut mark

I. Publishing

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Publishing//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isopub.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
emsp	2003	=em space
ensp	2002	=en space (1/2-em)
emsp13	2004	=1/3-em space
emsp14	2005	=1/4-em space
numsp	2007	=digit space (width of a number)
puncsp	2008	=punctuation space (width of comma)
thinsp	2009	=thin space (1/6-em)
hairsp	200A	=hair space
mdash	2014	=em dash
ndash	2013	=en dash
dash	2010	=hyphen (true graphic)
blank	2423	=significant blank symbol
hellip	2026	=ellipsis (horizontal)
nldr	2025	=double baseline dot (en leader)
frac13	2153	=fraction one-third
frac23	2154	=fraction two-thirds
frac15	2155	=fraction one-fifth
frac25	2156	=fraction two-fifths
frac35	2157	=fraction three-fifths
frac45	2158	=fraction four-fifths
frac16	2159	=fraction one-sixth
frac56	215A	=fraction five-sixths
incare	2105	=in-care-of symbol
block	2588	=full block
uhblk	2580	=upper half block
lhblk	2584	=lower half block
blk14	2591	=25% shaded block
blk12	2592	=50% shaded block
blk34	2593	=75% shaded block
marker	25AE	=histogram marker
cir	25CB	/circ B: =circle, open
squ	25A1	=square, open
rect	25AD	=rectangle, open
utri	25B5	/triangle =up triangle, open
dtri	25BF	/triangledown =down triangle, open
star	22C6	=star, open
bull	2022	/bullet B: =round bullet, filled
squf	25AA	/blacksquare =sq bullet, filled
utrif	25B4	/blacktriangle =up tri, filled
dtrif	25BE	/blacktriangledown =dn tri, filled
ltrif	25C2	/blacktriangleleft R: =l tri, filled
rtrif	25B8	/blacktriangleright R: =r tri, filled
clubs	2663	/clubsuit =club suit symbol
diams	2666	/diamondsuit =diamond suit symbol
hearts	2661	/heartsuit =heart suit symbol
spades	2660	/spadesuit =spades suit symbol
malt	2720	/maltese =maltese cross
dagger	2020	/dagger B: =dagger
Dagger	2021	/ddagger B: =double dagger
check	2713	/checkmark =tick, check mark
cross	2717	=ballot cross
sharp	266F	/sharp =musical sharp
flat	266D	/flat =musical flat
male	2642	=male symbol
female	2640	=female symbol
phone	260E	=telephone symbol
telrec	2315	=telephone recorder symbol
copysr	2117	=sound recording copyright sign
caret	2041	=caret (insertion mark)
lsquor	201A	=rising single quote, left (low)
ldquor	201E	=rising dbl quote, left (low)
fflig	FB00	small ff ligature
filig	FB01	small fi ligature
fjlig	FFFD	small fj ligature
ffilig	FB03	small ffi ligature
ffllig	FB04	small ffl ligature
fllig	FB02	small fl ligature
mldr	2026	em leader
rdquor	201D	rising dbl quote, right (high)
rsquor	2019	rising single quote, right (high)
vellip	22EE	vertical ellipsis
hybull	2043	rectangle, filled (hyphen bullet)
loz	25CA	/lozenge - lozenge or total mark
lozf	2726	/blacklozenge - lozenge, filled
ltri	25C3	/triangleleft B: l triangle, open
rtri	25B9	/triangleright B: r triangle, open
starf	2605	/bigstar - star, filled
natur	266E	/natural - music natural
rx	211E	pharmaceutical prescription (Rx)
sext	2736	sextile (6-pointed star)
target	2316	register mark or target
dlcrop	230D	downward left crop mark
drcrop	230C	downward right crop mark
ulcrop	230F	upward left crop mark
urcrop	230E	upward right crop mark

J. Box and Line Drawing

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Box and Line Drawing//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isobox.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
boxh	2500	horizontal line
boxv	2502	vertical line
boxur	2514	upper right quadrant
boxul	2518	upper left quadrant
boxdl	2510	lower left quadrant
boxdr	250C	lower right quadrant
boxvr	251C	upper and lower right quadrants
boxhu	2534	upper left and right quadrants
boxvl	2524	upper and lower left quadrants
boxhd	252C	lower left and right quadrants
boxvh	253C	all four quadrants
boxvR	255E	upper and lower right quadrants
boxhU	2568	upper left and right quadrants
boxvL	2561	upper and lower left quadrants
boxhD	2565	lower left and right quadrants
boxvH	256A	all four quadrants
boxH	2550	horizontal line
boxV	2551	vertical line
boxUR	255A	upper right quadrant
boxUL	255D	upper left quadrant
boxDL	2557	lower left quadrant
boxDR	2554	lower right quadrant
boxVR	2560	upper and lower right quadrants
boxHU	2569	upper left and right quadrants
boxVL	2563	upper and lower left quadrants
boxHD	2566	lower left and right quadrants
boxVH	256C	all four quadrants
boxVr	255F	upper and lower right quadrants
boxHu	2567	upper left and right quadrants
boxVl	2562	upper and lower left quadrants
boxHd	2564	lower left and right quadrants
boxVh	256B	all four quadrants
boxuR	2558	upper right quadrant
boxUl	255C	upper left quadrant
boxdL	2555	lower left quadrant
boxDr	2553	lower right quadrant
boxUr	2559	upper right quadrant
boxuL	255B	upper left quadrant
boxDl	2556	lower left quadrant
boxdR	2552	lower right quadrant

K. General Technical

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES General Technical//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isotech.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
aleph	2135	/aleph =aleph, Hebrew
and	2227	/wedge /land B: =logical and
ang90	221F	=right (90 degree) angle
angsph	2222	/sphericalangle =angle-spherical
ap	2248	/approx R: =approximate
becaus	2235	/because R: =because
bottom	22A5	/bot B: =perpendicular
cap	2229	/cap B: =intersection
cong	2245	/cong R: =congruent with
conint	222E	/oint L: =contour integral operator
cup	222A	/cup B: =union or logical sum
equiv	2261	/equiv R: =identical with
exist	2203	/exists =at least one exists
forall	2200	/forall =for all
fnof	0192	=function of (italic small f)
ge	2265	/geq /ge R: =greater-than-or-equal
iff	21D4	/iff =if and only if
infin	221E	/infty =infinity
int	222B	/int L: =integral operator
isin	2208	/in R: =set membership
lang	3008	/langle O: =left angle bracket
lArr	21D0	/Leftarrow A: =is implied by
le	2264	/leq /le R: =less-than-or-equal
minus	2212	B: =minus sign
mnplus	2213	/mp B: =minus-or-plus sign
nabla	2207	/nabla =del, Hamilton operator
ne	2260	/ne /neq R: =not equal
ni	220B	/ni /owns R: =contains
or	2228	/vee /lor B: =logical or
par	2225	/parallel R: =parallel
part	2202	/partial =partial differential
permil	2030	=per thousand
perp	22A5	/perp R: =perpendicular
prime	2032	/prime =prime or minute
Prime	2033	=double prime or second
prop	221D	/propto R: =is proportional to
radic	221A	/surd =radical
rang	3009	/rangle C: =right angle bracket
rArr	21D2	/Rightarrow A: =implies
sim	223C	/sim R: =similar
sime	2243	/simeq R: =similar, equals
square	25A1	/square B: =square
sub	2282	/subset R: =subset or is implied by
sube	2286	/subseteq R: =subset, equals
sup	2283	/supset R: =superset or implies
supe	2287	/supseteq R: =superset, equals
there4	2234	/therefore R: =therefore
Verbar	2016	/Vert =dbl vertical bar
angst	212B	Angstrom =capital A, ring
bernou	212C	Bernoulli function (script capital B)
compfn	2218	B: composite function (small circle)
Dot	00A8	=dieresis or umlaut mark
DotDot	20DC	four dots above
hamilt	210B	Hamiltonian (script capital H)
lagran	2112	Lagrangian (script capital L)
lowast	2217	low asterisk
notin	2209	N: negated set membership
order	2134	order of (script small o)
phmmat	2133	physics M-matrix (script capital M)
tdot	20DB	three dots above
tprime	2034	triple prime
wedgeq	2259	R: corresponds to (wedge, equals)

L. Greek Symbols

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Greek Symbols//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isogrk3.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
alpha	03B1	=small alpha, Greek
beta	03B2	=small beta, Greek
gamma	03B3	=small gamma, Greek
Gamma	0393	=capital Gamma, Greek
gammad	03DC	/digamma
delta	03B4	=small delta, Greek
Delta	0394	=capital Delta, Greek
epsi	03B5	=small epsilon, Greek
epsiv	025B	/varepsilon
epsis	03B5	/straightepsilon
zeta	03B6	=small zeta, Greek
eta	03B7	=small eta, Greek
thetas	03B8	straight theta
Theta	0398	=capital Theta, Greek
thetav	03D1	/vartheta - curly or open theta
iota	03B9	=small iota, Greek
kappa	03BA	=small kappa, Greek
kappav	03F0	/varkappa
lambda	03BB	=small lambda, Greek
Lambda	039B	=capital Lambda, Greek
mu	03BC	=small mu, Greek
nu	03BD	=small nu, Greek
xi	03BE	=small xi, Greek
Xi	039E	=capital Xi, Greek
pi	03C0	=small pi, Greek
piv	03D6	/varpi
Pi	03A0	=capital Pi, Greek
rho	03C1	=small rho, Greek
rhov	03F1	/varrho
sigma	03C3	=small sigma, Greek
Sigma	03A3	=capital Sigma, Greek
sigmav	03C2	/varsigma
tau	03C4	=small tau, Greek
upsi	03C5	=small upsilon, Greek
Upsi	03D2	=capital Upsilon, Greek
phis	03C6	/straightphi - straight phi
Phi	03A6	=capital Phi, Greek
phiv	03D5	/varphi - curly or open phi
chi	03C7	=small chi, Greek
psi	03C8	=small psi, Greek
Psi	03A8	=capital Psi, Greek
omega	03C9	=small omega, Greek
Omega	03A9	=capital Omega, Greek

M. Alternative Greek Symbols

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Alternative Greek Symbols//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isogrk4.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
b.alpha	03B1	=small alpha, Greek
b.beta	03B2	=small beta, Greek
b.gamma	03B3	=small gamma, Greek
b.Gamma	0393	=capital Gamma, Greek
b.gammad	03DC	/digamma
b.delta	03B4	=small delta, Greek
b.Delta	0394	=capital Delta, Greek
b.epsi	03B5	=small epsilon, Greek
b.epsiv	025B	/varepsilon
b.epsis	03B5	/straightepsilon
b.zeta	03B6	=small zeta, Greek
b.eta	03B7	=small eta, Greek
b.thetas	03B8	straight theta
b.Theta	0398	=capital Theta, Greek
b.thetav	03D1	/vartheta - curly or open theta
b.iota	03B9	=small iota, Greek
b.kappa	03BA	=small kappa, Greek
b.kappav	03F0	/varkappa
b.lambda	03BB	=small lambda, Greek
b.Lambda	039B	=capital Lambda, Greek
b.mu	03BC	=small mu, Greek
b.nu	03BD	=small nu, Greek
b.xi	03BE	=small xi, Greek
b.Xi	039E	=capital Xi, Greek
b.pi	03C0	=small pi, Greek
b.Pi	03A0	=capital Pi, Greek
b.piv	03D6	/varpi
b.rho	03C1	=small rho, Greek
b.rhov	03F1	/varrho
b.sigma	03C3	=small sigma, Greek
b.Sigma	03A3	=capital Sigma, Greek
b.sigmav	03C2	/varsigma
b.tau	03C4	=small tau, Greek
b.upsi	03C5	=small upsilon, Greek
b.Upsi	03D2	=capital Upsilon, Greek
b.phis	03C6	/straightphi - straight phi
b.Phi	03A6	=capital Phi, Greek
b.phiv	03D5	/varphi - curly or open phi
b.chi	03C7	=small chi, Greek
b.psi	03C8	=small psi, Greek
b.Psi	03A8	=capital Psi, Greek
b.omega	03C9	=small omega, Greek
b.Omega	03A9	=capital Omega, Greek

N. Added Math Symbols: Ordinary

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Added Math Symbols: Ordinary//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isoamso.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
ang	2220	/angle - angle
angmsd	2221	/measuredangle - angle-measured
beth	2136	/beth - beth, Hebrew
bprime	2035	/backprime - reverse prime
comp	2201	/complement - complement sign
daleth	2138	/daleth - daleth, Hebrew
ell	2113	/ell - cursive small l
empty	2205	/emptyset /varnothing =small o, slash
gimel	2137	/gimel - gimel, Hebrew
image	2111	/Im - imaginary
inodot	0131	/imath =small i, no dot
jnodot	FFFD	/jmath - small j, no dot
nexist	2204	/nexists - negated exists
oS	24C8	/circledS - capital S in circle
planck	0127	/hbar /hslash - Planck's over 2pi
real	211C	/Re - real
sbsol	FE68	/sbs - short reverse solidus
vprime	2032	/varprime - prime, variant
weierp	2118	/wp - Weierstrass p

O. Added Math Symbols: Binary Operators

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Added Math Symbols: Binary Operators//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isoamsb.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
amalg	2201	/amalg B: amalgamation or coproduct
Barwed	22BC	/doublebarwedge B: log and, dbl bar
barwed	22BC	/barwedge B: logical and, bar above
Cap	22D2	/Cap /doublecap B: dbl intersection
Cup	22D3	/Cup /doublecup B: dbl union
cuvee	22CE	/curlyvee B: curly logical or
cuwed	22CF	/curlywedge B: curly logical and
diam	22C4	/diamond B: open diamond
divonx	22C7	/divideontimes B: division on times
intcal	22BA	/intercal B: intercal
lthree	22CB	/leftthreetimes B:
ltimes	22C9	/ltimes B: times sign, left closed
minusb	229F	/boxminus B: minus sign in box
oast	229B	/circledast B: asterisk in circle
ocir	229A	/circledcirc B: open dot in circle
odash	229D	/circleddash B: hyphen in circle
odot	2299	/odot B: middle dot in circle
ominus	2296	/ominus B: minus sign in circle
oplus	2295	/oplus B: plus sign in circle
osol	2298	/oslash B: solidus in circle
otimes	2297	/otimes B: multiply sign in circle
plusb	229E	/boxplus B: plus sign in box
plusdo	2214	/dotplus B: plus sign, dot above
rthree	22CC	/rightthreetimes B:
rtimes	22CA	/rtimes B: times sign, right closed
sdot	22C5	/cdot B: small middle dot
sdotb	22A1	/dotsquare /boxdot B: small dot in box
setmn	2216	/setminus B: reverse solidus
sqcap	2293	/sqcap B: square intersection
sqcup	2294	/sqcup B: square union
ssetmn	2216	/smallsetminus B: sm reverse solidus
sstarf	22C6	/star B: small star, filled
timesb	22A0	/boxtimes B: multiply sign in box
top	22A4	/top B: inverted perpendicular
uplus	228E	/uplus B: plus sign in union
wreath	2240	/wr B: wreath product
xcirc	25EF	/bigcirc B: large circle
xdtri	25BD	/bigtriangledown B: big dn tri, open
xutri	25B3	/bigtriangleup B: big up tri, open
coprod	2210	/coprod L: coproduct operator
prod	220F	/prod L: product operator
sum	2211	/sum L: summation operator

P. Added Math Symbols: Relations

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Added Math Symbols: Relations//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isoamsr.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
ape	224A	/approxeq R: approximate, equals
asymp	224D	/asymp R: asymptotically equal to
bcong	224C	/backcong R: reverse congruent
bepsi	220D	/backepsilon R: such that
bowtie	22C8	/bowtie R:
bsim	223D	/backsim R: reverse similar
bsime	22CD	/backsimeq R: reverse similar, eq
bump	224E	/Bumpeq R: bumpy equals
bumpe	224F	/bumpeq R: bumpy equals, equals
cire	2257	/circeq R: circle, equals
colone	2254	/coloneq R: colon, equals
cuepr	22DE	/curlyeqprec R: curly eq, precedes
cuesc	22DF	/curlyeqsucc R: curly eq, succeeds
cupre	227C	/curlypreceq R: curly precedes, eq
dashv	22A3	/dashv R: dash, vertical
ecir	2256	/eqcirc R: circle on equals sign
ecolon	2255	/eqcolon R: equals, colon
eDot	2251	/doteqdot /Doteq R: eq, even dots
esdot	2250	/doteq R: equals, single dot above
efDot	2252	/fallingdotseq R: eq, falling dots
egs	22DD	/eqslantgtr R: equal-or-gtr, slanted
els	22DC	/eqslantless R: eq-or-less, slanted
erDot	2253	/risingdotseq R: eq, rising dots
fork	22D4	/pitchfork R: pitchfork
frown	2322	/frown R: down curve
gap	2273	/gtrapprox R: greater, approximate
gsdot	22D7	/gtrdot R: greater than, single dot
gE	2267	/geqq R: greater, double equals
gel	22DB	/gtreqless R: greater, equals, less
gEl	22DB	/gtreqqless R: gt, dbl equals, less
ges	2265	/geqslant R: gt-or-equal, slanted
Gg	22D9	/ggg /Gg /gggtr R: triple gtr-than
gl	2277	/gtrless R: greater, less
gsim	2273	/gtrsim R: greater, similar
Gt	226B	/gg R: dbl greater-than sign
lap	2272	/lessapprox R: less, approximate
ldot	22D6	/lessdot R: less than, with dot
lE	2266	/leqq R: less, double equals
lEg	22DA	/lesseqqgtr R: less, dbl eq, greater
leg	22DA	/lesseqgtr R: less, eq, greater
les	2264	/leqslant R: less-than-or-eq, slant
lg	2276	/lessgtr R: less, greater
Ll	22D8	/Ll /lll /llless R: triple less-than
lsim	2272	/lesssim R: less, similar
Lt	226A	/ll R: double less-than sign
ltrie	22B4	/trianglelefteq R: left triangle, eq
mid	2223	/mid R:
models	22A7	/models R:
pr	227A	/prec R: precedes
prap	227E	/precapprox R: precedes, approximate
pre	227C	/preceq R: precedes, equals
prsim	227E	/precsim R: precedes, similar
rtrie	22B5	/trianglerighteq R: right tri, eq
samalg	2210	/smallamalg R: small amalg
sc	227B	/succ R: succeeds
scap	227F	/succapprox R: succeeds, approximate
sccue	227D	/succcurlyeq R: succeeds, curly eq
sce	227D	/succeq R: succeeds, equals
scsim	227F	/succsim R: succeeds, similar
sfrown	2322	/smallfrown R: small down curve
smid	FFFD
smile	2323	/smile R: up curve
spar	2225	/shortparallel R: short parallel
sqsub	228F	/sqsubset R: square subset
sqsube	2291	/sqsubseteq R: square subset, equals
sqsup	2290	/sqsupset R: square superset
sqsupe	2292	/sqsupseteq R: square superset, eq
ssmile	2323	/smallsmile R: small up curve
Sub	22D0	/Subset R: double subset
subE	2286	/subseteqq R: subset, dbl equals
Sup	22D1	/Supset R: dbl superset
supE	2287	/supseteqq R: superset, dbl equals
thkap	2248	/thickapprox R: thick approximate
thksim	223C	/thicksim R: thick similar
trie	225C	/triangleq R: triangle, equals
twixt	226C	/between R: between
vdash	22A2	/vdash R: vertical, dash
Vdash	22A9	/Vdash R: dbl vertical, dash
vDash	22A8	/vDash R: vertical, dbl dash
veebar	22BB	/veebar R: logical or, bar below
vltri	22B2	/vartriangleleft R: l tri, open, var
vprop	221D	/varpropto R: proportional, variant
vrtri	22B3	/vartriangleright R: r tri, open, var
Vvdash	22AA	/Vvdash R: triple vertical, dash

Q. Added Math Symbols: Negated Relations

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Added Math Symbols: Negated Relations//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isoamsn.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
gnap	FFFD	greater, not approximate
gne	2269	/gneq N: greater, not equals
gnE	2269	/gneqq N: greater, not dbl equals
gnsim	22E7	/gnsim N: greater, not similar
gvnE	2269	/gvertneqq N: gt, vert, not dbl eq
lnap	FFFD	less, not approximate
lnE	2268	/lneqq N: less, not double equals
lne	2268	/lneq N: less, not equals
lnsim	22E6	/lnsim N: less, not similar
lvnE	2268	/lvertneqq N: less, vert, not dbl eq
nap	2249	/napprox N: not approximate
ncong	2247	/ncong N: not congruent with
nequiv	2262	/nequiv N: not identical with
ngE	2271	/ngeqq N: not greater, dbl equals
nge	2271	/ngeq N: not greater-than-or-equal
nges	2271	/ngeqslant N: not gt-or-eq, slanted
ngt	226F	/ngtr N: not greater-than
nle	2270	/nleq N: not less-than-or-equal
nlE	2270	/nleqq N: not less, dbl equals
nles	2270	/nleqslant N: not less-or-eq, slant
nlt	226E	/nless N: not less-than
nltri	22EA	/ntriangleleft N: not left triangle
nltrie	22EC	/ntrianglelefteq N: not l tri, eq
nmid	2224	/nmid
npar	2226	/nparallel N: not parallel
npr	2280	/nprec N: not precedes
npre	22E0	/npreceq N: not precedes, equals
nrtri	22EB	/ntriangleright N: not rt triangle
nrtrie	22ED	/ntrianglerighteq N: not r tri, eq
nsc	2281	/nsucc N: not succeeds
nsce	22E1	/nsucceq N: not succeeds, equals
nsim	2241	/nsim N: not similar
nsime	2244	/nsimeq N: not similar, equals
nsmid	FFFD	/nshortmid
nspar	2226	/nshortparallel N: not short par
nsub	2284	/nsubset N: not subset
nsube	2288	/nsubseteq N: not subset, equals
nsubE	2288	/nsubseteqq N: not subset, dbl eq
nsup	2285	/nsupset N: not superset
nsupE	2289	/nsupseteqq N: not superset, dbl eq
nsupe	2289	/nsupseteq N: not superset, equals
nvdash	22AC	/nvdash N: not vertical, dash
nvDash	22AD	/nvDash N: not vertical, dbl dash
nVDash	22AF	/nVDash N: not dbl vert, dbl dash
nVdash	22AE	/nVdash N: not dbl vertical, dash
prnap	22E8	/precnapprox N: precedes, not approx
prnE	FFFD	precedes, not dbl eq
prnsim	22E8	/precnsim N: precedes, not similar
scnap	22E9	/succnapprox N: succeeds, not approx
scnE	FFFD	succeeds, not dbl eq
scnsim	22E9	/succnsim N: succeeds, not similar
subne	228A	/subsetneq N: subset, not equals
subnE	228A	/subsetneqq N: subset, not dbl eq
supne	228B	/supsetneq N: superset, not equals
supnE	228B	/supsetneqq N: superset, not dbl eq
vsubnE	FFFD	subset not dbl eq, var
vsubne	228A	/subsetneq N: subset, not eq, var
vsupne	228B	/supsetneq N: superset, not eq, var
vsupnE	228B	/supsetneqq N: super not dbl eq, var

R. Added Math Symbols: Arrow Relations

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Added Math Symbols: Arrow Relations//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isoamsa.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
cularr	21B6	/curvearrowleft A: left curved arrow
curarr	21B7	/curvearrowright A: rt curved arrow
dArr	21D3	/Downarrow A: down dbl arrow
darr2	21CA	/downdownarrows A: two down arrows
dharl	21C3	/downleftharpoon A: dn harpoon-left
dharr	21C2	/downrightharpoon A: down harpoon-rt
lAarr	21DA	/Lleftarrow A: left triple arrow
Larr	219E	/twoheadleftarrow A:
larr2	21C7	/leftleftarrows A: two left arrows
larrhk	21A9	/hookleftarrow A: left arrow-hooked
larrlp	21AB	/looparrowleft A: left arrow-looped
larrtl	21A2	/leftarrowtail A: left arrow-tailed
lhard	21BD	/leftharpoondown A: l harpoon-down
lharu	21BC	/leftharpoonup A: left harpoon-up
hArr	21D4	/Leftrightarrow A: l&r dbl arrow
harr	2194	/leftrightarrow A: l&r arrow
lrarr2	21C6	/leftrightarrows A: l arr over r arr
rlarr2	21C4	/rightleftarrows A: r arr over l arr
harrw	21AD	/leftrightsquigarrow A: l&r arr-wavy
rlhar2	21CC	/rightleftharpoons A: r harp over l
lrhar2	21CB	/leftrightharpoons A: l harp over r
lsh	21B0	/Lsh A:
map	21A6	/mapsto A:
mumap	22B8	/multimap A:
nearr	2197	/nearrow A: NE pointing arrow
nlArr	21CD	/nLeftarrow A: not implied by
nlarr	219A	/nleftarrow A: not left arrow
nhArr	21CE	/nLeftrightarrow A: not l&r dbl arr
nharr	21AE	/nleftrightarrow A: not l&r arrow
nrarr	219B	/nrightarrow A: not right arrow
nrArr	21CF	/nRightarrow A: not implies
nwarr	2196	/nwarrow A: NW pointing arrow
olarr	21BA	/circlearrowleft A: l arr in circle
orarr	21BB	/circlearrowright A: r arr in circle
rAarr	21DB	/Rrightarrow A: right triple arrow
Rarr	21A0	/twoheadrightarrow A:
rarr2	21C9	/rightrightarrows A: two rt arrows
rarrhk	21AA	/hookrightarrow A: rt arrow-hooked
rarrlp	21AC	/looparrowright A: rt arrow-looped
rarrtl	21A3	/rightarrowtail A: rt arrow-tailed
rarrw	21DD	/squigarrowright A: rt arrow-wavy
rhard	21C1	/rightharpoondown A: rt harpoon-down
rharu	21C0	/rightharpoonup A: rt harpoon-up
rsh	21B1	/Rsh A:
drarr	2198	/searrow A: downward rt arrow
dlarr	2199	/swarrow A: downward l arrow
uArr	21D1	/Uparrow A: up dbl arrow
uarr2	21C8	/upuparrows A: two up arrows
vArr	21D5	/Updownarrow A: up&down dbl arrow
varr	2195	/updownarrow A: up&down arrow
uharl	21BF	/upleftharpoon A: up harpoon-left
uharr	21BE	/uprightharpoon A: up harp-r
xlArr	21D0	/Longleftarrow A: long l dbl arrow
xhArr	2194	/Longleftrightarrow A: long l&r dbl arr
xharr	2194	/longleftrightarrow A: long l&r arr
xrArr	21D2	/Longrightarrow A: long rt dbl arr

S. Added Math Symbols: Delimiters

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Added Math Symbols: Delimiters//EN//XML

System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.2/isoamsc.ent

The following character entities are defined in this entity set:

Entity Name	Unicode Code point	Description
rceil	2309	/rceil C: right ceiling
rfloor	230B	/rfloor C: right floor
rpargt	FFFD	right paren, gt
urcorn	231D	/urcorner C: upper right corner
drcorn	231F	/lrcorner C: downward right corner
lceil	2308	/lceil O: left ceiling
lfloor	230A	/lfloor O: left floor
lpargt	FFFD	left parenthesis, gt
ulcorn	231C	/ulcorner O: upper left corner
dlcorn	231E	/llcorner O: downward left corner

T. Unicode Glyphs

The Unicode reference glyphs in this document are examples only. Some characters have more than one Unicode representation and different Unicode characters may be appropriate in different contexts. The glyph images offer only one of many possible representations for the specified character.

Most of the glyphs this reference are from the TmsPF Roman font by Production First Software. A few glyphs are from Everson Mono.

Unicode support requires much more than a simple character to glyph mapping; for more information on Unicode, consult The Unicode Standard, Version 2.0 and Unicode Technical Report #8, which describes Unicode Version 2.1.

U. OASIS DocBook Technical Committee (Non-Normative)

Dennis Evans
Dick Hamilton
Nancy (Paisner) Harrison
Sabine Ocker
Michael Smith
Bob Stayton
Norman Walsh (Chair)

References

Normative

[ISO 8879:1986] JTC 1, SC 34. ISO 8879:1986 Information processing -- Text and office systems -- Standard Generalized Markup Language (SGML). 1986.

[XML 1.0] Tim Bray, Jean Paoli, C. M. Sperberg-McQueen, and Eve Maler, editors. Extensible Markup Language (XML) 1.0 Second Edition. World Wide Web Consortium, 2000.

[Namespaces] Tim Bray, Dave Hollander, and Andrew Layman, editors. Namespaces in XML. World Wide Web Consortium, 1999.

[RELAX NG] James Clark, editor. RELAX NG Specification (Committee Specification). OASIS. 2001.

[Unicode] The Unicode Consortium. The Unicode Standard, Version 2.0. Addison-Wesley Developers Press. Reading, Mass. 1996.

XML Character Entities Version 0.2

OASIS DocBook Technical Committee

Working Draft 19 Mar 2002

Abstract

Status of this Document

Table of Contents

Appendixes

1. XML Character Entity Sets

Note

1.1. Multi-Character Replacements

1.2. Duplicate Entities

1.3. Entities with no Mapping

1.4. Entities with Substituted Mappings

2. XML Character Elements

A. Added Latin 1

B. Added Latin 2

C. Greek Letters

D. Monotoniko Greek

E. Russian Cyrillic

F. Non-Russian Cyrillic

G. Numeric and Special Graphic

H. Diacritical Marks

I. Publishing

J. Box and Line Drawing

K. General Technical

L. Greek Symbols

M. Alternative Greek Symbols

N. Added Math Symbols: Ordinary

O. Added Math Symbols: Binary Operators

P. Added Math Symbols: Relations

Q. Added Math Symbols: Negated Relations

R. Added Math Symbols: Arrow Relations

S. Added Math Symbols: Delimiters

T. Unicode Glyphs

U. OASIS DocBook Technical Committee (Non-Normative)

References

Normative