XML Character Entities Version 0.1

OASIS DocBook Technical Committee

Working Draft 19 Nov 2001

This version:
Working Draft: 19 Nov 2001
Editor:
Norman Walsh <Norman.Walsh@Sun.COM>

This document and translations of it may be copied and furnished to others, and derivative works that comment on or otherwise explain it or assist in its implementation may be prepared, copied, published and distributed, in whole or in part, without restriction of any kind, provided that the above copyright notice and this paragraph are included on all such copies and derivative works. However, this document itself may not be modified in any way, such as by removing the copyright notice or references to OASIS, except as needed for the purpose of developing OASIS specifications, in which case the procedures for copyrights defined in the OASIS Intellectual Property Rights document must be followed, or as required to translate it into languages other than English.

The limited permissions granted above are perpetual and will not be revoked by OASIS or its successors or assigns.

This document and the information contained herein is provided on an "AS IS" basis and OASIS DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.


Abstract

Non-normative Annex D of [ISO 8879:1986] defines 19 standard character entity sets. The SGML declarations for these entities use the specific character data (SDATA) entity type. The SDATA entity type is not supported in XML. This Standard defines a set of XML alternatives to the 19 standard character entity sets.

Status of this Document

This is a working draft constructed by the editor. It is not an official committee work product and may not reflect the consensus opinion of the committee.

Table of Contents

1. XML Character Entity Sets
1.1. Added Latin 1
1.2. Added Latin 2
1.3. Greek Letters
1.4. Monotoniko Greek
1.5. Russian Cyrillic
1.6. Non-Russian Cyrillic
1.7. Numeric and Special Graphic
1.8. Diacritical Marks
1.9. Publishing
1.10. Box and Line Drawing
1.11. General Technical
1.12. Greek Symbols
1.13. Alternative Greek Symbols
1.14. Added Math Symbols: Ordinary
1.15. Added Math Symbols: Binary Operators
1.16. Added Math Symbols: Relations
1.17. Added Math Symbols: Negated Relations
1.18. Added Math Symbols: Arrow Relations
1.19. Added Math Symbols: Delimiters

Appendixes

A. Unicode Glyphs
B. OASIS DocBook Technical Committee (Non-Normative)
References

Non-normative Annex D of [ISO 8879:1986] defines 19 standard character entity sets (Added Latin 1, Added Latin 2, Greek Letters, Monotoniko Greek, Russian Cyrillic, Non-Russian Cyrillic, Numeric and Special Graphic, Diacritical Marks, Publishing, Box and Line Drawing, General Technical, Greek Symbols, Alternative Greek Symbols, Added Math Symbols: Ordinary, Added Math Symbols: Binary Operators, Added Math Symbols: Relations, Added Math Symbols: Negated Relations, Added Math Symbols: Arrow Relations, Added Math Symbols: Delimiters). The SGML declarations for these entities use the specific character data (SDATA) entity type. The SDATA entity type is not supported in XML, so alternative XML declarations must be used. This Standard defines a set of XML alternatives to the 19 standard character entity sets.

In XML, the specific character data of each entity can be expressed as a [Unicode] character.

1. XML Character Entity Sets

Note

The Unicode reference glyphs in this document are examples only. Some characters have more than one Unicode representation and different Unicode characters may be appropriate in different contexts. The glyph images offer only one of many possible representations for the specified character.

1.1. Added Latin 1

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Added Latin 1//EN//XML
System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.1/isolat1.ent

The following character entities are defined in this entity set:

Entity
Name
Unicode
Code point
Sample
Glyph
Description
aacute00E1
Unicode 00E1
Latin small letter a with acute
Aacute00C1
Unicode 00C1
Latin capital letter A with acute
acirc00E2
Unicode 00E2
Latin small letter a with circumflex
Acirc00C2
Unicode 00C2
Latin capital letter A with circumflex
agrave00E0
Unicode 00E0
Latin small letter a with grave
Agrave00C0
Unicode 00C0
Latin capital letter A with grave
aring00E5
Unicode 00E5
Latin small letter a with ring above
Aring00C5
Unicode 00C5
Latin capital letter A with ring above
atilde00E3
Unicode 00E3
Latin small letter a with tilde
Atilde00C3
Unicode 00C3
Latin capital letter A with tilde
auml00E4
Unicode 00E4
Latin small letter a with diaeresis
Auml00C4
Unicode 00C4
Latin capital letter A with diaeresis
aelig00E6
Unicode 00E6
Latin small letter ae
AElig00C6
Unicode 00C6
Latin capital letter AE
ccedil00E7
Unicode 00E7
Latin small letter c with cedilla
Ccedil00C7
Unicode 00C7
Latin capital letter C with cedilla
eth00F0
Unicode 00F0
Latin small letter eth
ETH00D0
Unicode 00D0
Latin capital letter ETH
eacute00E9
Unicode 00E9
Latin small letter e with acute
Eacute00C9
Unicode 00C9
Latin capital letter E with acute
ecirc00EA
Unicode 00EA
Latin small letter e with circumflex
Ecirc00CA
Unicode 00CA
Latin capital letter E with circumflex
egrave00E8
Unicode 00E8
Latin small letter e with grave
Egrave00C8
Unicode 00C8
Latin capital letter E with grave
euml00EB
Unicode 00EB
Latin small letter e with diaeresis
Euml00CB
Unicode 00CB
Latin capital letter E with diaeresis
iacute00ED
Unicode 00ED
Latin small letter i with acute
Iacute00CD
Unicode 00CD
Latin capital letter I with acute
icirc00EE
Unicode 00EE
Latin small letter i with circumflex
Icirc00CE
Unicode 00CE
Latin capital letter I with circumflex
igrave00EC
Unicode 00EC
Latin small letter i with grave
Igrave00CC
Unicode 00CC
Latin capital letter I with grave
iuml00EF
Unicode 00EF
Latin small letter i with diaeresis
Iuml00CF
Unicode 00CF
Latin capital letter I with diaeresis
ntilde00F1
Unicode 00F1
Latin small letter n with tilde
Ntilde00D1
Unicode 00D1
Latin capital letter N with tilde
oacute00F3
Unicode 00F3
Latin small letter o with acute
Oacute00D3
Unicode 00D3
Latin capital letter O with acute
ocirc00F4
Unicode 00F4
Latin small letter o with circumflex
Ocirc00D4
Unicode 00D4
Latin capital letter O with circumflex
ograve00F2
Unicode 00F2
Latin small letter o with grave
Ograve00D2
Unicode 00D2
Latin capital letter O with grave
oslash00F8
Unicode 00F8
Latin small letter o with stroke
Oslash00D8
Unicode 00D8
Latin capital letter O with stroke
otilde00F5
Unicode 00F5
Latin small letter o with tilde
Otilde00D5
Unicode 00D5
Latin capital letter O with tilde
ouml00F6
Unicode 00F6
Latin small letter o with diaeresis
Ouml00D6
Unicode 00D6
Latin capital letter O with diaeresis
szlig00DF
Unicode 00DF
Latin small letter sharp s
thorn00FE
Unicode 00FE
Latin small letter thorn
THORN00DE
Unicode 00DE
Latin capital letter THORN
uacute00FA
Unicode 00FA
Latin small letter u with acute
Uacute00DA
Unicode 00DA
Latin capital letter U with acute
ucirc00FB
Unicode 00FB
Latin small letter u with circumflex
Ucirc00DB
Unicode 00DB
Latin capital letter U with circumflex
ugrave00F9
Unicode 00F9
Latin small letter u with grave
Ugrave00D9
Unicode 00D9
Latin capital letter U with grave
uuml00FC
Unicode 00FC
Latin small letter u with diaeresis
Uuml00DC
Unicode 00DC
Latin capital letter U with diaeresis
yacute00FD
Unicode 00FD
Latin small letter y with acute
Yacute00DD
Unicode 00DD
Latin capital letter Y with acute
yuml00FF
Unicode 00FF
Latin small letter y with diaeresis

1.2. Added Latin 2

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Added Latin 2//EN//XML
System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.1/isolat2.ent

The following character entities are defined in this entity set:

Entity
Name
Unicode
Code point
Sample
Glyph
Description
abreve0103
Unicode 0103
Latin small letter a with breve
Abreve0102
Unicode 0102
Latin capital letter A with breve
amacr0101
Unicode 0101
Latin small letter a with macron
Amacr0100
Unicode 0100
Latin capital letter A with macron
aogon0105
Unicode 0105
Latin small letter a with ogonek
Aogon0104
Unicode 0104
Latin capital letter A with ogonek
cacute0107
Unicode 0107
Latin small letter c with acute
Cacute0106
Unicode 0106
Latin capital letter C with acute
ccaron010D
Unicode 010D
Latin small letter c with caron
Ccaron010C
Unicode 010C
Latin capital letter C with caron
ccirc0109
Unicode 0109
Latin small letter c with circumflex
Ccirc0108
Unicode 0108
Latin capital letter C with circumflex
cdot010B
Unicode 010B
Latin small letter c with dot above
Cdot010A
Unicode 010A
Latin capital letter C with dot above
dcaron010F
Unicode 010F
Latin small letter d with caron
Dcaron010E
Unicode 010E
Latin capital letter D with caron
dstrok0111
Unicode 0111
Latin small letter d with stroke
Dstrok0110
Unicode 0110
Latin capital letter D with stroke
ecaron011B
Unicode 011B
Latin small letter e with caron
Ecaron011A
Unicode 011A
Latin capital letter E with caron
edot0117
Unicode 0117
Latin small letter e with dot above
Edot0116
Unicode 0116
Latin capital letter E with dot above
emacr0113
Unicode 0113
Latin small letter e with macron
Emacr0112
Unicode 0112
Latin capital letter E with macron
eogon0119
Unicode 0119
Latin small letter e with ogonek
Eogon0118
Unicode 0118
Latin capital letter E with ogonek
gacute01F5
Unicode 01F5
Latin small letter g with acute
gbreve011F
Unicode 011F
Latin small letter g with breve
Gbreve011E
Unicode 011E
Latin capital letter G with breve
Gcedil0122
Unicode 0122
Latin capital letter G with cedilla
gcirc011D
Unicode 011D
Latin small letter g with circumflex
Gcirc011C
Unicode 011C
Latin capital letter G with circumflex
gdot0121
Unicode 0121
Latin small letter g with dot above
Gdot0120
Unicode 0120
Latin capital letter G with dot above
hcirc0125
Unicode 0125
Latin small letter h with circumflex
Hcirc0124
Unicode 0124
Latin capital letter H with circumflex
hstrok0127
Unicode 0127
Latin small letter h with stroke
Hstrok0126
Unicode 0126
Latin capital letter H with stroke
Idot0130
Unicode 0130
Latin capital letter I with dot above
Imacr012A
Unicode 012A
Latin capital letter I with macron
imacr012B
Unicode 012B
Latin small letter i with macron
ijlig0133
Unicode 0133
Latin small ligature ij
IJlig0132
Unicode 0132
Latin capital ligature ij
inodot0131
Unicode 0131
Latin small letter dotless i
iogon012F
Unicode 012F
Latin small letter i with ogonek
Iogon012E
Unicode 012E
Latin capital letter I with ogonek
itilde0129
Unicode 0129
Latin small letter i with tilde
Itilde0128
Unicode 0128
Latin capital letter I with tilde
jcirc0135
Unicode 0135
Latin small letter j with circumflex
Jcirc0134
Unicode 0134
Latin capital letter J with circumflex
kcedil0137
Unicode 0137
Latin small letter k with cedilla
Kcedil0136
Unicode 0136
Latin capital letter K with cedilla
kgreen0138
Unicode 0138
Latin small letter kra
lacute013A
Unicode 013A
Latin small letter l with acute
Lacute0139
Unicode 0139
Latin capital letter L with acute
lcaron013E
Unicode 013E
Latin small letter l with caron
Lcaron013D
Unicode 013D
Latin capital letter L with caron
lcedil013C
Unicode 013C
Latin small letter l with cedilla
Lcedil013B
Unicode 013B
Latin capital letter L with cedilla
lmidot0140
Unicode 0140
Latin small letter l with middle dot
Lmidot013F
Unicode 013F
Latin capital letter L with middle dot
lstrok0142
Unicode 0142
Latin small letter l with stroke
Lstrok0141
Unicode 0141
Latin capital letter L with stroke
nacute0144
Unicode 0144
Latin small letter n with acute
Nacute0143
Unicode 0143
Latin capital letter N with acute
eng014B
Unicode 014B
Latin small letter eng
ENG014A
Unicode 014A
Latin capital letter ENG
napos0149
Unicode 0149
Latin small letter n preceded by apostrophe
ncaron0148
Unicode 0148
Latin small letter n with caron
Ncaron0147
Unicode 0147
Latin capital letter N with caron
ncedil0146
Unicode 0146
Latin small letter n with cedilla
Ncedil0145
Unicode 0145
Latin capital letter N with cedilla
odblac0151
Unicode 0151
Latin small letter o with double acute
Odblac0150
Unicode 0150
Latin capital letter O with double acute
Omacr014C
Unicode 014C
Latin capital letter O with macron
omacr014D
Unicode 014D
Latin small letter o with macron
oelig0153
Unicode 0153
Latin small ligature oe
OElig0152
Unicode 0152
Latin capital ligature oe
racute0155
Unicode 0155
Latin small letter r with acute
Racute0154
Unicode 0154
Latin capital letter R with acute
rcaron0159
Unicode 0159
Latin small letter r with caron
Rcaron0158
Unicode 0158
Latin capital letter R with caron
rcedil0157
Unicode 0157
Latin small letter r with cedilla
Rcedil0156
Unicode 0156
Latin capital letter R with cedilla
sacute015B
Unicode 015B
Latin small letter s with acute
Sacute015A
Unicode 015A
Latin capital letter S with acute
scaron0161
Unicode 0161
Latin small letter s with caron
Scaron0160
Unicode 0160
Latin capital letter S with caron
scedil015F
Unicode 015F
Latin small letter s with cedilla
Scedil015E
Unicode 015E
Latin capital letter S with cedilla
scirc015D
Unicode 015D
Latin small letter s with circumflex
Scirc015C
Unicode 015C
Latin capital letter S with circumflex
tcaron0165
Unicode 0165
Latin small letter t with caron
Tcaron0164
Unicode 0164
Latin capital letter T with caron
tcedil0163
Unicode 0163
Latin small letter t with cedilla
Tcedil0162
Unicode 0162
Latin capital letter T with cedilla
tstrok0167
Unicode 0167
Latin small letter t with stroke
Tstrok0166
Unicode 0166
Latin capital letter T with stroke
ubreve016D
Unicode 016D
Latin small letter u with breve
Ubreve016C
Unicode 016C
Latin capital letter U with breve
udblac0171
Unicode 0171
Latin small letter u with double acute
Udblac0170
Unicode 0170
Latin capital letter U with double acute
umacr016B
Unicode 016B
Latin small letter u with macron
Umacr016A
Unicode 016A
Latin capital letter U with macron
uogon0173
Unicode 0173
Latin small letter u with ogonek
Uogon0172
Unicode 0172
Latin capital letter U with ogonek
uring016F
Unicode 016F
Latin small letter u with ring above
Uring016E
Unicode 016E
Latin capital letter U with ring above
utilde0169
Unicode 0169
Latin small letter u with tilde
Utilde0168
Unicode 0168
Latin capital letter U with tilde
wcirc0175
Unicode 0175
Latin small letter w with circumflex
Wcirc0174
Unicode 0174
Latin capital letter W with circumflex
ycirc0177
Unicode 0177
Latin small letter y with circumflex
Ycirc0176
Unicode 0176
Latin capital letter Y with circumflex
Yuml0178
Unicode 0178
Latin capital letter Y with diaeresis
zacute017A
Unicode 017A
Latin small letter z with acute
Zacute0179
Unicode 0179
Latin capital letter Z with acute
zcaron017E
Unicode 017E
Latin small letter z with caron
Zcaron017D
Unicode 017D
Latin capital letter Z with caron
zdot017C
Unicode 017C
Latin small letter z with dot above
Zdot017B
Unicode 017B
Latin capital letter Z with dot above

1.3. Greek Letters

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Greek Letters//EN//XML
System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.1/isogrk1.ent

The following character entities are defined in this entity set:

Entity
Name
Unicode
Code point
Sample
Glyph
Description
agr03B1
Unicode 03B1
Greek small letter alpha
Agr0391
Unicode 0391
Greek capital letter ALPHA
bgr03B2
Unicode 03B2
Greek small letter beta
Bgr0392
Unicode 0392
Greek capital letter BETA
ggr03B3
Unicode 03B3
Greek small letter gamma
Ggr0393
Unicode 0393
Greek capital letter GAMMA
dgr03B4
Unicode 03B4
Greek small letter delta
Dgr0394
Unicode 0394
Greek capital letter DELTA
egr03B5
Unicode 03B5
Greek small letter epsilon
Egr0395
Unicode 0395
Greek capital letter EPSILON
zgr03B6
Unicode 03B6
Greek small letter zeta
Zgr0396
Unicode 0396
Greek capital letter ZETA
eegr03B7
Unicode 03B7
Greek small letter eta
EEgr0397
Unicode 0397
Greek capital letter ETA
thgr03B8
Unicode 03B8
Greek small letter theta
THgr0398
Unicode 0398
Greek capital letter THETA
igr03B9
Unicode 03B9
Greek small letter iota
Igr0399
Unicode 0399
Greek capital letter IOTA
kgr03BA
Unicode 03BA
Greek small letter kappa
Kgr039A
Unicode 039A
Greek capital letter KAPPA
lgr03BB
Unicode 03BB
Greek small letter lamda
Lgr039B
Unicode 039B
Greek capital letter LAMDA
mgr03BC
Unicode 03BC
Greek small letter mu
Mgr039C
Unicode 039C
Greek capital letter MU
ngr03BD
Unicode 03BD
Greek small letter nu
Ngr039D
Unicode 039D
Greek capital letter NU
xgr03BE
Unicode 03BE
Greek small letter xi
Xgr039E
Unicode 039E
Greek capital letter XI
ogr03BF
Unicode 03BF
Greek small letter omicron
Ogr039F
Unicode 039F
Greek capital letter OMICRON
pgr03C0
Unicode 03C0
Greek small letter pi
Pgr03A0
Unicode 03A0
Greek capital letter PI
rgr03C1
Unicode 03C1
Greek small letter rho
Rgr03A1
Unicode 03A1
Greek capital letter RHO
sgr03C3
Unicode 03C3
Greek small letter sigma
Sgr03A3
Unicode 03A3
Greek capital letter SIGMA
sfgr03C2
Unicode 03C2
Greek small letter final sigma
tgr03C4
Unicode 03C4
Greek small letter tau
Tgr03A4
Unicode 03A4
Greek capital letter TAU
ugr03C5
Unicode 03C5
Greek small letter upsilon
Ugr03A5
Unicode 03A5
Greek capital letter UPSILON
phgr03C6
Unicode 03C6
Greek small letter phi
PHgr03A6
Unicode 03A6
Greek capital letter PHI
khgr03C7
Unicode 03C7
Greek small letter chi
KHgr03A7
Unicode 03A7
Greek capital letter CHI
psgr03C8
Unicode 03C8
Greek small letter psi
PSgr03A8
Unicode 03A8
Greek capital letter PSI
ohgr03C9
Unicode 03C9
Greek small letter omega
OHgr03A9
Unicode 03A9
Greek capital letter OMEGA

1.4. Monotoniko Greek

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Monotoniko Greek//EN//XML
System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.1/isogrk2.ent

The following character entities are defined in this entity set:

Entity
Name
Unicode
Code point
Sample
Glyph
Description
aacgr03AC
Unicode 03AC
Greek small letter alpha with tonos
Aacgr0386
Unicode 0386
Greek capital letter ALPHA with tonos
eacgr03AD
Unicode 03AD
Greek small letter epsilon with tonos
Eacgr0388
Unicode 0388
Greek capital letter EPSILON with tonos
eeacgr03AE
Unicode 03AE
Greek small letter eta with tonos
EEacgr0389
Unicode 0389
Greek capital letter ETA with tonos
idigr03CA
Unicode 03CA
Greek small letter iota with dialytika
Idigr03AA
Unicode 03AA
Greek capital letter IOTA with dialytika
iacgr03AF
Unicode 03AF
Greek small letter iota with tonos
Iacgr038A
Unicode 038A
Greek capital letter IOTA with tonos
idiagr0390
Unicode 0390
Greek small letter iota with dialytika and tonos
oacgr03CC
Unicode 03CC
Greek small letter omicron with tonos
Oacgr038C
Unicode 038C
Greek capital letter OMICRON with tonos
udigr03CB
Unicode 03CB
Greek small letter upsilon with dialytika
Udigr03AB
Unicode 03AB
Greek capital letter UPSILON with dialytika
uacgr03CD
Unicode 03CD
Greek small letter upsilon with tonos
Uacgr038E
Unicode 038E
Greek capital letter UPSILON with tonos
udiagr03B0
Unicode 03B0
Greek small letter upsilon with tonos and dialytika
ohacgr03CE
Unicode 03CE
Greek small letter omega with tonos
OHacgr038F
Unicode 038F
Greek capital letter OMEGA with tonos

1.5. Russian Cyrillic

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Russian Cyrillic//EN//XML
System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.1/isocyr1.ent

The following character entities are defined in this entity set:

Entity
Name
Unicode
Code point
Sample
Glyph
Description
acy0430
Unicode 0430
Cyrillic small letter a
Acy0410
Unicode 0410
Cyrillic capital letter A
bcy0431
Unicode 0431
Cyrillic small letter be
Bcy0411
Unicode 0411
Cyrillic capital letter BE
vcy0432
Unicode 0432
Cyrillic small letter ve
Vcy0412
Unicode 0412
Cyrillic capital letter VE
gcy0433
Unicode 0433
Cyrillic small letter ghe
Gcy0413
Unicode 0413
Cyrillic capital letter GHE
dcy0434
Unicode 0434
Cyrillic small letter de
Dcy0414
Unicode 0414
Cyrillic capital letter DE
iecy0435
Unicode 0435
Cyrillic small letter ie
IEcy0415
Unicode 0415
Cyrillic capital letter IE
iocy0451
Unicode 0451
Cyrillic small letter io
IOcy0401
Unicode 0401
Cyrillic capital letter IO
zhcy0436
Unicode 0436
Cyrillic small letter zhe
ZHcy0416
Unicode 0416
Cyrillic capital letter ZHE
zcy0437
Unicode 0437
Cyrillic small letter ze
Zcy0417
Unicode 0417
Cyrillic capital letter ZE
icy0438
Unicode 0438
Cyrillic small letter i
Icy0418
Unicode 0418
Cyrillic capital letter I
jcy0439
Unicode 0439
Cyrillic small letter short i
Jcy0419
Unicode 0419
Cyrillic capital letter SHORT i
kcy043A
Unicode 043A
Cyrillic small letter ka
Kcy041A
Unicode 041A
Cyrillic capital letter KA
lcy043B
Unicode 043B
Cyrillic small letter el
Lcy041B
Unicode 041B
Cyrillic capital letter EL
mcy043C
Unicode 043C
Cyrillic small letter em
Mcy041C
Unicode 041C
Cyrillic capital letter EM
ncy043D
Unicode 043D
Cyrillic small letter en
Ncy041D
Unicode 041D
Cyrillic capital letter EN
ocy043E
Unicode 043E
Cyrillic small letter o
Ocy041E
Unicode 041E
Cyrillic capital letter O
pcy043F
Unicode 043F
Cyrillic small letter pe
Pcy041F
Unicode 041F
Cyrillic capital letter PE
rcy0440
Unicode 0440
Cyrillic small letter er
Rcy0420
Unicode 0420
Cyrillic capital letter ER
scy0441
Unicode 0441
Cyrillic small letter es
Scy0421
Unicode 0421
Cyrillic capital letter ES
tcy0442
Unicode 0442
Cyrillic small letter te
Tcy0422
Unicode 0422
Cyrillic capital letter TE
ucy0443
Unicode 0443
Cyrillic small letter u
Ucy0423
Unicode 0423
Cyrillic capital letter U
fcy0444
Unicode 0444
Cyrillic small letter ef
Fcy0424
Unicode 0424
Cyrillic capital letter EF
khcy0445
Unicode 0445
Cyrillic small letter ha
KHcy0425
Unicode 0425
Cyrillic capital letter HA
tscy0446
Unicode 0446
Cyrillic small letter tse
TScy0426
Unicode 0426
Cyrillic capital letter TSE
chcy0447
Unicode 0447
Cyrillic small letter che
CHcy0427
Unicode 0427
Cyrillic capital letter CHE
shcy0448
Unicode 0448
Cyrillic small letter sha
SHcy0428
Unicode 0428
Cyrillic capital letter SHA
shchcy0449
Unicode 0449
Cyrillic small letter shcha
SHCHcy0429
Unicode 0429
Cyrillic capital letter SHCHA
hardcy044A
Unicode 044A
Cyrillic small letter hard sign
HARDcy042A
Unicode 042A
Cyrillic capital letter HARD sign
ycy044B
Unicode 044B
Cyrillic small letter yeru
Ycy042B
Unicode 042B
Cyrillic capital letter YERU
softcy044C
Unicode 044C
Cyrillic small letter soft sign
SOFTcy042C
Unicode 042C
Cyrillic capital letter SOFT sign
ecy044D
Unicode 044D
Cyrillic small letter e
Ecy042D
Unicode 042D
Cyrillic capital letter E
yucy044E
Unicode 044E
Cyrillic small letter yu
YUcy042E
Unicode 042E
Cyrillic capital letter YU
yacy044F
Unicode 044F
Cyrillic small letter ya
YAcy042F
Unicode 042F
Cyrillic capital letter YA
numero2116
Unicode 2116
Numero sign

1.6. Non-Russian Cyrillic

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Non-Russian Cyrillic//EN//XML
System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.1/isocyr2.ent

The following character entities are defined in this entity set:

Entity
Name
Unicode
Code point
Sample
Glyph
Description
djcy0452
Unicode 0452
Cyrillic small letter dje
DJcy0402
Unicode 0402
Cyrillic capital letter DJE
gjcy0453
Unicode 0453
Cyrillic small letter gje
GJcy0403
Unicode 0403
Cyrillic capital letter GJE
jukcy0454
Unicode 0454
Cyrillic small letter ukrainian ie
Jukcy0404
Unicode 0404
Cyrillic capital letter UKRAINIAN ie
dscy0455
Unicode 0455
Cyrillic small letter dze
DScy0405
Unicode 0405
Cyrillic capital letter DZE
iukcy0456
Unicode 0456
Cyrillic small letter byelorussian-ukrainian i
Iukcy0406
Unicode 0406
Cyrillic capital letter BYELORUSSIAN-UKRAINIAN i
yicy0457
Unicode 0457
Cyrillic small letter yi
YIcy0407
Unicode 0407
Cyrillic capital letter YI
jsercy0458
Unicode 0458
Cyrillic small letter je
Jsercy0408
Unicode 0408
Cyrillic capital letter JE
ljcy0459
Unicode 0459
Cyrillic small letter lje
LJcy0409
Unicode 0409
Cyrillic capital letter LJE
njcy045A
Unicode 045A
Cyrillic small letter nje
NJcy040A
Unicode 040A
Cyrillic capital letter NJE
tshcy045B
Unicode 045B
Cyrillic small letter tshe
TSHcy040B
Unicode 040B
Cyrillic capital letter TSHE
kjcy045C
Unicode 045C
Cyrillic small letter kje
KJcy040C
Unicode 040C
Cyrillic capital letter KJE
ubrcy045E
Unicode 045E
Cyrillic small letter short u
Ubrcy040E
Unicode 040E
Cyrillic capital letter SHORT u
dzcy045F
Unicode 045F
Cyrillic small letter dzhe
DZcy040F
Unicode 040F
Cyrillic capital letter DZHE

1.7. Numeric and Special Graphic

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Numeric and Special Graphic//EN//XML
System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.1/isonum.ent

The following character entities are defined in this entity set:

Entity
Name
Unicode
Code point
Sample
Glyph
Description
half00BD
Unicode 00BD
Vulgar fraction one half
frac1200BD
Unicode 00BD
Vulgar fraction one half
frac1400BC
Unicode 00BC
Vulgar fraction one quarter
frac3400BE
Unicode 00BE
Vulgar fraction three quarters
frac18215B
Unicode 215B
Vulgar fraction one eighth
frac38215C
Unicode 215C
Vulgar fraction three eighths
frac58215D
Unicode 215D
Vulgar fraction five eighths
frac78215E
Unicode 215E
Vulgar fraction seven eighths
sup100B9
Unicode 00B9
Superscript one
sup200B2
Unicode 00B2
Superscript two
sup300B3
Unicode 00B3
Superscript three
plus002B
Unicode 002B
Plus sign
plusmn00B1
Unicode 00B1
Plus-minus sign
lt003C
Unicode 003C
Less-than sign
equals003D
Unicode 003D
Equals sign
gt003E
Unicode 003E
Greater-than sign
divide00F7
Unicode 00F7
Division sign
times00D7
Unicode 00D7
Multiplication sign
curren00A4
Unicode 00A4
Currency sign
pound00A3
Unicode 00A3
Pound sign
dollar0024
Unicode 0024
Dollar sign
cent00A2
Unicode 00A2
Cent sign
yen00A5
Unicode 00A5
Yen sign
num0023
Unicode 0023
Number sign
percnt0025
Unicode 0025
Percent sign
amp0026
Unicode 0026
Ampersand
ast002A
Unicode 002A
Asterisk
commat0040
Unicode 0040
Commercial at
lsqb005B
Unicode 005B
Left square bracket
bsol005C
Unicode 005C
Reverse solidus
rsqb005D
Unicode 005D
Right square bracket
lcub007B
Unicode 007B
Left curly bracket
horbar2015
Unicode 2015
Horizontal bar
verbar007C
Unicode 007C
Vertical line
rcub007D
Unicode 007D
Right curly bracket
micro00B5
Unicode 00B5
Micro sign
ohm2126
Unicode 2126
Ohm sign
deg00B0
Unicode 00B0
Degree sign
ordm00BA
Unicode 00BA
Masculine ordinal indicator
ordf00AA
Unicode 00AA
Feminine ordinal indicator
sect00A7
Unicode 00A7
Section sign
para00B6
Unicode 00B6
Pilcrow sign
middot00B7
Unicode 00B7
Middle dot
larr2190
Unicode 2190
Leftwards arrow
rarr2192
Unicode 2192
Rightwards arrow
uarr2191
Unicode 2191
Upwards arrow
darr2193
Unicode 2193
Downwards arrow
copy00A9
Unicode 00A9
Copyright sign
reg00AE
Unicode 00AE
Registered sign
trade2122
Unicode 2122
Trade mark sign
brvbar00A6
Unicode 00A6
Broken bar
not00AC
Unicode 00AC
Not sign
sung 
Unicode  
Eighth note
excl0021
Unicode 0021
Exclamation mark
iexcl00A1
Unicode 00A1
Inverted exclamation mark
quot0022
Unicode 0022
Quotation mark
apos0027
Unicode 0027
Apostrophe
lpar0028
Unicode 0028
Left parenthesis
rpar0029
Unicode 0029
Right parenthesis
comma002C
Unicode 002C
Comma
lowbar005F
Unicode 005F
Low line
hyphen002D
Unicode 002D
Hyphen
period002E
Unicode 002E
Period
sol002F
Unicode 002F
Solidus
colon003A
Unicode 003A
Colon
semi003B
Unicode 003B
Semicolon
quest003F
Unicode 003F
Question mark
iquest00BF
Unicode 00BF
Inverted question mark
laquo00AB
Unicode 00AB
Left-pointing double angle quotation mark
raquo00BB
Unicode 00BB
Right-pointing double angle quotation mark
lsquo2018
Unicode 2018
Left single quotation mark
rsquo2019
Unicode 2019
Right single quotation mark
ldquo201C
Unicode 201C
Left double quotation mark
rdquo201D
Unicode 201D
Right double quotation mark
nbsp00A0
Unicode 00A0
No-break space
shy00AD
Unicode 00AD
Soft hyphen

1.8. Diacritical Marks

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Diacritical Marks//EN//XML
System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.1/isodia.ent

The following character entities are defined in this entity set:

Entity
Name
Unicode
Code point
Sample
Glyph
Description
acute00B4
Unicode 00B4
Acute accent
breve02D8
Unicode 02D8
Breve
caron02C7
Unicode 02C7
Caron
cedil00B8
Unicode 00B8
Cedilla
circ005E
Unicode 005E
Circumflex accent
dblac02DD
Unicode 02DD
Double acute accent
die00A8
Unicode 00A8
Diaeresis
dot02D9
Unicode 02D9
Dot above
grave0060
Unicode 0060
Grave accent
macr00AF
Unicode 00AF
Macron
ogon02DB
Unicode 02DB
Ogonek
ring02DA
Unicode 02DA
Ring above
tilde02DC
Unicode 02DC
Small tilde
uml00A8
Unicode 00A8
Diaeresis

1.9. Publishing

Identifiers for this entity set:

Public identifier: ISO 8879:1986//ENTITIES Publishing//EN//XML
System identifier: http://www.oasis-open.org/docbook/xmlcharent/0.1/isopub.ent

The following character entities are defined in this entity set:

Entity
Name
Unicode
Code point
Sample
Glyph
Description
emsp2003
Unicode 2003
Em space
ensp2002
Unicode 2002
En space
emsp132004
Unicode 2004
Three-per-em space
emsp142005
Unicode 2005
Four-per-em space
numsp2007
Unicode 2007
Figure space
puncsp2008
Unicode 2008
Punctuation space
thinsp2009
Unicode 2009
Thin space
hairsp200A
Unicode 200A
Hair space
mdash2014
Unicode 2014
Em dash
ndash2013
Unicode 2013
En dash
dash2010
Unicode 2010
Dash
blank2423
Unicode 2423
Open box
hellip2026
Unicode 2026
Horizontal ellipsis
nldr2025
Unicode 2025
Two dot leader
frac132153
Unicode 2153
Vulgar fraction one third
frac232154
Unicode 2154
Vulgar fraction two thirds
frac152155
Unicode 2155
Vulgar fraction one fifth
frac252156
Unicode 2156
Vulgar fraction two fifths
frac352157
Unicode 2157
Vulgar fraction three fifths
frac452158
Unicode 2158
Vulgar fraction four fifths
frac162159
Unicode 2159
Vulgar fraction one sixth
frac56215A
Unicode 215A
Vulgar fraction five sixths
incare2105
Unicode 2105
Care of
block2588
Unicode 2588
Full block
uhblk2580
Unicode 2580
Upper half block
lhblk2584
Unicode 2584
Lower half block
blk142591
Unicode 2591
Light shade
blk122592
Unicode 2592
Medium shade
blk342593
Unicode 2593
Dark shade
marker25AE
Unicode 25AE
Black vertical rectangle
cir25CB
Unicode 25CB
White circle
squ25A1
Unicode 25A1
White square
rect25AD
Unicode 25AD
White rectangle
utri25B5
Unicode 25B5
White up-pointing small triangle
dtri25BF
Unicode 25BF
White down-pointing small triangle
starNONE
Unicode NONE
=star, open
bull2022
Unicode 2022
Bullet
squf25AA
Unicode 25AA
Black small square
utrif25B4
Unicode 25B4
Black up-pointing small triangle
dtrif25BE
Unicode 25BE
Black down-pointing small triangle
ltrif25C2
Unicode 25C2
Black left-pointing small triangle
rtrif25B8
Unicode 25B8
Black right-pointing small triangle
clubs2663