author Sam Atman 2025-04-30 20:32:23 -0400
committer Sam Atman 2025-04-30 20:32:23 -0400
commit a7164d9e7b3c3ec6813e06a42d82180d766e15ca
tree b9c55a45ddac98e51653cb64d39b6b26cfb50362 /data/unicode/DoNotEmit.txt
parent Allocation Failure Tests
download zg-a7164d9e7b3c3ec6813e06a42d82180d766e15ca.tar.gz
zg-a7164d9e7b3c3ec6813e06a42d82180d766e15ca.tar.xz
zg-a7164d9e7b3c3ec6813e06a42d82180d766e15ca.zip
Unicode 16.0
Went smoothly, needed to add some scripts and adjust the magic numbers, but other than that, all set.
Diffstat (limited to 'data/unicode/DoNotEmit.txt')
-rw-r--r-- data/unicode/DoNotEmit.txt 472
1 files changed, 472 insertions, 0 deletions
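The header of the file added below describes a three-field, semicolon-separated record format: a discouraged code point sequence, its replacement sequence, and a DoNotEmit type, followed by an optional `#` comment. As a rough illustration only, a reader for that format might be sketched in Python as follows. The names `DoNotEmitEntry` and `parse_do_not_emit` are hypothetical, chosen for this example; they are not part of zg or of the Unicode Character Database.

```python
from typing import List, NamedTuple, Tuple


class DoNotEmitEntry(NamedTuple):
    """One data line of DoNotEmit.txt (hypothetical type for illustration)."""
    sequence: Tuple[int, ...]     # Field 0: discouraged code point sequence
    replacement: Tuple[int, ...]  # Field 1: preferred replacement sequence
    kind: str                     # Field 2: DoNotEmit type, e.g. "Deprecated"


def parse_do_not_emit(text: str) -> List[DoNotEmitEntry]:
    """Parse the semicolon-separated format described in the file header."""
    entries = []
    for raw in text.splitlines():
        # Everything from '#' onward is a comment; skip blank/comment-only lines.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        fields = [field.strip() for field in line.split(";")]
        if len(fields) < 3:
            raise ValueError(f"expected 3 fields, got: {raw!r}")
        # Code points are space-separated hexadecimal values.
        sequence = tuple(int(cp, 16) for cp in fields[0].split())
        replacement = tuple(int(cp, 16) for cp in fields[1].split())
        entries.append(DoNotEmitEntry(sequence, replacement, fields[2]))
    return entries
```

Calling `parse_do_not_emit` on the file's text yields one entry per data line. Whether a consumer then rewrites input, or only treats the two sequences as similar for display, collation, or searching, is left to the application, as the file header notes.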
diff --git a/data/unicode/DoNotEmit.txt b/data/unicode/DoNotEmit.txt
new file mode 100644
index 0000000..757a313
--- /dev/null
+++ b/data/unicode/DoNotEmit.txt
@@ -0,0 +1,472 @@
# DoNotEmit-16.0.0.txt
# Date: 2024-07-30, 19:30:00 GMT
# © 2024 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
#
# For documentation, see UAX #44: Unicode Character Database,
# at https://www.unicode.org/reports/tr44/
#
# Do_Not_Emit
#
# This file is part of the Unicode Character Database. It does not define
# any properties, but rather provides additional information about
# characters or character sequences that should not be emitted or generated
# in newly authored text. Applications such as input methods could use this
# information to replace "Do Not Emit" sequences input by users with an
# acceptable alternative. Other applications may use the information in
# this file to consider certain sequences similar to each other for display,
# collation, or searching purposes. (This is an addition to canonical
# equivalence, which is defined elsewhere in the standard.)
#
# Note that the discouraged sequences listed in this file should not be
# considered invalid input to text display implementations. When received
# by an implementation, the sequences are not required to be displayed with
# a visual indication of an error (such as dotted circles). Implementations
# should try their best to display them as normal text, perhaps in the same
# way or very similar to the way their alternative sequence is displayed.
#
# Only characters and character sequences for which a suitable alternative
# sequence exists are provided. For example, deprecated characters for
# which no suitable alternative exists are not listed. (For a list of
# deprecated characters see the "Deprecated" property defined in the Unicode
# Character Database file "PropList.txt".)
#
# Also, canonically equivalent sequences are not listed, even if one
# sequence is specified to be discouraged or deprecated in the Unicode
# Standard. For example, U+2126 OHM SIGN, which is canonically equivalent
# to U+03A9 GREEK CAPITAL LETTER OMEGA is not explicitly listed, since it is
# expected that conforming Unicode processes would discover the relation
# between the two characters.
#
# Note that some sequences could be considered recursive, in the way that
# the preferred sequence to use may be a subsequence of the "Do Not Emit"
# sequence. This may have implications for some implementations that may want
# to treat the original sequence and its alternative as similar.
#
# This file should not be considered to be comprehensive. It is expected
# that new sequences and categories may be added to or removed from the file
# as the Unicode Standard goes through new releases.
#
# Format:
# Field 0 A sequence of Unicode code point values
# Field 1 A replacement sequence of Unicode code point values
# Field 2 DoNotEmit type of the original character sequence
#
# Field 2 is followed by an optional human-readable comment field.
#
# These are the values used for Field 2:
# Indic_Atomic_Consonant:
# Sequences that look like an Indic consonant but should be avoided
# in representing that consonant. For now, these are limited to
# Devanagari.
# Indic_Consonant_Conjunct:
# Sequences that look like an Indic conjunct but should be avoided
# in representing that conjunct. For now, these are limited to
# Devanagari.
# Indic_Vowel_Letter:
# Sequences that look like an Indic vowel letter but should be avoided
# in representing that vowel letter.
# Bengali_Khanda_Ta:
# Legacy representation of Bengali khanda ta prior to
# Unicode Version 4.1.
# Malayalam_Chillu:
# Legacy representation of Malayalam chillus prior to
# Unicode Version 5.1. Note that the sequence in Field 0 may appear
# in legitimate Malayalam sequences not related to chillus.
# Tamil_Shrii:
# Legacy representation of Tamil ligature shri prior to
# Unicode Version 4.1.
# Dotless_Form:
# Dotless forms of lowercase Latin i and j followed by a
# combining dot above.
# Hamza_Form:
# Sequences containing Arabic hamza above, which should be avoided.
# Precomposed_Form:
# Sequences for which a precomposed form exists, but without canonical
# equivalence.
# Deprecated:
# Characters that are identified in the Unicode Standard as
# deprecated for which a replacement sequence exists.
# Discouraged:
# Miscellaneous characters and sequences discouraged in the
# Unicode Standard.
# Preferred_Spelling:
# Miscellaneous characters and sequences for which the Unicode Standard
# specifies a preferred spelling.

# ================================================
# "Do Not Use" tables from the Core Specification
# ================================================

# Devanagari, from Table 12-1
0905 0946; 0904; Indic_Vowel_Letter # DEVANAGARI LETTER A, DEVANAGARI VOWEL SIGN SHORT E; DEVANAGARI LETTER SHORT A
0905 093E; 0906; Indic_Vowel_Letter # DEVANAGARI LETTER A, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER AA
0930 094D 0907; 0908; Indic_Vowel_Letter # DEVANAGARI LETTER RA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER I; DEVANAGARI LETTER II
0909 0941; 090A; Indic_Vowel_Letter # DEVANAGARI LETTER U, DEVANAGARI VOWEL SIGN U; DEVANAGARI LETTER UU
090F 0945; 090D; Indic_Vowel_Letter # DEVANAGARI LETTER E, DEVANAGARI VOWEL SIGN CANDRA E; DEVANAGARI LETTER CANDRA E
090F 0946; 090E; Indic_Vowel_Letter # DEVANAGARI LETTER E, DEVANAGARI VOWEL SIGN SHORT E; DEVANAGARI LETTER SHORT E
090F 0947; 0910; Indic_Vowel_Letter # DEVANAGARI LETTER E, DEVANAGARI VOWEL SIGN E; DEVANAGARI LETTER AI
0905 0949; 0911; Indic_Vowel_Letter # DEVANAGARI LETTER A, DEVANAGARI VOWEL SIGN CANDRA O; DEVANAGARI LETTER CANDRA O
0906 0945; 0911; Indic_Vowel_Letter # DEVANAGARI LETTER AA, DEVANAGARI VOWEL SIGN CANDRA E; DEVANAGARI LETTER CANDRA O
0905 094A; 0912; Indic_Vowel_Letter # DEVANAGARI LETTER A, DEVANAGARI VOWEL SIGN SHORT O; DEVANAGARI LETTER SHORT O
0906 0946; 0912; Indic_Vowel_Letter # DEVANAGARI LETTER AA, DEVANAGARI VOWEL SIGN SHORT E; DEVANAGARI LETTER SHORT O
0905 094B; 0913; Indic_Vowel_Letter # DEVANAGARI LETTER A, DEVANAGARI VOWEL SIGN O; DEVANAGARI LETTER O
0906 0947; 0913; Indic_Vowel_Letter # DEVANAGARI LETTER AA, DEVANAGARI VOWEL SIGN E; DEVANAGARI LETTER O
0905 094C; 0914; Indic_Vowel_Letter # DEVANAGARI LETTER A, DEVANAGARI VOWEL SIGN AU; DEVANAGARI LETTER AU
0906 0948; 0914; Indic_Vowel_Letter # DEVANAGARI LETTER AA, DEVANAGARI VOWEL SIGN AI; DEVANAGARI LETTER AU
0905 0945; 0972; Indic_Vowel_Letter # DEVANAGARI LETTER A, DEVANAGARI VOWEL SIGN CANDRA E; DEVANAGARI LETTER CANDRA A
0905 093A; 0973; Indic_Vowel_Letter # DEVANAGARI LETTER A, DEVANAGARI VOWEL SIGN OE; DEVANAGARI LETTER OE
0905 093B; 0974; Indic_Vowel_Letter # DEVANAGARI LETTER A, DEVANAGARI VOWEL SIGN OOE; DEVANAGARI LETTER OOE
0906 093A; 0974; Indic_Vowel_Letter # DEVANAGARI LETTER AA, DEVANAGARI VOWEL SIGN OE; DEVANAGARI LETTER OOE
0905 094F; 0975; Indic_Vowel_Letter # DEVANAGARI LETTER A, DEVANAGARI VOWEL SIGN AW; DEVANAGARI LETTER AW
0905 0956; 0976; Indic_Vowel_Letter # DEVANAGARI LETTER A, DEVANAGARI VOWEL SIGN UE; DEVANAGARI LETTER UE
0905 0957; 0977; Indic_Vowel_Letter # DEVANAGARI LETTER A, DEVANAGARI VOWEL SIGN UUE; DEVANAGARI LETTER UUE

# Devanagari, from Table 12-2
# Review Note: Some experts have recommended removing these, while
# others prefer keeping them. They may also be procedurally generated.
0916 094D 093E; 0916; Indic_Atomic_Consonant # DEVANAGARI LETTER KHA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER KHA
0916 094D 200D 093E; 0916; Indic_Atomic_Consonant # DEVANAGARI LETTER KHA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER KHA
0917 094D 093E; 0917; Indic_Atomic_Consonant # DEVANAGARI LETTER GA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER GA
0917 094D 200D 093E; 0917; Indic_Atomic_Consonant # DEVANAGARI LETTER GA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER GA
0918 094D 093E; 0918; Indic_Atomic_Consonant # DEVANAGARI LETTER GHA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER GHA
0918 094D 200D 093E; 0918; Indic_Atomic_Consonant # DEVANAGARI LETTER GHA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER GHA
091A 094D 093E; 091A; Indic_Atomic_Consonant # DEVANAGARI LETTER CA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER CA
091A 094D 200D 093E; 091A; Indic_Atomic_Consonant # DEVANAGARI LETTER CA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER CA
091C 094D 093E; 091C; Indic_Atomic_Consonant # DEVANAGARI LETTER JA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER JA
091C 094D 200D 093E; 091C; Indic_Atomic_Consonant # DEVANAGARI LETTER JA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER JA
091D 094D 093E; 091D; Indic_Atomic_Consonant # DEVANAGARI LETTER JHA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER JHA
091D 094D 200D 093E; 091D; Indic_Atomic_Consonant # DEVANAGARI LETTER JHA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER JHA
091E 094D 093E; 091E; Indic_Atomic_Consonant # DEVANAGARI LETTER NYA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER NYA
091E 094D 200D 093E; 091E; Indic_Atomic_Consonant # DEVANAGARI LETTER NYA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER NYA
0923 094D 093E; 0923; Indic_Atomic_Consonant # DEVANAGARI LETTER NNA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER NNA
0923 094D 200D 093E; 0923; Indic_Atomic_Consonant # DEVANAGARI LETTER NNA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER NNA
0924 094D 093E; 0924; Indic_Atomic_Consonant # DEVANAGARI LETTER TA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER TA
0924 094D 200D 093E; 0924; Indic_Atomic_Consonant # DEVANAGARI LETTER TA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER TA
0925 094D 093E; 0925; Indic_Atomic_Consonant # DEVANAGARI LETTER THA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER THA
0925 094D 200D 093E; 0925; Indic_Atomic_Consonant # DEVANAGARI LETTER THA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER THA
0927 094D 093E; 0927; Indic_Atomic_Consonant # DEVANAGARI LETTER DHA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER DHA
0927 094D 200D 093E; 0927; Indic_Atomic_Consonant # DEVANAGARI LETTER DHA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER DHA
0928 094D 093E; 0928; Indic_Atomic_Consonant # DEVANAGARI LETTER NA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER NA
0928 094D 200D 093E; 0928; Indic_Atomic_Consonant # DEVANAGARI LETTER NA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER NA
0929 094D 093E; 0929; Indic_Atomic_Consonant # DEVANAGARI LETTER NNNA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER NNNA
0929 094D 200D 093E; 0929; Indic_Atomic_Consonant # DEVANAGARI LETTER NNNA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER NNNA
0928 093C 094D 093E; 0929; Indic_Atomic_Consonant # DEVANAGARI LETTER NA, DEVANAGARI SIGN NUKTA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER NNNA
0928 093C 094D 200D 093E; 0929; Indic_Atomic_Consonant # DEVANAGARI LETTER NA, DEVANAGARI SIGN NUKTA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER NNNA
092A 094D 093E; 092A; Indic_Atomic_Consonant # DEVANAGARI LETTER PA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER PA
092A 094D 200D 093E; 092A; Indic_Atomic_Consonant # DEVANAGARI LETTER PA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER PA
092C 094D 093E; 092C; Indic_Atomic_Consonant # DEVANAGARI LETTER BA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER BA
092C 094D 200D 093E; 092C; Indic_Atomic_Consonant # DEVANAGARI LETTER BA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER BA
092D 094D 093E; 092D; Indic_Atomic_Consonant # DEVANAGARI LETTER BHA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER BHA
092D 094D 200D 093E; 092D; Indic_Atomic_Consonant # DEVANAGARI LETTER BHA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER BHA
092E 094D 093E; 092E; Indic_Atomic_Consonant # DEVANAGARI LETTER MA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER MA
092E 094D 200D 093E; 092E; Indic_Atomic_Consonant # DEVANAGARI LETTER MA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER MA
092F 094D 093E; 092F; Indic_Atomic_Consonant # DEVANAGARI LETTER YA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER YA
092F 094D 200D 093E; 092F; Indic_Atomic_Consonant # DEVANAGARI LETTER YA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER YA
0932 094D 093E; 0932; Indic_Atomic_Consonant # DEVANAGARI LETTER LA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER LA
0932 094D 200D 093E; 0932; Indic_Atomic_Consonant # DEVANAGARI LETTER LA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER LA
0935 094D 093E; 0935; Indic_Atomic_Consonant # DEVANAGARI LETTER VA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER VA
0935 094D 200D 093E; 0935; Indic_Atomic_Consonant # DEVANAGARI LETTER VA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER VA
0936 094D 093E; 0936; Indic_Atomic_Consonant # DEVANAGARI LETTER SHA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER SHA
0936 094D 200D 093E; 0936; Indic_Atomic_Consonant # DEVANAGARI LETTER SHA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER SHA
0937 094D 093E; 0937; Indic_Atomic_Consonant # DEVANAGARI LETTER SSA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER SSA
0937 094D 200D 093E; 0937; Indic_Atomic_Consonant # DEVANAGARI LETTER SSA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER SSA
0938 094D 093E; 0938; Indic_Atomic_Consonant # DEVANAGARI LETTER SA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER SA
0938 094D 200D 093E; 0938; Indic_Atomic_Consonant # DEVANAGARI LETTER SA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER SA
0959 094D 093E; 0959; Indic_Atomic_Consonant # DEVANAGARI LETTER KHHA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER KHHA
0959 094D 200D 093E; 0959; Indic_Atomic_Consonant # DEVANAGARI LETTER KHHA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER KHHA
0916 093C 094D 093E; 0959; Indic_Atomic_Consonant # DEVANAGARI LETTER KHA, DEVANAGARI SIGN NUKTA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER KHHA
0916 093C 094D 200D 093E; 0959; Indic_Atomic_Consonant # DEVANAGARI LETTER KHA, DEVANAGARI SIGN NUKTA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER KHHA
095A 094D 093E; 095A; Indic_Atomic_Consonant # DEVANAGARI LETTER GHHA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER GHHA
095A 094D 200D 093E; 095A; Indic_Atomic_Consonant # DEVANAGARI LETTER GHHA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER GHHA
0917 093C 094D 093E; 095A; Indic_Atomic_Consonant # DEVANAGARI LETTER GA, DEVANAGARI SIGN NUKTA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER GHHA
0917 093C 094D 200D 093E; 095A; Indic_Atomic_Consonant # DEVANAGARI LETTER GA, DEVANAGARI SIGN NUKTA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER GHHA
095B 094D 093E; 095B; Indic_Atomic_Consonant # DEVANAGARI LETTER ZA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER ZA
095B 094D 200D 093E; 095B; Indic_Atomic_Consonant # DEVANAGARI LETTER ZA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER ZA
091C 093C 094D 093E; 095B; Indic_Atomic_Consonant # DEVANAGARI LETTER JA, DEVANAGARI SIGN NUKTA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER ZA
091C 093C 094D 200D 093E; 095B; Indic_Atomic_Consonant # DEVANAGARI LETTER JA, DEVANAGARI SIGN NUKTA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER ZA
095F 094D 093E; 095F; Indic_Atomic_Consonant # DEVANAGARI LETTER YYA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER YYA
095F 094D 200D 093E; 095F; Indic_Atomic_Consonant # DEVANAGARI LETTER YYA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER YYA
092F 093C 094D 093E; 095F; Indic_Atomic_Consonant # DEVANAGARI LETTER YA, DEVANAGARI SIGN NUKTA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER YYA
092F 093C 094D 200D 093E; 095F; Indic_Atomic_Consonant # DEVANAGARI LETTER YA, DEVANAGARI SIGN NUKTA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER YYA
0979 094D 093E; 0979; Indic_Atomic_Consonant # DEVANAGARI LETTER ZHA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER ZHA
0979 094D 200D 093E; 0979; Indic_Atomic_Consonant # DEVANAGARI LETTER ZHA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER ZHA
097A 094D 093E; 097A; Indic_Atomic_Consonant # DEVANAGARI LETTER HEAVY YA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER HEAVY YA
097A 094D 200D 093E; 097A; Indic_Atomic_Consonant # DEVANAGARI LETTER HEAVY YA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER HEAVY YA
097B 094D 093E; 097B; Indic_Atomic_Consonant # DEVANAGARI LETTER GGA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER GGA
097B 094D 200D 093E; 097B; Indic_Atomic_Consonant # DEVANAGARI LETTER GGA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER GGA
097C 094D 093E; 097C; Indic_Atomic_Consonant # DEVANAGARI LETTER JJA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER JJA
097C 094D 200D 093E; 097C; Indic_Atomic_Consonant # DEVANAGARI LETTER JJA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER JJA
097E 094D 093E; 097E; Indic_Atomic_Consonant # DEVANAGARI LETTER DDDA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER DDDA
097E 094D 200D 093E; 097E; Indic_Atomic_Consonant # DEVANAGARI LETTER DDDA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER DDDA
097F 094D 093E; 097F; Indic_Atomic_Consonant # DEVANAGARI LETTER BBA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER BBA
097F 094D 200D 093E; 097F; Indic_Atomic_Consonant # DEVANAGARI LETTER BBA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER BBA

# Devanagari, from Table 12-3
# Review Note: Some experts have recommended removing these, while
# others prefer keeping them. They may also be procedurally generated.
# Note: This list may be incomplete.
0915 094D 091A 094D 093E; 0915 094D 091A; Indic_Consonant_Conjunct # DEVANAGARI LETTER KA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER CA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER KA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER CA
0915 094D 091A 094D 200D 093E; 0915 094D 091A; Indic_Consonant_Conjunct # DEVANAGARI LETTER KA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER CA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER KA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER CA
0915 094D 0937 094D 093E; 0915 094D 0937; Indic_Consonant_Conjunct # DEVANAGARI LETTER KA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER SSA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER KA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER SSA
0915 094D 0937 094D 200D 093E; 0915 094D 0937; Indic_Consonant_Conjunct # DEVANAGARI LETTER KA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER SSA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER KA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER SSA
0924 094D 0924 094D 093E; 0924 094D 0924; Indic_Consonant_Conjunct # DEVANAGARI LETTER TA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER TA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER TA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER TA
0924 094D 0924 094D 200D 093E; 0924 094D 0924; Indic_Consonant_Conjunct # DEVANAGARI LETTER TA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER TA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER TA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER TA
0928 094D 0924 094D 093E; 0928 094D 0924; Indic_Consonant_Conjunct # DEVANAGARI LETTER NA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER TA, DEVANAGARI SIGN VIRAMA, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER NA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER TA
0928 094D 0924 094D 200D 093E; 0928 094D 0924; Indic_Consonant_Conjunct # DEVANAGARI LETTER NA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER TA, DEVANAGARI SIGN VIRAMA, ZERO WIDTH JOINER, DEVANAGARI VOWEL SIGN AA; DEVANAGARI LETTER NA, DEVANAGARI SIGN VIRAMA, DEVANAGARI LETTER TA

# Bengali, from Table 12-11
0985 09BE; 0986; Indic_Vowel_Letter # BENGALI LETTER A, BENGALI VOWEL SIGN AA; BENGALI LETTER AA
098B 09C3; 09E0; Indic_Vowel_Letter # BENGALI LETTER VOCALIC R, BENGALI VOWEL SIGN VOCALIC R; BENGALI LETTER VOCALIC RR
098C 09E2; 09E1; Indic_Vowel_Letter # BENGALI LETTER VOCALIC L, BENGALI VOWEL SIGN VOCALIC L; BENGALI LETTER VOCALIC LL

# Gurmukhi, from Table 12-16
0A05 0A3E; 0A06; Indic_Vowel_Letter # GURMUKHI LETTER A, GURMUKHI VOWEL SIGN AA; GURMUKHI LETTER AA
0A72 0A3F; 0A07; Indic_Vowel_Letter # GURMUKHI IRI, GURMUKHI VOWEL SIGN I; GURMUKHI LETTER I
0A72 0A40; 0A08; Indic_Vowel_Letter # GURMUKHI IRI, GURMUKHI VOWEL SIGN II; GURMUKHI LETTER II
0A73 0A41; 0A09; Indic_Vowel_Letter # GURMUKHI URA, GURMUKHI VOWEL SIGN U; GURMUKHI LETTER U
0A73 0A42; 0A0A; Indic_Vowel_Letter # GURMUKHI URA, GURMUKHI VOWEL SIGN UU; GURMUKHI LETTER UU
0A72 0A47; 0A0F; Indic_Vowel_Letter # GURMUKHI IRI, GURMUKHI VOWEL SIGN EE; GURMUKHI LETTER EE
0A05 0A48; 0A10; Indic_Vowel_Letter # GURMUKHI LETTER A, GURMUKHI VOWEL SIGN AI; GURMUKHI LETTER AI
0A73 0A4B; 0A13; Indic_Vowel_Letter # GURMUKHI URA, GURMUKHI VOWEL SIGN OO; GURMUKHI LETTER OO
0A05 0A4C; 0A14; Indic_Vowel_Letter # GURMUKHI LETTER A, GURMUKHI VOWEL SIGN AU; GURMUKHI LETTER AU

# Gujarati, from Table 12-20
0A85 0ABE; 0A86; Indic_Vowel_Letter # GUJARATI LETTER A, GUJARATI VOWEL SIGN AA; GUJARATI LETTER AA
0A85 0AC5; 0A8D; Indic_Vowel_Letter # GUJARATI LETTER A, GUJARATI VOWEL SIGN CANDRA E; GUJARATI VOWEL CANDRA E
0A85 0AC7; 0A8F; Indic_Vowel_Letter # GUJARATI LETTER A, GUJARATI VOWEL SIGN E; GUJARATI LETTER E
0A85 0AC8; 0A90; Indic_Vowel_Letter # GUJARATI LETTER A, GUJARATI VOWEL SIGN AI; GUJARATI LETTER AI
0A85 0AC9; 0A91; Indic_Vowel_Letter # GUJARATI LETTER A, GUJARATI VOWEL SIGN CANDRA O; GUJARATI VOWEL CANDRA O
0A85 0ACB; 0A93; Indic_Vowel_Letter # GUJARATI LETTER A, GUJARATI VOWEL SIGN O; GUJARATI LETTER O
0A85 0ABE 0AC5; 0A93; Indic_Vowel_Letter # GUJARATI LETTER A, GUJARATI VOWEL SIGN AA, GUJARATI VOWEL SIGN CANDRA E; GUJARATI LETTER O
0A85 0ACC; 0A94; Indic_Vowel_Letter # GUJARATI LETTER A, GUJARATI VOWEL SIGN AU; GUJARATI LETTER AU
0A85 0ABE 0AC8; 0A94; Indic_Vowel_Letter # GUJARATI LETTER A, GUJARATI VOWEL SIGN AA, GUJARATI VOWEL SIGN AI; GUJARATI LETTER AU
0AC5 0ABE; 0AC9; Indic_Vowel_Letter # GUJARATI VOWEL SIGN CANDRA E, GUJARATI VOWEL SIGN AA; GUJARATI VOWEL SIGN CANDRA O

# Oriya, from Table 12-22
0B05 0B3E; 0B06; Indic_Vowel_Letter # ORIYA LETTER A, ORIYA VOWEL SIGN AA; ORIYA LETTER AA
0B0F 0B57; 0B10; Indic_Vowel_Letter # ORIYA LETTER E, ORIYA AU LENGTH MARK; ORIYA LETTER AI
0B13 0B57; 0B14; Indic_Vowel_Letter # ORIYA LETTER O, ORIYA AU LENGTH MARK; ORIYA LETTER AU

# Tamil, from Table 12-26
0B85 0BC2; 0B86; Indic_Vowel_Letter # TAMIL LETTER A, TAMIL VOWEL SIGN UU; TAMIL LETTER AA

# Telugu, from Table 12-30
0C12 0C55; 0C13; Indic_Vowel_Letter # TELUGU LETTER O, TELUGU LENGTH MARK; TELUGU LETTER OO
0C12 0C4C; 0C14; Indic_Vowel_Letter # TELUGU LETTER O, TELUGU VOWEL SIGN AU; TELUGU LETTER AU
0C3F 0C55; 0C40; Indic_Vowel_Letter # TELUGU VOWEL SIGN I, TELUGU LENGTH MARK; TELUGU VOWEL SIGN II
0C46 0C55; 0C47; Indic_Vowel_Letter # TELUGU VOWEL SIGN E, TELUGU LENGTH MARK; TELUGU VOWEL SIGN EE
0C4A 0C55; 0C4B; Indic_Vowel_Letter # TELUGU VOWEL SIGN O, TELUGU LENGTH MARK; TELUGU VOWEL SIGN OO

# Kannada, from Table 12-31
0C89 0CBE; 0C8A; Indic_Vowel_Letter # KANNADA LETTER U, KANNADA VOWEL SIGN AA; KANNADA LETTER UU
0C92 0CCC; 0C94; Indic_Vowel_Letter # KANNADA LETTER O, KANNADA VOWEL SIGN AU; KANNADA LETTER AU
0C8B 0CBE; 0CE0; Indic_Vowel_Letter # KANNADA LETTER VOCALIC R, KANNADA VOWEL SIGN AA; KANNADA LETTER VOCALIC RR

# Malayalam, from Table 12-32
0D07 0D57; 0D08; Indic_Vowel_Letter # MALAYALAM LETTER I, MALAYALAM AU LENGTH MARK; MALAYALAM LETTER II
0D09 0D57; 0D0A; Indic_Vowel_Letter # MALAYALAM LETTER U, MALAYALAM AU LENGTH MARK; MALAYALAM LETTER UU
0D0E 0D46; 0D10; Indic_Vowel_Letter # MALAYALAM LETTER E, MALAYALAM VOWEL SIGN E; MALAYALAM LETTER AI
0D12 0D3E; 0D13; Indic_Vowel_Letter # MALAYALAM LETTER O, MALAYALAM VOWEL SIGN AA; MALAYALAM LETTER OO
0D12 0D57; 0D14; Indic_Vowel_Letter # MALAYALAM LETTER O, MALAYALAM AU LENGTH MARK; MALAYALAM LETTER AU

# Sinhala, from Table 13-2
0D85 0DCF; 0D86; Indic_Vowel_Letter # SINHALA LETTER AYANNA, SINHALA VOWEL SIGN AELA-PILLA; SINHALA LETTER AAYANNA
0D85 0DD0; 0D87; Indic_Vowel_Letter # SINHALA LETTER AYANNA, SINHALA VOWEL SIGN KETTI AEDA-PILLA; SINHALA LETTER AEYANNA
0D85 0DD1; 0D88; Indic_Vowel_Letter # SINHALA LETTER AYANNA, SINHALA VOWEL SIGN DIGA AEDA-PILLA; SINHALA LETTER AEEYANNA
0D8B 0DDF; 0D8C; Indic_Vowel_Letter # SINHALA LETTER UYANNA, SINHALA VOWEL SIGN GAYANUKITTA; SINHALA LETTER UUYANNA
0D8D 0DD8; 0D8E; Indic_Vowel_Letter # SINHALA LETTER IRUYANNA, SINHALA VOWEL SIGN GAETTA-PILLA; SINHALA LETTER IRUUYANNA
0D8F 0DDF; 0D90; Indic_Vowel_Letter # SINHALA LETTER ILUYANNA, SINHALA VOWEL SIGN GAYANUKITTA; SINHALA LETTER ILUUYANNA
0D91 0DCA; 0D92; Indic_Vowel_Letter # SINHALA LETTER EYANNA, SINHALA SIGN AL-LAKUNA; SINHALA LETTER EEYANNA
0D91 0DD9; 0D93; Indic_Vowel_Letter # SINHALA LETTER EYANNA, SINHALA VOWEL SIGN KOMBUVA; SINHALA LETTER AIYANNA
0D94 0DDF; 0D96; Indic_Vowel_Letter # SINHALA LETTER OYANNA, SINHALA VOWEL SIGN GAYANUKITTA; SINHALA LETTER AUYANNA

# Brahmi, from Table 14-1
11005 11038; 11006; Indic_Vowel_Letter # BRAHMI LETTER A, BRAHMI VOWEL SIGN AA; BRAHMI LETTER AA
1100B 1103E; 1100C; Indic_Vowel_Letter # BRAHMI LETTER VOCALIC R, BRAHMI VOWEL SIGN VOCALIC R; BRAHMI LETTER VOCALIC RR
1100F 11042; 11010; Indic_Vowel_Letter # BRAHMI LETTER E, BRAHMI VOWEL SIGN E; BRAHMI LETTER AI

# Takri, from Table 15-1
11680 116AD; 11681; Indic_Vowel_Letter # TAKRI LETTER A, TAKRI VOWEL SIGN AA; TAKRI LETTER AA
11686 116B2; 11687; Indic_Vowel_Letter # TAKRI LETTER E, TAKRI VOWEL SIGN E; TAKRI LETTER AI
11680 116B4; 11688; Indic_Vowel_Letter # TAKRI LETTER A, TAKRI VOWEL SIGN O; TAKRI LETTER O
11680 116B5; 11689; Indic_Vowel_Letter # TAKRI LETTER A, TAKRI VOWEL SIGN AU; TAKRI LETTER AU

# Khojki, from Table 15-3
11200 1122C; 11201; Indic_Vowel_Letter # KHOJKI LETTER A, KHOJKI VOWEL SIGN AA; KHOJKI LETTER AA
11240 1122E; 11202; Indic_Vowel_Letter # KHOJKI LETTER SHORT I, KHOJKI VOWEL SIGN II; KHOJKI LETTER I
11206 1122C; 11203; Indic_Vowel_Letter # KHOJKI LETTER O, KHOJKI VOWEL SIGN AA; KHOJKI LETTER U
11200 11231; 11205; Indic_Vowel_Letter # KHOJKI LETTER A, KHOJKI VOWEL SIGN AI; KHOJKI LETTER AI
11200 11233; 11207; Indic_Vowel_Letter # KHOJKI LETTER A, KHOJKI VOWEL SIGN AU; KHOJKI LETTER AU
11200 1122C 11231; 11207; Indic_Vowel_Letter # KHOJKI LETTER A, KHOJKI VOWEL SIGN AA, KHOJKI VOWEL SIGN AI; KHOJKI LETTER AU
1122C 11230; 11232; Indic_Vowel_Letter # KHOJKI VOWEL SIGN AA, KHOJKI VOWEL SIGN E; KHOJKI VOWEL SIGN O
1122C 11231; 11233; Indic_Vowel_Letter # KHOJKI VOWEL SIGN AA, KHOJKI VOWEL SIGN AI; KHOJKI VOWEL SIGN AU

# Khudawadi, from Table 15-4
112B0 112E0; 112B1; Indic_Vowel_Letter # KHUDAWADI LETTER A, KHUDAWADI VOWEL SIGN AA; KHUDAWADI LETTER AA
112B0 112E5; 112B6; Indic_Vowel_Letter # KHUDAWADI LETTER A, KHUDAWADI VOWEL SIGN E; KHUDAWADI LETTER E
112B0 112E6; 112B7; Indic_Vowel_Letter # KHUDAWADI LETTER A, KHUDAWADI VOWEL SIGN AI; KHUDAWADI LETTER AI
112B0 112E7; 112B8; Indic_Vowel_Letter # KHUDAWADI LETTER A, KHUDAWADI VOWEL SIGN O; KHUDAWADI LETTER O
112B0 112E8; 112B9; Indic_Vowel_Letter # KHUDAWADI LETTER A, KHUDAWADI VOWEL SIGN AU; KHUDAWADI LETTER AU

# Tirhuta, from Table 15-6
11481 114B0; 11482; Indic_Vowel_Letter # TIRHUTA LETTER A, TIRHUTA VOWEL SIGN AA; TIRHUTA LETTER AA
114AA 114B5; 11489; Indic_Vowel_Letter # TIRHUTA LETTER LA, TIRHUTA VOWEL SIGN VOCALIC R; TIRHUTA LETTER VOCALIC L
114AA 114B6; 1148A; Indic_Vowel_Letter # TIRHUTA LETTER LA, TIRHUTA VOWEL SIGN VOCALIC RR; TIRHUTA LETTER VOCALIC LL
1148B 114BA; 1148C; Indic_Vowel_Letter # TIRHUTA LETTER E, TIRHUTA VOWEL SIGN SHORT E; TIRHUTA LETTER AI
1148D 114BA; 1148E; Indic_Vowel_Letter # TIRHUTA LETTER O, TIRHUTA VOWEL SIGN SHORT E; TIRHUTA LETTER AU

# Modi, from Table 15-7
11600 11639; 1160A; Indic_Vowel_Letter # MODI LETTER A, MODI VOWEL SIGN E; MODI LETTER E
11600 1163A; 1160B; Indic_Vowel_Letter # MODI LETTER A, MODI VOWEL SIGN AI; MODI LETTER AI
11601 11639; 1160C; Indic_Vowel_Letter # MODI LETTER AA, MODI VOWEL SIGN E; MODI LETTER O
11601 1163A; 1160D; Indic_Vowel_Letter # MODI LETTER AA, MODI VOWEL SIGN AI; MODI LETTER AU

# ================================================
327# Deprecated characters and other discouraged characters and sequences
328# ================================================
329
330# Latin, from text of Section 7.1, the NamesList, and the uppercase mapping
0140; 006C 00B7; Preferred_Spelling # LATIN SMALL LETTER L WITH MIDDLE DOT; LATIN SMALL LETTER L, MIDDLE DOT
0149; 2019 006E; Deprecated # LATIN SMALL LETTER N PRECEDED BY APOSTROPHE; RIGHT SINGLE QUOTATION MARK, LATIN SMALL LETTER N
0131 0307; 0069 0307; Dotless_Form # LATIN SMALL LETTER DOTLESS I, COMBINING DOT ABOVE; LATIN SMALL LETTER I, COMBINING DOT ABOVE
0237 0307; 006A 0307; Dotless_Form # LATIN SMALL LETTER DOTLESS J, COMBINING DOT ABOVE; LATIN SMALL LETTER J, COMBINING DOT ABOVE
# Characters with overstruck tilde for which a precomposed form exists,
# but the sequences are not canonically equivalent
004C 0334; 2C62; Precomposed_Form # LATIN CAPITAL LETTER L, COMBINING TILDE OVERLAY; LATIN CAPITAL LETTER L WITH MIDDLE TILDE
0062 0334; 1D6C; Precomposed_Form # LATIN SMALL LETTER B, COMBINING TILDE OVERLAY; LATIN SMALL LETTER B WITH MIDDLE TILDE
0064 0334; 1D6D; Precomposed_Form # LATIN SMALL LETTER D, COMBINING TILDE OVERLAY; LATIN SMALL LETTER D WITH MIDDLE TILDE
0066 0334; 1D6E; Precomposed_Form # LATIN SMALL LETTER F, COMBINING TILDE OVERLAY; LATIN SMALL LETTER F WITH MIDDLE TILDE
006C 0334; 026B; Precomposed_Form # LATIN SMALL LETTER L, COMBINING TILDE OVERLAY; LATIN SMALL LETTER L WITH MIDDLE TILDE
006D 0334; 1D6F; Precomposed_Form # LATIN SMALL LETTER M, COMBINING TILDE OVERLAY; LATIN SMALL LETTER M WITH MIDDLE TILDE
006E 0334; 1D70; Precomposed_Form # LATIN SMALL LETTER N, COMBINING TILDE OVERLAY; LATIN SMALL LETTER N WITH MIDDLE TILDE
0070 0334; 1D71; Precomposed_Form # LATIN SMALL LETTER P, COMBINING TILDE OVERLAY; LATIN SMALL LETTER P WITH MIDDLE TILDE
0072 0334; 1D72; Precomposed_Form # LATIN SMALL LETTER R, COMBINING TILDE OVERLAY; LATIN SMALL LETTER R WITH MIDDLE TILDE
0073 0334; 1D74; Precomposed_Form # LATIN SMALL LETTER S, COMBINING TILDE OVERLAY; LATIN SMALL LETTER S WITH MIDDLE TILDE
0074 0334; 1D75; Precomposed_Form # LATIN SMALL LETTER T, COMBINING TILDE OVERLAY; LATIN SMALL LETTER T WITH MIDDLE TILDE
007A 0334; 1D76; Precomposed_Form # LATIN SMALL LETTER Z, COMBINING TILDE OVERLAY; LATIN SMALL LETTER Z WITH MIDDLE TILDE
0279 0334; AB68; Precomposed_Form # LATIN SMALL LETTER TURNED R, COMBINING TILDE OVERLAY; LATIN SMALL LETTER TURNED R WITH MIDDLE TILDE
027E 0334; 1D73; Precomposed_Form # LATIN SMALL LETTER R WITH FISHHOOK, COMBINING TILDE OVERLAY; LATIN SMALL LETTER R WITH FISHHOOK AND MIDDLE TILDE
02E1 0334; AB5E; Precomposed_Form # MODIFIER LETTER SMALL L, COMBINING TILDE OVERLAY; MODIFIER LETTER SMALL L WITH MIDDLE TILDE
# Characters with palatalized hook for which a precomposed form exists,
# but the sequences are not canonically equivalent
0043 0321; A7C4; Precomposed_Form # LATIN CAPITAL LETTER C, COMBINING PALATALIZED HOOK BELOW; LATIN CAPITAL LETTER C WITH PALATAL HOOK
005A 0321; A7C6; Precomposed_Form # LATIN CAPITAL LETTER Z, COMBINING PALATALIZED HOOK BELOW; LATIN CAPITAL LETTER Z WITH PALATAL HOOK
0062 0321; 1D80; Precomposed_Form # LATIN SMALL LETTER B, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER B WITH PALATAL HOOK
0063 0321; A794; Precomposed_Form # LATIN SMALL LETTER C, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER C WITH PALATAL HOOK
0064 0321; 1D81; Precomposed_Form # LATIN SMALL LETTER D, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER D WITH PALATAL HOOK
0066 0321; 1D82; Precomposed_Form # LATIN SMALL LETTER F, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER F WITH PALATAL HOOK
0068 0321; A795; Precomposed_Form # LATIN SMALL LETTER H, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER H WITH PALATAL HOOK
006B 0321; 1D84; Precomposed_Form # LATIN SMALL LETTER K, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER K WITH PALATAL HOOK
006C 0321; 1D85; Precomposed_Form # LATIN SMALL LETTER L, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER L WITH PALATAL HOOK
006D 0321; 1D86; Precomposed_Form # LATIN SMALL LETTER M, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER M WITH PALATAL HOOK
006E 0321; 1D87; Precomposed_Form # LATIN SMALL LETTER N, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER N WITH PALATAL HOOK
0070 0321; 1D88; Precomposed_Form # LATIN SMALL LETTER P, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER P WITH PALATAL HOOK
0072 0321; 1D89; Precomposed_Form # LATIN SMALL LETTER R, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER R WITH PALATAL HOOK
0073 0321; 1D8A; Precomposed_Form # LATIN SMALL LETTER S, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER S WITH PALATAL HOOK
0074 0321; 01AB; Precomposed_Form # LATIN SMALL LETTER T, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER T WITH PALATAL HOOK
0076 0321; 1D8C; Precomposed_Form # LATIN SMALL LETTER V, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER V WITH PALATAL HOOK
0078 0321; 1D8D; Precomposed_Form # LATIN SMALL LETTER X, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER X WITH PALATAL HOOK
007A 0321; 1D8E; Precomposed_Form # LATIN SMALL LETTER Z, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER Z WITH PALATAL HOOK
014B 0321; 1DF14; Precomposed_Form # LATIN SMALL LETTER ENG, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER ENG WITH PALATAL HOOK
0261 0321; 1D83; Precomposed_Form # LATIN SMALL LETTER SCRIPT G, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER G WITH PALATAL HOOK
026C 0321; 1DF13; Precomposed_Form # LATIN SMALL LETTER L WITH BELT, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER L WITH BELT AND PALATAL HOOK
0279 0321; 1DF15; Precomposed_Form # LATIN SMALL LETTER TURNED R, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER TURNED R WITH PALATAL HOOK
027E 0321; 1DF16; Precomposed_Form # LATIN SMALL LETTER R WITH FISHHOOK, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER R WITH FISHHOOK AND PALATAL HOOK
0283 0321; 1D8B; Precomposed_Form # LATIN SMALL LETTER ESH, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER ESH WITH PALATAL HOOK
0292 0321; 1DF18; Precomposed_Form # LATIN SMALL LETTER EZH, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER EZH WITH PALATAL HOOK
02A4 0321; 1DF12; Precomposed_Form # LATIN SMALL LETTER DEZH DIGRAPH, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER DEZH DIGRAPH WITH PALATAL HOOK
02A7 0321; 1DF17; Precomposed_Form # LATIN SMALL LETTER TESH DIGRAPH, COMBINING PALATALIZED HOOK BELOW; LATIN SMALL LETTER TESH DIGRAPH WITH PALATAL HOOK
02E1 0321; 1DAA; Precomposed_Form # MODIFIER LETTER SMALL L, COMBINING PALATALIZED HOOK BELOW; MODIFIER LETTER SMALL L WITH PALATAL HOOK
1D57 0321; 1DB5; Precomposed_Form # MODIFIER LETTER SMALL T, COMBINING PALATALIZED HOOK BELOW; MODIFIER LETTER SMALL T WITH PALATAL HOOK
# Characters with retroflex hook for which a precomposed form exists,
# but the sequences are not canonically equivalent
0052 0322; 2C64; Precomposed_Form # LATIN CAPITAL LETTER R, COMBINING RETROFLEX HOOK BELOW; LATIN CAPITAL LETTER R WITH TAIL
0054 0322; 01AE; Precomposed_Form # LATIN CAPITAL LETTER T, COMBINING RETROFLEX HOOK BELOW; LATIN CAPITAL LETTER T WITH RETROFLEX HOOK
0061 0322; 1D8F; Precomposed_Form # LATIN SMALL LETTER A, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER A WITH RETROFLEX HOOK
0063 0322; 1DF1D; Precomposed_Form # LATIN SMALL LETTER C, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER C WITH RETROFLEX HOOK
0064 0322; 0256; Precomposed_Form # LATIN SMALL LETTER D, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER D WITH TAIL
0065 0322; 1D92; Precomposed_Form # LATIN SMALL LETTER E, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER E WITH RETROFLEX HOOK
0069 0322; 1D96; Precomposed_Form # LATIN SMALL LETTER I, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER I WITH RETROFLEX HOOK
006C 0322; 026D; Precomposed_Form # LATIN SMALL LETTER L, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER L WITH RETROFLEX HOOK
006E 0322; 0273; Precomposed_Form # LATIN SMALL LETTER N, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER N WITH RETROFLEX HOOK
006F 0322; 1DF1B; Precomposed_Form # LATIN SMALL LETTER O, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER O WITH RETROFLEX HOOK
0072 0322; 027D; Precomposed_Form # LATIN SMALL LETTER R, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER R WITH TAIL
0074 0322; 0288; Precomposed_Form # LATIN SMALL LETTER T, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER T WITH RETROFLEX HOOK
0075 0322; 1D99; Precomposed_Form # LATIN SMALL LETTER U, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER U WITH RETROFLEX HOOK
007A 0322; 0290; Precomposed_Form # LATIN SMALL LETTER Z, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER Z WITH RETROFLEX HOOK
01AD 0322; 1DF09; Precomposed_Form # LATIN SMALL LETTER T WITH HOOK, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK
01C3 0322; 1DF0A; Precomposed_Form # LATIN LETTER RETROFLEX CLICK, COMBINING RETROFLEX HOOK BELOW; LATIN LETTER RETROFLEX CLICK WITH RETROFLEX HOOK
0251 0322; 1D90; Precomposed_Form # LATIN SMALL LETTER ALPHA, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER ALPHA WITH RETROFLEX HOOK
0254 0322; 1D97; Precomposed_Form # LATIN SMALL LETTER OPEN O, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER OPEN O WITH RETROFLEX HOOK
0257 0322; 1D91; Precomposed_Form # LATIN SMALL LETTER D WITH HOOK, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER D WITH HOOK AND TAIL
0259 0322; 1D95; Precomposed_Form # LATIN SMALL LETTER SCHWA, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER SCHWA WITH RETROFLEX HOOK
025B 0322; 1D93; Precomposed_Form # LATIN SMALL LETTER OPEN E, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER OPEN E WITH RETROFLEX HOOK
025C 0322; 1D94; Precomposed_Form # LATIN SMALL LETTER REVERSED OPEN E, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER REVERSED OPEN E WITH RETROFLEX HOOK
0268 0322; 1DF1A; Precomposed_Form # LATIN SMALL LETTER I WITH STROKE, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER I WITH STROKE AND RETROFLEX HOOK
026C 0322; A78E; Precomposed_Form # LATIN SMALL LETTER L WITH BELT, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER L WITH RETROFLEX HOOK AND BELT
026E 0322; 1DF05; Precomposed_Form # LATIN SMALL LETTER LEZH, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER LEZH WITH RETROFLEX HOOK
027A 0322; 1DF08; Precomposed_Form # LATIN SMALL LETTER TURNED R WITH LONG LEG, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER TURNED R WITH LONG LEG AND RETROFLEX HOOK
0283 0322; 1D98; Precomposed_Form # LATIN SMALL LETTER ESH, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER ESH WITH RETROFLEX HOOK
0292 0322; 1D9A; Precomposed_Form # LATIN SMALL LETTER EZH, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER EZH WITH RETROFLEX HOOK
02A3 0322; AB66; Precomposed_Form # LATIN SMALL LETTER DZ DIGRAPH, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER DZ DIGRAPH WITH RETROFLEX HOOK
02A4 0322; 1DF19; Precomposed_Form # LATIN SMALL LETTER DEZH DIGRAPH, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER DEZH DIGRAPH WITH RETROFLEX HOOK
02A6 0322; AB67; Precomposed_Form # LATIN SMALL LETTER TS DIGRAPH, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER TS DIGRAPH WITH RETROFLEX HOOK
02A7 0322; 1DF1C; Precomposed_Form # LATIN SMALL LETTER TESH DIGRAPH, COMBINING RETROFLEX HOOK BELOW; LATIN SMALL LETTER TESH DIGRAPH WITH RETROFLEX HOOK
02B3 0322; 107A8; Precomposed_Form # MODIFIER LETTER SMALL R, COMBINING RETROFLEX HOOK BELOW; MODIFIER LETTER SMALL R WITH TAIL
02E1 0322; 1DA9; Precomposed_Form # MODIFIER LETTER SMALL L, COMBINING RETROFLEX HOOK BELOW; MODIFIER LETTER SMALL L WITH RETROFLEX HOOK
1D48 0322; 1078B; Precomposed_Form # MODIFIER LETTER SMALL D, COMBINING RETROFLEX HOOK BELOW; MODIFIER LETTER SMALL D WITH TAIL
1D57 0322; 107AF; Precomposed_Form # MODIFIER LETTER SMALL T, COMBINING RETROFLEX HOOK BELOW; MODIFIER LETTER SMALL T WITH RETROFLEX HOOK
1DBB 0322; 1DBC; Precomposed_Form # MODIFIER LETTER SMALL Z, COMBINING RETROFLEX HOOK BELOW; MODIFIER LETTER SMALL Z WITH RETROFLEX HOOK
207F 0322; 1DAF; Precomposed_Form # SUPERSCRIPT LATIN SMALL LETTER N, COMBINING RETROFLEX HOOK BELOW; MODIFIER LETTER SMALL N WITH RETROFLEX HOOK
10787 0322; 10788; Precomposed_Form # MODIFIER LETTER SMALL DZ DIGRAPH, COMBINING RETROFLEX HOOK BELOW; MODIFIER LETTER SMALL DZ DIGRAPH WITH RETROFLEX HOOK
1078C 0322; 1078D; Precomposed_Form # MODIFIER LETTER SMALL D WITH HOOK, COMBINING RETROFLEX HOOK BELOW; MODIFIER LETTER SMALL D WITH HOOK AND TAIL
1079B 0322; 1079D; Precomposed_Form # MODIFIER LETTER SMALL L WITH BELT, COMBINING RETROFLEX HOOK BELOW; MODIFIER LETTER SMALL L WITH RETROFLEX HOOK AND BELT
1079E 0322; 1079F; Precomposed_Form # MODIFIER LETTER SMALL LEZH, COMBINING RETROFLEX HOOK BELOW; MODIFIER LETTER SMALL LEZH WITH RETROFLEX HOOK
107A6 0322; 107A7; Precomposed_Form # MODIFIER LETTER SMALL TURNED R WITH LONG LEG, COMBINING RETROFLEX HOOK BELOW; MODIFIER LETTER SMALL TURNED R WITH LONG LEG AND RETROFLEX HOOK
107AC 0322; 107AD; Precomposed_Form # MODIFIER LETTER SMALL TS DIGRAPH, COMBINING RETROFLEX HOOK BELOW; MODIFIER LETTER SMALL TS DIGRAPH WITH RETROFLEX HOOK

# Arabic, from text of Section 9.2 and the NamesList
0649 0654; 0626; Hamza_Form # ARABIC LETTER ALEF MAKSURA, ARABIC HAMZA ABOVE; ARABIC LETTER YEH WITH HAMZA ABOVE
0673; 0627 065F; Deprecated # ARABIC LETTER ALEF WITH WAVY HAMZA BELOW; ARABIC LETTER ALEF, ARABIC WAVY HAMZA BELOW
0675; 0674 0627; Preferred_Spelling # ARABIC LETTER HIGH HAMZA ALEF; ARABIC LETTER HIGH HAMZA, ARABIC LETTER ALEF
0676; 0674 0648; Preferred_Spelling # ARABIC LETTER HIGH HAMZA WAW; ARABIC LETTER HIGH HAMZA, ARABIC LETTER WAW
0677; 0674 06C7; Preferred_Spelling # ARABIC LETTER U WITH HAMZA ABOVE; ARABIC LETTER HIGH HAMZA, ARABIC LETTER U
0678; 0674 0649; Preferred_Spelling # ARABIC LETTER HIGH HAMZA YEH; ARABIC LETTER HIGH HAMZA, ARABIC LETTER ALEF MAKSURA

# Devanagari, from Section 12.1 and the NamesList
0953; 0300; Discouraged # DEVANAGARI GRAVE ACCENT; COMBINING GRAVE ACCENT
0954; 0301; Discouraged # DEVANAGARI ACUTE ACCENT; COMBINING ACUTE ACCENT

# Bengali, from Section 12.2
09A4 09CD 200D; 09CE; Bengali_Khanda_Ta # BENGALI LETTER TA, BENGALI SIGN VIRAMA, ZERO WIDTH JOINER; BENGALI LETTER KHANDA TA

# Gujarati, from the NamesList
0AF1; 0AB0 0AC2 0AF0; Preferred_Spelling # GUJARATI RUPEE SIGN; GUJARATI LETTER RA, GUJARATI VOWEL SIGN UU, GUJARATI ABBREVIATION SIGN

# Tamil ligature shri
0BB8 0BCD 0BB0 0BC0; 0BB6 0BCD 0BB0 0BC0; Tamil_Shrii # TAMIL LETTER SA, TAMIL SIGN VIRAMA, TAMIL LETTER RA, TAMIL VOWEL SIGN II; TAMIL LETTER SHA, TAMIL SIGN VIRAMA, TAMIL LETTER RA, TAMIL VOWEL SIGN II

# Malayalam Chillus, from Table 12-40
0D23 0D4D 200D; 0D7A; Malayalam_Chillu # MALAYALAM LETTER NNA, MALAYALAM SIGN VIRAMA, ZERO WIDTH JOINER; MALAYALAM LETTER CHILLU NN
0D28 0D4D 200D; 0D7B; Malayalam_Chillu # MALAYALAM LETTER NA, MALAYALAM SIGN VIRAMA, ZERO WIDTH JOINER; MALAYALAM LETTER CHILLU N
0D30 0D4D 200D; 0D7C; Malayalam_Chillu # MALAYALAM LETTER RA, MALAYALAM SIGN VIRAMA, ZERO WIDTH JOINER; MALAYALAM LETTER CHILLU RR
0D32 0D4D 200D; 0D7D; Malayalam_Chillu # MALAYALAM LETTER LA, MALAYALAM SIGN VIRAMA, ZERO WIDTH JOINER; MALAYALAM LETTER CHILLU L
0D33 0D4D 200D; 0D7E; Malayalam_Chillu # MALAYALAM LETTER LLA, MALAYALAM SIGN VIRAMA, ZERO WIDTH JOINER; MALAYALAM LETTER CHILLU LL

# Tibetan, from text of Section 13.4, the NamesList, and the decompositions
0F77; 0FB2 0F71 0F80; Deprecated # TIBETAN VOWEL SIGN VOCALIC RR; TIBETAN SUBJOINED LETTER RA, TIBETAN VOWEL SIGN AA, TIBETAN VOWEL SIGN REVERSED I
0F79; 0FB3 0F71 0F80; Deprecated # TIBETAN VOWEL SIGN VOCALIC LL; TIBETAN SUBJOINED LETTER LA, TIBETAN VOWEL SIGN AA, TIBETAN VOWEL SIGN REVERSED I

# Khmer, from text of Section 16.4 and the NamesList
17A3; 17A2; Deprecated # KHMER INDEPENDENT VOWEL QAQ; KHMER LETTER QA
17A4; 17A2 17B6; Deprecated # KHMER INDEPENDENT VOWEL QAA; KHMER LETTER QA, KHMER VOWEL SIGN AA
17D8; 17D4 179B 17D4; Discouraged # KHMER SIGN BEYYAL; KHMER SIGN KHAN, KHMER LETTER LO, KHMER SIGN KHAN
17E8 17D3; 19E0; Discouraged # KHMER DIGIT EIGHT, KHMER SIGN BATHAMASAT; KHMER SYMBOL PATHAMASAT

# Sharada, from the NamesList, and glyph shape of U+1118E
1118D 111BC; 1118E; Indic_Vowel_Letter # SHARADA LETTER E, SHARADA VOWEL SIGN E; SHARADA LETTER AI
111C4; 1118F 11180; Discouraged # SHARADA OM; SHARADA LETTER O, SHARADA SIGN CANDRABINDU

# EOF
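
The data lines above follow the pattern `source; replacement; type # comment`, where `source` and `replacement` are space-separated hexadecimal code points. As a minimal sketch of how an application might consume them, the Python below parses such lines into a lookup table and applies it to a string. The names `DoNotEmitEntry`, `parse_line`, `load_table`, and `replace_do_not_emit` are hypothetical and not part of the Unicode Character Database or of zg. The sketch assumes every data line has exactly three fields, and it uses a naive longest-first substring replacement that does not guard against one replacement producing another listed sequence.

```python
from typing import Dict, Iterable, NamedTuple, Optional


class DoNotEmitEntry(NamedTuple):
    """One data line (hypothetical container, not an official structure)."""
    source: str       # sequence that should not be emitted
    replacement: str  # preferred sequence
    kind: str         # type field, e.g. "Precomposed_Form" or "Deprecated"


def parse_code_points(field: str) -> str:
    """Turn space-separated hex code points into a Python string."""
    return "".join(chr(int(cp, 16)) for cp in field.split())


def parse_line(line: str) -> Optional[DoNotEmitEntry]:
    """Parse one line; return None for blank or comment-only lines."""
    data = line.split("#", 1)[0].strip()  # drop the trailing comment
    if not data:
        return None
    # Assumption: exactly three ';'-separated fields per data line.
    source, replacement, kind = (f.strip() for f in data.split(";"))
    return DoNotEmitEntry(parse_code_points(source),
                          parse_code_points(replacement),
                          kind)


def load_table(lines: Iterable[str]) -> Dict[str, DoNotEmitEntry]:
    """Build a mapping from discouraged sequence to its entry."""
    entries = (parse_line(line) for line in lines)
    return {e.source: e for e in entries if e is not None}


def replace_do_not_emit(text: str, table: Dict[str, DoNotEmitEntry]) -> str:
    """Naively replace listed sequences, longest source first."""
    for source in sorted(table, key=len, reverse=True):
        text = text.replace(source, table[source].replacement)
    return text


sample = [
    "# Latin",
    "0149; 2019 006E; Deprecated # LATIN SMALL LETTER N PRECEDED BY APOSTROPHE; RIGHT SINGLE QUOTATION MARK, LATIN SMALL LETTER N",
    "006C 0334; 026B; Precomposed_Form # LATIN SMALL LETTER L, COMBINING TILDE OVERLAY; LATIN SMALL LETTER L WITH MIDDLE TILDE",
    "",
]
table = load_table(sample)
fixed = replace_do_not_emit("\u0149 l\u0334", table)
```

Splitting on `#` before splitting on `;` matters because the comment part itself contains semicolons. An input method or linter could run `replace_do_not_emit` on newly typed text, while a search or collation layer might instead use the table only to treat both forms as similar.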