summaryrefslogtreecommitdiff
path: root/data/unicode/IndicSyllabicCategory.txt
diff options
context:
space:
mode:
authorGravatar Sam Atman2025-04-30 20:32:23 -0400
committerGravatar Sam Atman2025-04-30 20:32:23 -0400
commita7164d9e7b3c3ec6813e06a42d82180d766e15ca (patch)
treeb9c55a45ddac98e51653cb64d39b6b26cfb50362 /data/unicode/IndicSyllabicCategory.txt
parentAllocation Failure Tests (diff)
downloadzg-a7164d9e7b3c3ec6813e06a42d82180d766e15ca.tar.gz
zg-a7164d9e7b3c3ec6813e06a42d82180d766e15ca.tar.xz
zg-a7164d9e7b3c3ec6813e06a42d82180d766e15ca.zip
Unicode 16.0
Went smoothly, needed to add some scripts and adjust the magic numbers, but other than that, all set.
Diffstat (limited to 'data/unicode/IndicSyllabicCategory.txt')
-rw-r--r--data/unicode/IndicSyllabicCategory.txt99
1 files changed, 76 insertions, 23 deletions
diff --git a/data/unicode/IndicSyllabicCategory.txt b/data/unicode/IndicSyllabicCategory.txt
index f2623b4..dc07604 100644
--- a/data/unicode/IndicSyllabicCategory.txt
+++ b/data/unicode/IndicSyllabicCategory.txt
@@ -1,11 +1,11 @@
1# IndicSyllabicCategory-15.1.0.txt 1# IndicSyllabicCategory-16.0.0.txt
2# Date: 2023-01-05 2# Date: 2024-04-30, 21:48:21 GMT
3# © 2023 Unicode®, Inc. 3# © 2024 Unicode®, Inc.
4# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries. 4# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
5# For terms of use, see https://www.unicode.org/terms_of_use.html 5# For terms of use and license, see https://www.unicode.org/terms_of_use.html
6# 6#
7# For documentation, see UAX #44: Unicode Character Database, 7# Unicode Character Database
8# at https://www.unicode.org/reports/tr44/ 8# For documentation, see https://www.unicode.org/reports/tr44/
9# 9#
10# This file defines the following property: 10# This file defines the following property:
11# 11#
@@ -37,13 +37,14 @@
37# 37#
38# Ahom, Balinese, Batak, Bengali, Bhaiksuki, Brahmi, Buginese, Buhid, 38# Ahom, Balinese, Batak, Bengali, Bhaiksuki, Brahmi, Buginese, Buhid,
39# Chakma, Cham, Devanagari, Dives Akuru, Dogra, Grantha, Gujarati, 39# Chakma, Cham, Devanagari, Dives Akuru, Dogra, Grantha, Gujarati,
40# Gunjala Gondi, Gurmukhi, Hanunoo, Javanese, Kaithi, Kannada, Kawi, 40# Gunjala Gondi, Gurmukhi, Gurung Khema, Hanunoo, Javanese, Kaithi,
41# Kayah Li, Kharoshthi, Khmer, Khojki, Khudawadi, Lao, Lepcha, Limbu, 41# Kannada, Kawi, Kayah Li, Kharoshthi, Khmer, Khojki, Khudawadi,
42# Mahajani, Makasar, Malayalam, Marchen, Masaram Gondi, Meetei Mayek, 42# Kirat Rai, Lao, Lepcha, Limbu, Mahajani, Makasar, Malayalam,
43# Modi, Multani, Myanmar, Nandinagari, Newa, New Tai Lue, Oriya, 43# Marchen, Masaram Gondi, Meetei Mayek, Modi, Multani, Myanmar,
44# Phags-pa, Rejang, Saurashtra, Sharada, Siddham, Sinhala, Soyombo, 44# Nandinagari, Newa, New Tai Lue, Oriya, Phags-pa, Rejang,
45# Sundanese, Syloti Nagri, Tagalog, Tagbanwa, Tai Le, Tai Tham, 45# Saurashtra, Sharada, Siddham, Sinhala, Soyombo, Sundanese,
46# Tai Viet, Takri, Tamil, Telugu, Thai, Tibetan, Tirhuta, and 46# Syloti Nagri, Tagalog, Tagbanwa, Tai Le, Tai Tham, Tai Viet, Takri,
47# Tamil, Telugu, Thai, Tibetan, Tirhuta, Tulu-Tigalari, and
47# Zanabazar Square. 48# Zanabazar Square.
48# 49#
49# All characters for all other scripts not in that list 50# All characters for all other scripts not in that list
@@ -119,6 +120,8 @@ A980..A981 ; Bindu # Mn [2] JAVANESE SIGN PANYANGGA..JAVANESE SIGN CECAK
11911300..11301 ; Bindu # Mn [2] GRANTHA SIGN COMBINING ANUSVARA ABOVE..GRANTHA SIGN CANDRABINDU 12011300..11301 ; Bindu # Mn [2] GRANTHA SIGN COMBINING ANUSVARA ABOVE..GRANTHA SIGN CANDRABINDU
12011302 ; Bindu # Mc GRANTHA SIGN ANUSVARA 12111302 ; Bindu # Mc GRANTHA SIGN ANUSVARA
1211135E..1135F ; Bindu # Lo [2] GRANTHA LETTER VEDIC ANUSVARA..GRANTHA LETTER VEDIC DOUBLE ANUSVARA 1221135E..1135F ; Bindu # Lo [2] GRANTHA LETTER VEDIC ANUSVARA..GRANTHA LETTER VEDIC DOUBLE ANUSVARA
123113CA ; Bindu # Mc TULU-TIGALARI SIGN CANDRA ANUNASIKA
124113CC ; Bindu # Mc TULU-TIGALARI SIGN ANUSVARA
12211443..11444 ; Bindu # Mn [2] NEWA SIGN CANDRABINDU..NEWA SIGN ANUSVARA 12511443..11444 ; Bindu # Mn [2] NEWA SIGN CANDRABINDU..NEWA SIGN ANUSVARA
1231145F ; Bindu # Lo NEWA LETTER VEDIC ANUSVARA 1261145F ; Bindu # Lo NEWA LETTER VEDIC ANUSVARA
124114BF..114C0 ; Bindu # Mn [2] TIRHUTA SIGN CANDRABINDU..TIRHUTA SIGN ANUSVARA 127114BF..114C0 ; Bindu # Mn [2] TIRHUTA SIGN CANDRABINDU..TIRHUTA SIGN ANUSVARA
@@ -135,6 +138,8 @@ A980..A981 ; Bindu # Mn [2] JAVANESE SIGN PANYANGGA..JAVANESE SIGN CECAK
13511D40 ; Bindu # Mn MASARAM GONDI SIGN ANUSVARA 13811D40 ; Bindu # Mn MASARAM GONDI SIGN ANUSVARA
13611D95 ; Bindu # Mn GUNJALA GONDI SIGN ANUSVARA 13911D95 ; Bindu # Mn GUNJALA GONDI SIGN ANUSVARA
13711F00..11F01 ; Bindu # Mn [2] KAWI SIGN CANDRABINDU..KAWI SIGN ANUSVARA 14011F00..11F01 ; Bindu # Mn [2] KAWI SIGN CANDRABINDU..KAWI SIGN ANUSVARA
1411612D ; Bindu # Mn GURUNG KHEMA SIGN ANUSVARA
14216D40..16D41 ; Bindu # Lm [2] KIRAT RAI SIGN ANUSVARA..KIRAT RAI SIGN TONPI
138 143
139# ================================================ 144# ================================================
140 145
@@ -169,6 +174,7 @@ AAF5 ; Visarga # Mc MEETEI MAYEK VOWEL SIGN VISARGA
16911102 ; Visarga # Mn CHAKMA SIGN VISARGA 17411102 ; Visarga # Mn CHAKMA SIGN VISARGA
17011182 ; Visarga # Mc SHARADA SIGN VISARGA 17511182 ; Visarga # Mc SHARADA SIGN VISARGA
17111303 ; Visarga # Mc GRANTHA SIGN VISARGA 17611303 ; Visarga # Mc GRANTHA SIGN VISARGA
177113CD ; Visarga # Mc TULU-TIGALARI SIGN VISARGA
17211445 ; Visarga # Mc NEWA SIGN VISARGA 17811445 ; Visarga # Mc NEWA SIGN VISARGA
173114C1 ; Visarga # Mc TIRHUTA SIGN VISARGA 179114C1 ; Visarga # Mc TIRHUTA SIGN VISARGA
174115BE ; Visarga # Mc SIDDHAM SIGN VISARGA 180115BE ; Visarga # Mc SIDDHAM SIGN VISARGA
@@ -182,6 +188,7 @@ AAF5 ; Visarga # Mc MEETEI MAYEK VOWEL SIGN VISARGA
18211D41 ; Visarga # Mn MASARAM GONDI SIGN VISARGA 18811D41 ; Visarga # Mn MASARAM GONDI SIGN VISARGA
18311D96 ; Visarga # Mc GUNJALA GONDI SIGN VISARGA 18911D96 ; Visarga # Mc GUNJALA GONDI SIGN VISARGA
18411F03 ; Visarga # Mc KAWI SIGN VISARGA 19011F03 ; Visarga # Mc KAWI SIGN VISARGA
19116D42 ; Visarga # Lm KIRAT RAI SIGN VISARGA
185 192
186# ================================================ 193# ================================================
187 194
@@ -203,6 +210,7 @@ AAF5 ; Visarga # Mc MEETEI MAYEK VOWEL SIGN VISARGA
2031BBA ; Avagraha # Lo SUNDANESE AVAGRAHA 2101BBA ; Avagraha # Lo SUNDANESE AVAGRAHA
204111C1 ; Avagraha # Lo SHARADA SIGN AVAGRAHA 211111C1 ; Avagraha # Lo SHARADA SIGN AVAGRAHA
2051133D ; Avagraha # Lo GRANTHA SIGN AVAGRAHA 2121133D ; Avagraha # Lo GRANTHA SIGN AVAGRAHA
213113B7 ; Avagraha # Lo TULU-TIGALARI SIGN AVAGRAHA
20611447 ; Avagraha # Lo NEWA SIGN AVAGRAHA 21411447 ; Avagraha # Lo NEWA SIGN AVAGRAHA
207114C4 ; Avagraha # Lo TIRHUTA SIGN AVAGRAHA 215114C4 ; Avagraha # Lo TIRHUTA SIGN AVAGRAHA
208119E1 ; Avagraha # Lo NANDINAGARI SIGN AVAGRAHA 216119E1 ; Avagraha # Lo NANDINAGARI SIGN AVAGRAHA
@@ -249,19 +257,21 @@ A9B3 ; Nukta # Mn JAVANESE SIGN CECAK TELU
2491183A ; Nukta # Mn DOGRA SIGN NUKTA 2571183A ; Nukta # Mn DOGRA SIGN NUKTA
25011943 ; Nukta # Mn DIVES AKURU SIGN NUKTA 25811943 ; Nukta # Mn DIVES AKURU SIGN NUKTA
25111D42 ; Nukta # Mn MASARAM GONDI SIGN NUKTA 25911D42 ; Nukta # Mn MASARAM GONDI SIGN NUKTA
26011F5A ; Nukta # Mn KAWI SIGN NUKTA
252 261
253# ================================================ 262# ================================================
254 263
255# Indic_Syllabic_Category=Virama 264# Indic_Syllabic_Category=Virama
256 265
257# Virama (killing of inherent vowel in consonant sequence 266# Virama (kills inherent vowel of consonant; may act as a Pure_Killer
258# or consonant stacker) 267# or Invisible_Stacker depending on context)
259# Only includes characters that can act both as visible killer viramas 268# Only includes characters that can act both as visible killer viramas
260# and consonant stackers. Separate property values exist for characters 269# and consonant stackers. Separate property values exist for characters
261# that can only act as pure killers or only as consonant stackers. 270# that can only act as pure killers, only as reordering killers, or only
271# as consonant stackers.
262 272
263# [Derivation: (ccc=9) - (InSC=Pure_Killer) - (InSC=Invisible_Stacker) 273# [Derivation: (ccc=9) - (InSC=Pure_Killer) - (InSC=Invisible_Stacker)
264# - (InSC=Number_Joiner) - 2D7F] 274# - (InSC=Reordering_Killer) - (InSC=Number_Joiner) - 2D7F]
265 275
266094D ; Virama # Mn DEVANAGARI SIGN VIRAMA 276094D ; Virama # Mn DEVANAGARI SIGN VIRAMA
26709CD ; Virama # Mn BENGALI SIGN VIRAMA 27709CD ; Virama # Mn BENGALI SIGN VIRAMA
@@ -295,8 +305,9 @@ A9C0 ; Virama # Mc JAVANESE PANGKON
295 305
296# Indic_Syllabic_Category=Pure_Killer 306# Indic_Syllabic_Category=Pure_Killer
297 307
298# Pure killer (killing of inherent vowel in consonant sequence, 308# Pure killer (kills inherent vowel of consonant; always visible;
299# with no consonant stacking behavior) 309# has no conjuct formation, consonant stacking, or reordering
310# behavior)
300 311
301# [Not derivable] 312# [Not derivable]
302 313
@@ -312,24 +323,40 @@ A9C0 ; Virama # Mc JAVANESE PANGKON
31217D1 ; Pure_Killer # Mn KHMER SIGN VIRIAM 32317D1 ; Pure_Killer # Mn KHMER SIGN VIRIAM
3131A7A ; Pure_Killer # Mn TAI THAM SIGN RA HAAM 3241A7A ; Pure_Killer # Mn TAI THAM SIGN RA HAAM
3141BAA ; Pure_Killer # Mc SUNDANESE SIGN PAMAAEH 3251BAA ; Pure_Killer # Mc SUNDANESE SIGN PAMAAEH
3151BF2..1BF3 ; Pure_Killer # Mc [2] BATAK PANGOLAT..BATAK PANONGONAN
316A82C ; Pure_Killer # Mn SYLOTI NAGRI SIGN ALTERNATE HASANTA 326A82C ; Pure_Killer # Mn SYLOTI NAGRI SIGN ALTERNATE HASANTA
317A953 ; Pure_Killer # Mc REJANG VIRAMA 327A953 ; Pure_Killer # Mc REJANG VIRAMA
318ABED ; Pure_Killer # Mn MEETEI MAYEK APUN IYEK 328ABED ; Pure_Killer # Mn MEETEI MAYEK APUN IYEK
31911070 ; Pure_Killer # Mn BRAHMI SIGN OLD TAMIL VIRAMA 32911070 ; Pure_Killer # Mn BRAHMI SIGN OLD TAMIL VIRAMA
32011134 ; Pure_Killer # Mn CHAKMA MAAYYAA 33011134 ; Pure_Killer # Mn CHAKMA MAAYYAA
321112EA ; Pure_Killer # Mn KHUDAWADI SIGN VIRAMA 331112EA ; Pure_Killer # Mn KHUDAWADI SIGN VIRAMA
332113CE ; Pure_Killer # Mn TULU-TIGALARI SIGN VIRAMA
333113CF ; Pure_Killer # Mc TULU-TIGALARI SIGN LOOPED VIRAMA
3221172B ; Pure_Killer # Mn AHOM SIGN KILLER 3341172B ; Pure_Killer # Mn AHOM SIGN KILLER
3231193D ; Pure_Killer # Mc DIVES AKURU SIGN HALANTA 3351193D ; Pure_Killer # Mc DIVES AKURU SIGN HALANTA
32411A34 ; Pure_Killer # Mn ZANABAZAR SQUARE SIGN VIRAMA 33611A34 ; Pure_Killer # Mn ZANABAZAR SQUARE SIGN VIRAMA
32511D44 ; Pure_Killer # Mn MASARAM GONDI SIGN HALANTA 33711D44 ; Pure_Killer # Mn MASARAM GONDI SIGN HALANTA
32611F41 ; Pure_Killer # Mc KAWI SIGN KILLER 33811F41 ; Pure_Killer # Mc KAWI SIGN KILLER
3391612F ; Pure_Killer # Mn GURUNG KHEMA SIGN THOLHOMA
34016D6B..16D6C ; Pure_Killer # Lm [2] KIRAT RAI SIGN VIRAMA..KIRAT RAI SIGN SAAT
341
342# ================================================
343
344# Indic_Syllabic_Category=Reordering_Killer
345
346# Reordering killer (kills inherent vowel of consonant; always visible;
347# may cause consonant reordering)
348
349# [Not derivable]
350
3511BF2..1BF3 ; Reordering_Killer # Mc [2] BATAK PANGOLAT..BATAK PANONGONAN
327 352
328# ================================================ 353# ================================================
329 354
330# Indic_Syllabic_Category=Invisible_Stacker 355# Indic_Syllabic_Category=Invisible_Stacker
331 356
332# Invisible stacker (invisible consonant stacker virama). 357# Invisible stacker (usually kills inherent vowel of consonant; is not visible
358# by itself; causes conjunct formation or consonant
359# stacking)
333# 360#
334# Note that in some scripts, such as Kharoshthi and Masaram Gondi, an invisible 361# Note that in some scripts, such as Kharoshthi and Masaram Gondi, an invisible
335# stacker may have a second function, changing the shape and/or location of the 362# stacker may have a second function, changing the shape and/or location of the
@@ -345,6 +372,7 @@ ABED ; Pure_Killer # Mn MEETEI MAYEK APUN IYEK
345AAF6 ; Invisible_Stacker # Mn MEETEI MAYEK VIRAMA 372AAF6 ; Invisible_Stacker # Mn MEETEI MAYEK VIRAMA
34610A3F ; Invisible_Stacker # Mn KHAROSHTHI VIRAMA 37310A3F ; Invisible_Stacker # Mn KHAROSHTHI VIRAMA
34711133 ; Invisible_Stacker # Mn CHAKMA VIRAMA 37411133 ; Invisible_Stacker # Mn CHAKMA VIRAMA
375113D0 ; Invisible_Stacker # Mn TULU-TIGALARI CONJOINER
3481193E ; Invisible_Stacker # Mn DIVES AKURU VIRAMA 3761193E ; Invisible_Stacker # Mn DIVES AKURU VIRAMA
34911A47 ; Invisible_Stacker # Mn ZANABAZAR SQUARE SUBJOINER 37711A47 ; Invisible_Stacker # Mn ZANABAZAR SQUARE SUBJOINER
35011A99 ; Invisible_Stacker # Mn SOYOMBO SUBJOINER 37811A99 ; Invisible_Stacker # Mn SOYOMBO SUBJOINER
@@ -428,6 +456,10 @@ ABD1 ; Vowel_Independent # Lo MEETEI MAYEK LETTER ATIYA
4281130F..11310 ; Vowel_Independent # Lo [2] GRANTHA LETTER EE..GRANTHA LETTER AI 4561130F..11310 ; Vowel_Independent # Lo [2] GRANTHA LETTER EE..GRANTHA LETTER AI
42911313..11314 ; Vowel_Independent # Lo [2] GRANTHA LETTER OO..GRANTHA LETTER AU 45711313..11314 ; Vowel_Independent # Lo [2] GRANTHA LETTER OO..GRANTHA LETTER AU
43011360..11361 ; Vowel_Independent # Lo [2] GRANTHA LETTER VOCALIC RR..GRANTHA LETTER VOCALIC LL 45811360..11361 ; Vowel_Independent # Lo [2] GRANTHA LETTER VOCALIC RR..GRANTHA LETTER VOCALIC LL
45911380..11389 ; Vowel_Independent # Lo [10] TULU-TIGALARI LETTER A..TULU-TIGALARI LETTER VOCALIC LL
4601138B ; Vowel_Independent # Lo TULU-TIGALARI LETTER EE
4611138E ; Vowel_Independent # Lo TULU-TIGALARI LETTER AI
46211390..11391 ; Vowel_Independent # Lo [2] TULU-TIGALARI LETTER OO..TULU-TIGALARI LETTER AU
43111400..1140D ; Vowel_Independent # Lo [14] NEWA LETTER A..NEWA LETTER AU 46311400..1140D ; Vowel_Independent # Lo [14] NEWA LETTER A..NEWA LETTER AU
43211481..1148E ; Vowel_Independent # Lo [14] TIRHUTA LETTER A..TIRHUTA LETTER AU 46411481..1148E ; Vowel_Independent # Lo [14] TIRHUTA LETTER A..TIRHUTA LETTER AU
43311580..1158D ; Vowel_Independent # Lo [14] SIDDHAM LETTER A..SIDDHAM LETTER AU 46511580..1158D ; Vowel_Independent # Lo [14] SIDDHAM LETTER A..SIDDHAM LETTER AU
@@ -450,6 +482,7 @@ ABD1 ; Vowel_Independent # Lo MEETEI MAYEK LETTER ATIYA
45011D67..11D68 ; Vowel_Independent # Lo [2] GUNJALA GONDI LETTER EE..GUNJALA GONDI LETTER AI 48211D67..11D68 ; Vowel_Independent # Lo [2] GUNJALA GONDI LETTER EE..GUNJALA GONDI LETTER AI
45111D6A..11D6B ; Vowel_Independent # Lo [2] GUNJALA GONDI LETTER OO..GUNJALA GONDI LETTER AU 48311D6A..11D6B ; Vowel_Independent # Lo [2] GUNJALA GONDI LETTER OO..GUNJALA GONDI LETTER AU
45211F04..11F10 ; Vowel_Independent # Lo [13] KAWI LETTER A..KAWI LETTER O 48411F04..11F10 ; Vowel_Independent # Lo [13] KAWI LETTER A..KAWI LETTER O
48516100 ; Vowel_Independent # Lo GURUNG KHEMA LETTER A
453 486
454# ================================================ 487# ================================================
455 488
@@ -655,6 +688,11 @@ ABE9..ABEA ; Vowel_Dependent # Mc [2] MEETEI MAYEK VOWEL SIGN CHEINAP..MEET
6551134B..1134C ; Vowel_Dependent # Mc [2] GRANTHA VOWEL SIGN OO..GRANTHA VOWEL SIGN AU 6881134B..1134C ; Vowel_Dependent # Mc [2] GRANTHA VOWEL SIGN OO..GRANTHA VOWEL SIGN AU
65611357 ; Vowel_Dependent # Mc GRANTHA AU LENGTH MARK 68911357 ; Vowel_Dependent # Mc GRANTHA AU LENGTH MARK
65711362..11363 ; Vowel_Dependent # Mc [2] GRANTHA VOWEL SIGN VOCALIC L..GRANTHA VOWEL SIGN VOCALIC LL 69011362..11363 ; Vowel_Dependent # Mc [2] GRANTHA VOWEL SIGN VOCALIC L..GRANTHA VOWEL SIGN VOCALIC LL
691113B8..113BA ; Vowel_Dependent # Mc [3] TULU-TIGALARI VOWEL SIGN AA..TULU-TIGALARI VOWEL SIGN II
692113BB..113C0 ; Vowel_Dependent # Mn [6] TULU-TIGALARI VOWEL SIGN U..TULU-TIGALARI VOWEL SIGN VOCALIC LL
693113C2 ; Vowel_Dependent # Mc TULU-TIGALARI VOWEL SIGN EE
694113C5 ; Vowel_Dependent # Mc TULU-TIGALARI VOWEL SIGN AI
695113C7..113C9 ; Vowel_Dependent # Mc [3] TULU-TIGALARI VOWEL SIGN OO..TULU-TIGALARI AU LENGTH MARK
65811435..11437 ; Vowel_Dependent # Mc [3] NEWA VOWEL SIGN AA..NEWA VOWEL SIGN II 69611435..11437 ; Vowel_Dependent # Mc [3] NEWA VOWEL SIGN AA..NEWA VOWEL SIGN II
65911438..1143F ; Vowel_Dependent # Mn [8] NEWA VOWEL SIGN U..NEWA VOWEL SIGN AI 69711438..1143F ; Vowel_Dependent # Mn [8] NEWA VOWEL SIGN U..NEWA VOWEL SIGN AI
66011440..11441 ; Vowel_Dependent # Mc [2] NEWA VOWEL SIGN O..NEWA VOWEL SIGN AU 69811440..11441 ; Vowel_Dependent # Mc [2] NEWA VOWEL SIGN O..NEWA VOWEL SIGN AU
@@ -712,6 +750,8 @@ ABE9..ABEA ; Vowel_Dependent # Mc [2] MEETEI MAYEK VOWEL SIGN CHEINAP..MEET
71211F36..11F3A ; Vowel_Dependent # Mn [5] KAWI VOWEL SIGN I..KAWI VOWEL SIGN VOCALIC R 75011F36..11F3A ; Vowel_Dependent # Mn [5] KAWI VOWEL SIGN I..KAWI VOWEL SIGN VOCALIC R
71311F3E..11F3F ; Vowel_Dependent # Mc [2] KAWI VOWEL SIGN E..KAWI VOWEL SIGN AI 75111F3E..11F3F ; Vowel_Dependent # Mc [2] KAWI VOWEL SIGN E..KAWI VOWEL SIGN AI
71411F40 ; Vowel_Dependent # Mn KAWI VOWEL SIGN EU 75211F40 ; Vowel_Dependent # Mn KAWI VOWEL SIGN EU
7531611E..16129 ; Vowel_Dependent # Mn [12] GURUNG KHEMA VOWEL SIGN AA..GURUNG KHEMA VOWEL LENGTH MARK
75416D63..16D6A ; Vowel_Dependent # Lo [8] KIRAT RAI VOWEL SIGN AA..KIRAT RAI VOWEL SIGN AU
715 755
716# ================================================ 756# ================================================
717 757
@@ -901,6 +941,7 @@ ABD2..ABDA ; Consonant # Lo [9] MEETEI MAYEK LETTER GOK..MEETEI MAYEK LETTE
9011132A..11330 ; Consonant # Lo [7] GRANTHA LETTER PA..GRANTHA LETTER RA 9411132A..11330 ; Consonant # Lo [7] GRANTHA LETTER PA..GRANTHA LETTER RA
90211332..11333 ; Consonant # Lo [2] GRANTHA LETTER LA..GRANTHA LETTER LLA 94211332..11333 ; Consonant # Lo [2] GRANTHA LETTER LA..GRANTHA LETTER LLA
90311335..11339 ; Consonant # Lo [5] GRANTHA LETTER VA..GRANTHA LETTER HA 94311335..11339 ; Consonant # Lo [5] GRANTHA LETTER VA..GRANTHA LETTER HA
94411392..113B5 ; Consonant # Lo [36] TULU-TIGALARI LETTER KA..TULU-TIGALARI LETTER LLLA
9041140E..11434 ; Consonant # Lo [39] NEWA LETTER KA..NEWA LETTER HA 9451140E..11434 ; Consonant # Lo [39] NEWA LETTER KA..NEWA LETTER HA
9051148F..114AF ; Consonant # Lo [33] TIRHUTA LETTER KA..TIRHUTA LETTER HA 9461148F..114AF ; Consonant # Lo [33] TIRHUTA LETTER KA..TIRHUTA LETTER HA
9061158E..115AE ; Consonant # Lo [33] SIDDHAM LETTER KA..SIDDHAM LETTER HA 9471158E..115AE ; Consonant # Lo [33] SIDDHAM LETTER KA..SIDDHAM LETTER HA
@@ -922,6 +963,8 @@ ABD2..ABDA ; Consonant # Lo [9] MEETEI MAYEK LETTER GOK..MEETEI MAYEK LETTE
92211D6C..11D89 ; Consonant # Lo [30] GUNJALA GONDI LETTER YA..GUNJALA GONDI LETTER SA 96311D6C..11D89 ; Consonant # Lo [30] GUNJALA GONDI LETTER YA..GUNJALA GONDI LETTER SA
92311EE0..11EF1 ; Consonant # Lo [18] MAKASAR LETTER KA..MAKASAR LETTER A 96411EE0..11EF1 ; Consonant # Lo [18] MAKASAR LETTER KA..MAKASAR LETTER A
92411F12..11F33 ; Consonant # Lo [34] KAWI LETTER KA..KAWI LETTER JNYA 96511F12..11F33 ; Consonant # Lo [34] KAWI LETTER KA..KAWI LETTER JNYA
96616101..1611D ; Consonant # Lo [29] GURUNG KHEMA LETTER KA..GURUNG KHEMA LETTER SA
96716D43..16D62 ; Consonant # Lo [32] KIRAT RAI LETTER A..KIRAT RAI LETTER HA
925 968
926# ================================================ 969# ================================================
927 970
@@ -975,6 +1018,7 @@ ABD2..ABDA ; Consonant # Lo [9] MEETEI MAYEK LETTER GOK..MEETEI MAYEK LETTE
975# [Not derivable] 1018# [Not derivable]
976 1019
9770D4E ; Consonant_Preceding_Repha # Lo MALAYALAM LETTER DOT REPH 10200D4E ; Consonant_Preceding_Repha # Lo MALAYALAM LETTER DOT REPH
1021113D1 ; Consonant_Preceding_Repha # Lo TULU-TIGALARI REPHA
97811941 ; Consonant_Preceding_Repha # Lo DIVES AKURU INITIAL RA 102211941 ; Consonant_Preceding_Repha # Lo DIVES AKURU INITIAL RA
97911D46 ; Consonant_Preceding_Repha # Lo MASARAM GONDI REPHA 102311D46 ; Consonant_Preceding_Repha # Lo MASARAM GONDI REPHA
98011F02 ; Consonant_Preceding_Repha # Lo KAWI SIGN REPHA 102411F02 ; Consonant_Preceding_Repha # Lo KAWI SIGN REPHA
@@ -1046,11 +1090,15 @@ A9BD ; Consonant_Medial # Mn JAVANESE CONSONANT SIGN KERET
1046A9BE..A9BF ; Consonant_Medial # Mc [2] JAVANESE CONSONANT SIGN PENGKAL..JAVANESE CONSONANT SIGN CAKRA 1090A9BE..A9BF ; Consonant_Medial # Mc [2] JAVANESE CONSONANT SIGN PENGKAL..JAVANESE CONSONANT SIGN CAKRA
1047AA33..AA34 ; Consonant_Medial # Mc [2] CHAM CONSONANT SIGN YA..CHAM CONSONANT SIGN RA 1091AA33..AA34 ; Consonant_Medial # Mc [2] CHAM CONSONANT SIGN YA..CHAM CONSONANT SIGN RA
1048AA35..AA36 ; Consonant_Medial # Mn [2] CHAM CONSONANT SIGN LA..CHAM CONSONANT SIGN WA 1092AA35..AA36 ; Consonant_Medial # Mn [2] CHAM CONSONANT SIGN LA..CHAM CONSONANT SIGN WA
10491171D..1171F ; Consonant_Medial # Mn [3] AHOM CONSONANT SIGN MEDIAL LA..AHOM CONSONANT SIGN MEDIAL LIGATING RA 10931171D ; Consonant_Medial # Mn AHOM CONSONANT SIGN MEDIAL LA
10941171E ; Consonant_Medial # Mc AHOM CONSONANT SIGN MEDIAL RA
10951171F ; Consonant_Medial # Mn AHOM CONSONANT SIGN MEDIAL LIGATING RA
105011940 ; Consonant_Medial # Mc DIVES AKURU MEDIAL YA 109611940 ; Consonant_Medial # Mc DIVES AKURU MEDIAL YA
105111942 ; Consonant_Medial # Mc DIVES AKURU MEDIAL RA 109711942 ; Consonant_Medial # Mc DIVES AKURU MEDIAL RA
105211A3B..11A3E ; Consonant_Medial # Mn [4] ZANABAZAR SQUARE CLUSTER-FINAL LETTER YA..ZANABAZAR SQUARE CLUSTER-FINAL LETTER VA 109811A3B..11A3E ; Consonant_Medial # Mn [4] ZANABAZAR SQUARE CLUSTER-FINAL LETTER YA..ZANABAZAR SQUARE CLUSTER-FINAL LETTER VA
105311D47 ; Consonant_Medial # Mn MASARAM GONDI RA-KARA 109911D47 ; Consonant_Medial # Mn MASARAM GONDI RA-KARA
11001612A..1612C ; Consonant_Medial # Mc [3] GURUNG KHEMA CONSONANT SIGN MEDIAL YA..GURUNG KHEMA CONSONANT SIGN MEDIAL HA
11011612E ; Consonant_Medial # Mn GURUNG KHEMA CONSONANT SIGN MEDIAL RA
1054 1102
1055# ================================================ 1103# ================================================
1056 1104
@@ -1156,6 +1204,7 @@ ABEC ; Tone_Mark # Mc MEETEI MAYEK LUM IYEK
11560A71 ; Gemination_Mark # Mn GURMUKHI ADDAK 12040A71 ; Gemination_Mark # Mn GURMUKHI ADDAK
11570AFB ; Gemination_Mark # Mn GUJARATI SIGN SHADDA 12050AFB ; Gemination_Mark # Mn GUJARATI SIGN SHADDA
115811237 ; Gemination_Mark # Mn KHOJKI SIGN SHADDA 120611237 ; Gemination_Mark # Mn KHOJKI SIGN SHADDA
1207113D2 ; Gemination_Mark # Mn TULU-TIGALARI GEMINATION MARK
115911A98 ; Gemination_Mark # Mn SOYOMBO GEMINATION MARK 120811A98 ; Gemination_Mark # Mn SOYOMBO GEMINATION MARK
1160 1209
1161# ================================================ 1210# ================================================
@@ -1181,6 +1230,7 @@ A8E0..A8F1 ; Cantillation_Mark # Mn [18] COMBINING DEVANAGARI DIGIT ZERO..CO
11811123E ; Cantillation_Mark # Mn KHOJKI SIGN SUKUN 12301123E ; Cantillation_Mark # Mn KHOJKI SIGN SUKUN
118211366..1136C ; Cantillation_Mark # Mn [7] COMBINING GRANTHA DIGIT ZERO..COMBINING GRANTHA DIGIT SIX 123111366..1136C ; Cantillation_Mark # Mn [7] COMBINING GRANTHA DIGIT ZERO..COMBINING GRANTHA DIGIT SIX
118311370..11374 ; Cantillation_Mark # Mn [5] COMBINING GRANTHA LETTER A..COMBINING GRANTHA LETTER PA 123211370..11374 ; Cantillation_Mark # Mn [5] COMBINING GRANTHA LETTER A..COMBINING GRANTHA LETTER PA
1233113E1..113E2 ; Cantillation_Mark # Mn [2] TULU-TIGALARI VEDIC TONE SVARITA..TULU-TIGALARI VEDIC TONE ANUDATTA
1184 1234
1185# ================================================ 1235# ================================================
1186 1236
@@ -1318,6 +1368,7 @@ ABF0..ABF9 ; Number # Nd [10] MEETEI MAYEK DIGIT ZERO..MEETEI MAYEK DIGIT NI
1318114D0..114D9 ; Number # Nd [10] TIRHUTA DIGIT ZERO..TIRHUTA DIGIT NINE 1368114D0..114D9 ; Number # Nd [10] TIRHUTA DIGIT ZERO..TIRHUTA DIGIT NINE
131911650..11659 ; Number # Nd [10] MODI DIGIT ZERO..MODI DIGIT NINE 136911650..11659 ; Number # Nd [10] MODI DIGIT ZERO..MODI DIGIT NINE
1320116C0..116C9 ; Number # Nd [10] TAKRI DIGIT ZERO..TAKRI DIGIT NINE 1370116C0..116C9 ; Number # Nd [10] TAKRI DIGIT ZERO..TAKRI DIGIT NINE
1371116D0..116E3 ; Number # Nd [20] MYANMAR PAO DIGIT ZERO..MYANMAR EASTERN PWO KAREN DIGIT NINE
132111730..11739 ; Number # Nd [10] AHOM DIGIT ZERO..AHOM DIGIT NINE 137211730..11739 ; Number # Nd [10] AHOM DIGIT ZERO..AHOM DIGIT NINE
13221173A..1173B ; Number # No [2] AHOM NUMBER TEN..AHOM NUMBER TWENTY 13731173A..1173B ; Number # No [2] AHOM NUMBER TEN..AHOM NUMBER TWENTY
132311950..11959 ; Number # Nd [10] DIVES AKURU DIGIT ZERO..DIVES AKURU DIGIT NINE 137411950..11959 ; Number # Nd [10] DIVES AKURU DIGIT ZERO..DIVES AKURU DIGIT NINE
@@ -1326,6 +1377,8 @@ ABF0..ABF9 ; Number # Nd [10] MEETEI MAYEK DIGIT ZERO..MEETEI MAYEK DIGIT NI
132611D50..11D59 ; Number # Nd [10] MASARAM GONDI DIGIT ZERO..MASARAM GONDI DIGIT NINE 137711D50..11D59 ; Number # Nd [10] MASARAM GONDI DIGIT ZERO..MASARAM GONDI DIGIT NINE
132711DA0..11DA9 ; Number # Nd [10] GUNJALA GONDI DIGIT ZERO..GUNJALA GONDI DIGIT NINE 137811DA0..11DA9 ; Number # Nd [10] GUNJALA GONDI DIGIT ZERO..GUNJALA GONDI DIGIT NINE
132811F50..11F59 ; Number # Nd [10] KAWI DIGIT ZERO..KAWI DIGIT NINE 137911F50..11F59 ; Number # Nd [10] KAWI DIGIT ZERO..KAWI DIGIT NINE
138016130..16139 ; Number # Nd [10] GURUNG KHEMA DIGIT ZERO..GURUNG KHEMA DIGIT NINE
138116D70..16D79 ; Number # Nd [10] KIRAT RAI DIGIT ZERO..KIRAT RAI DIGIT NINE
1329 1382
1330# ================================================ 1383# ================================================
1331 1384
@@ -1335,7 +1388,7 @@ ABF0..ABF9 ; Number # Nd [10] MEETEI MAYEK DIGIT ZERO..MEETEI MAYEK DIGIT NI
1335# script, e.g. in Brahmi) 1388# script, e.g. in Brahmi)
1336# 1389#
1337# Note: These are different from Numbers, in the way that there is no known 1390# Note: These are different from Numbers, in the way that there is no known
1338# evidence of Brahmi Joining Numbers taking vowels or subjoined consonants. 1391# evidence of Brahmi Joining Numbers taking vowels or subjoined consonants.
1339# Until such evidence is found, implementations may assume that Brahmi 1392# Until such evidence is found, implementations may assume that Brahmi
1340# Joining Numbers only participate in shaping with other Brahmi Joining 1393# Joining Numbers only participate in shaping with other Brahmi Joining
1341# Numbers. 1394# Numbers.