Skip to content

Commit 7d736b1

Browse files
authored
Iŋliʃ fɷnotipic ɑlfɑbet (#1035)
* UnicodeData.txt lines from L2/24-277 * Typos in the UnicodeData.txt lines * Another typo * lb assignments according to the proposal, note lb=CL rather than lb=CP for U+2E63. * Latin letters, Common punctuation * Regenerate UCD * Failing test for the case pairs * Another typo in the UnicodeData lines * up to block * Regenerate UCD * Test the unpaired lowercase letters * Test ɷ * failing test for the parentheses * bpb bmg * Failing test for the exclamation marks * Terminal wigglies * Regenerate UCD * Test passes * Lo and compare them to ꟻ * Regenerate UCD * Ignore Block * More apt (and failing) comparison for the i-like letters * Soften the dots * Regenerate UCD * Ignore 𝼚’s Do_Not_Emit sequence * traces * more traces… * meow * meow? * moo * CI is haunted * … * Revert traces Revert "CI is haunted" This reverts commit 6e9b5f5. Revert "moo" This reverts commit 2eec56c. Revert "meow?" This reverts commit 6bff11e. Revert "meow" This reverts commit 35fe8e8. Revert "more traces…" This reverts commit 8a8d5be. Revert "traces" This reverts commit f88af9a.
1 parent 2845fea commit 7d736b1

23 files changed

+495
-96
lines changed

unicodetools/data/ucd/dev/BidiBrackets.txt

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -155,6 +155,8 @@
155155
2E5A; 2E59; c # TOP HALF RIGHT PARENTHESIS
156156
2E5B; 2E5C; o # BOTTOM HALF LEFT PARENTHESIS
157157
2E5C; 2E5B; c # BOTTOM HALF RIGHT PARENTHESIS
158+
2E62; 2E63; o # LEFT PARENTHESIS WITH MIDDLE RING
159+
2E63; 2E62; c # RIGHT PARENTHESIS WITH MIDDLE RING
158160
3008; 3009; o # LEFT ANGLE BRACKET
159161
3009; 3008; c # RIGHT ANGLE BRACKET
160162
300A; 300B; o # LEFT DOUBLE ANGLE BRACKET

unicodetools/data/ucd/dev/BidiMirroring.txt

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -463,6 +463,8 @@
463463
2E5A; 2E59 # TOP HALF RIGHT PARENTHESIS
464464
2E5B; 2E5C # BOTTOM HALF LEFT PARENTHESIS
465465
2E5C; 2E5B # BOTTOM HALF RIGHT PARENTHESIS
466+
2E62; 2E63 # LEFT PARENTHESIS WITH MIDDLE RING
467+
2E63; 2E62 # RIGHT PARENTHESIS WITH MIDDLE RING
466468
3008; 3009 # LEFT ANGLE BRACKET
467469
3009; 3008 # RIGHT ANGLE BRACKET
468470
300A; 300B # LEFT DOUBLE ANGLE BRACKET

unicodetools/data/ucd/dev/CaseFolding.txt

Lines changed: 13 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# CaseFolding-18.0.0.txt
2-
# Date: 2025-11-28, 16:57:47 GMT
2+
# Date: 2025-11-29, 01:41:28 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -1251,6 +1251,7 @@ A7D6; C; A7D7; # LATIN CAPITAL LETTER MIDDLE SCOTS S
12511251
A7D8; C; A7D9; # LATIN CAPITAL LETTER SIGMOID S
12521252
A7DA; C; A7DB; # LATIN CAPITAL LETTER LAMBDA
12531253
A7DC; C; 019B; # LATIN CAPITAL LETTER LAMBDA WITH STROKE
1254+
A7DD; C; 0277; # LATIN CAPITAL LETTER CLOSED OMEGA
12541255
A7E2; C; 027C; # LATIN CAPITAL LETTER R WITH LONG LEG
12551256
A7F5; C; A7F6; # LATIN CAPITAL LETTER REVERSED HALF H
12561257
AB6C; C; AB4B; # LATIN CAPITAL LETTER SCRIPT R
@@ -1652,6 +1653,17 @@ FF3A; C; FF5A; # FULLWIDTH LATIN CAPITAL LETTER Z
16521653
1DF4A; C; 1DF4B; # LATIN CAPITAL LETTER BARRED M
16531654
1DF4D; C; 1DF4E; # LATIN CAPITAL LETTER BARRED N
16541655
1DF51; C; 1DF52; # LATIN CAPITAL LETTER BARRED V
1656+
1DF68; C; 1DF69; # LATIN CAPITAL LETTER PHONOTYPIC A WITH SWASH
1657+
1DF6A; C; 1DF6B; # LATIN CAPITAL LETTER PHONOTYPIC ROUNDTOP A
1658+
1DF6C; C; 1DF6D; # LATIN CAPITAL LETTER REVERSED SCRUPLE
1659+
1DF6E; C; 1DF6F; # LATIN CAPITAL LETTER PHONOTYPIC DIPHTHONG AI
1660+
1DF72; C; 1DF73; # LATIN CAPITAL LETTER O WITH CURL
1661+
1DF74; C; 1DF75; # LATIN CAPITAL LETTER CLOSED OMEGA WITH LONG STEM
1662+
1DF76; C; 1DF77; # LATIN CAPITAL LETTER TURNED CLOSED OMEGA
1663+
1DF78; C; 1DF79; # LATIN CAPITAL LETTER PHONOTYPIC TH
1664+
1DF7A; C; 1DF7B; # LATIN CAPITAL LETTER U WITH HOOK TAIL
1665+
1DF7C; C; 1DF7D; # LATIN CAPITAL LETTER U WITH NOTCH AT BOTTOM
1666+
1DF7E; C; 1DF7F; # LATIN CAPITAL LETTER REVERSED ENLARGED SMALL U
16551667
1E900; C; 1E922; # ADLAM CAPITAL LETTER ALIF
16561668
1E901; C; 1E923; # ADLAM CAPITAL LETTER DAALI
16571669
1E902; C; 1E924; # ADLAM CAPITAL LETTER LAAM

unicodetools/data/ucd/dev/DerivedAge.txt

Lines changed: 5 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# DerivedAge-18.0.0.txt
2-
# Date: 2025-11-28, 23:54:50 GMT
2+
# Date: 2025-11-29, 01:41:29 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -2131,6 +2131,8 @@ FDC8..FDCE ; 17.0 # [7] ARABIC LIGATURE RAHIMAHU ALLAAH TAAALAA..ARABIC LIG
21312131
208F ; 18.0 # MODIFIER LETTER HIGH AND LOW VERTICAL LINE
21322132
209D..209F ; 18.0 # [3] LATIN SUBSCRIPT SMALL LETTER W..LATIN SUBSCRIPT SMALL LETTER Z
21332133
20C2..20C3 ; 18.0 # [2] RUFIYAA SIGN..UAE DIRHAM SIGN
2134+
2E60..2E63 ; 18.0 # [4] WIGGLY EXCLAMATION MARK..RIGHT PARENTHESIS WITH MIDDLE RING
2135+
A7DD ; 18.0 # LATIN CAPITAL LETTER CLOSED OMEGA
21342136
A7E2 ; 18.0 # LATIN CAPITAL LETTER R WITH LONG LEG
21352137
AB6C..AB6D ; 18.0 # [2] LATIN CAPITAL LETTER SCRIPT R..LATIN CAPITAL LETTER SCRIPT R WITH RING
21362138
107BB..107BF ; 18.0 # [5] MODIFIER LETTER SMALL TURNED T..MODIFIER LETTER SMALL ESH WITH DOUBLE BAR
@@ -2150,12 +2152,13 @@ AB6C..AB6D ; 18.0 # [2] LATIN CAPITAL LETTER SCRIPT R..LATIN CAPITAL LETTER
21502152
1B168 ; 18.0 # KATAKANA LETTER SMALL ARCHAIC YE
21512153
1DF1F..1DF24 ; 18.0 # [6] LATIN SMALL LETTER D-ETH DIGRAPH..LATIN SMALL LETTER T-THETA DIGRAPH
21522154
1DF2B..1DF59 ; 18.0 # [47] LATIN SMALL LETTER DEZH DIGRAPH WITH CURL..LATIN SMALL LETTER SPLIT U
2155+
1DF68..1DF81 ; 18.0 # [26] LATIN CAPITAL LETTER PHONOTYPIC A WITH SWASH..LATIN CAPITAL LETTER E WITH BENT TOPBAR
21532156
1DFCD..1DFFF ; 18.0 # [51] MODIFIER LETTER SMALL TURNED R WITH MID-HEIGHT LEFT HOOK..MODIFIER LETTER SMALL T WITH HOOK AND RETROFLEX HOOK
21542157
1F7DB ; 18.0 # BULLET IN DOUBLE CIRCLE
21552158
1F7F1..1F7FF ; 18.0 # [15] CIRCLE WITH DOUBLE VERTICAL AND HORIZONTAL LINE..RHOMBUS
21562159
2B81E ; 18.0 # CJK UNIFIED IDEOGRAPH-2B81E
21572160
3D000..3FC3F ; 18.0 # [11328] SEAL CHARACTER-3D000..SEAL CHARACTER-3FC3F
21582161

2159-
# Total code points: 12841
2162+
# Total code points: 12872
21602163

21612164
# EOF

0 commit comments

Comments
 (0)