| | Commit message | Age | Files | Lines | |
|---|---|---|---|---|---|
| * | Add WordBreakPropertyData | 2025-05-15 | 1 | -0/+146 | |
| | Passes some simple lookup tests. | | | | |
| * | Unicode 16.0 | 2025-04-30 | 1 | -0/+7 | |
| | Went smoothly, needed to add some scripts and adjust the magic numbers, but other than that, all set. | | | | |
| * | Add c0 and c1 control width options | 2025-03-20 | 1 | -8/+8 | |
| | This allows a build of DisplayWidth to give characters in those classes a width, for cases where they'll be printed with a substitute in the final display. It also raises the size of possible characters from an i3 to an i4, to accommodate printing C1s as e.g. `<80>` or `\u{80}`. | | | | |
| * | FoldData: Minimize Changes_When_Casefolded data | 2024-06-27 | 1 | -3/+18 | |
| | Only a few codepoints have a mapping in CaseFolding.txt but do not have the Changes_When_Casefolded property set. So, FoldData can just store a list of those particular codepoints and then re-use the encoded CaseFolding.txt data alongside it in order to implement changesWhenCaseFolded. This reduces the size of fold.bin.z from 4,387 bytes (4.28KiB) to 1,165 bytes (1.13KiB). This also seemingly introduced a very slight performance regression in zg_caseless. Before: zg CaseFold.compatCaselessMatch: result: 626, took: 258ns; zg CaseFold.canonCaselessMatch: result: 626, took: 129ns. After: zg CaseFold.compatCaselessMatch: result: 626, took: 263ns; zg CaseFold.canonCaselessMatch: result: 626, took: 131ns. | | | | |
| * | Implements new case fold data encoding by @sqeek502 #8 | 2024-06-26 | 1 | -5/+6 | |
| | | |||||
| * | Added changes when casefolded back | 2024-06-26 | 1 | -31/+49 | |
| | | |||||
| * | Implemented sqeek502s case fold | 2024-06-26 | 1 | -97/+192 | |
| | | |||||
| * | Updated build.zig b.path | 2024-05-19 | 1 | -3/+0 | |
| | | |||||
| * | Removed emoji modifier display width = 0 case. | 2024-05-03 | 1 | -1/+1 | |
| | | |||||
| * | Updated README | 2024-03-31 | 16 | -48/+32 | |
| | | |||||
| * | PropsData and errdefers for init fns | 2024-03-28 | 2 | -0/+275 | |
| | | |||||
| * | ScriptsData and made all Datas const | 2024-03-27 | 1 | -0/+309 | |
| | | |||||
| * | Friendly general category methods | 2024-03-27 | 1 | -30/+30 | |
| | | |||||
| * | Using diff for lowercase mapping | 2024-03-26 | 1 | -7/+7 | |
| | | |||||
| * | Using diff for uppercase mapping | 2024-03-26 | 1 | -7/+7 | |
| | | |||||
| * | Removed title case processing | 2024-03-26 | 1 | -58/+0 | |
| | | |||||
| * | CaseData | 2024-03-25 | 5 | -25/+330 | |
| | | |||||
| * | NumericData | 2024-03-24 | 1 | -0/+141 | |
| | | |||||
| * | Moved case fold stuff to src/Caser.zig | 2024-03-01 | 6 | -6/+6 | |
| | | |||||
| * | Changes when case folded check; 20ms faster | 2024-03-01 | 1 | -3/+51 | |
| | | |||||
| * | Normalizer.eqlIgnoreCase compatibility caseless matching | 2024-03-01 | 1 | -0/+76 | |
| | | |||||
| * | General Category with GenCatData | 2024-02-27 | 1 | -0/+172 | |
| | | |||||
| * | Using NormPropsData in NormData; No Ziglyph deps in Normalizer | 2024-02-27 | 1 | -0/+135 | |
| | | |||||
| * | Using HangulData in NormData | 2024-02-27 | 1 | -0/+134 | |
| | | |||||
| * | Using NormData nfkd | 2024-02-26 | 1 | -0/+65 | |
| | | |||||
| * | Using NormData and CanonData in Normalizer | 2024-02-26 | 1 | -0/+68 | |
| | | |||||
| * | Using separate data struct model. | 2024-02-26 | 3 | -76/+40 | |
| | | |||||
| * | Replaced ccc_map with table. 20ms faster | 2024-02-20 | 1 | -0/+128 | |
| | | |||||
| * | Cleaned up directory structure | 2024-02-19 | 2 | -5/+5 | |
| | | |||||
| * | Using argsWithAllocator for Windows | 2024-02-18 | 2 | -2/+4 | |
| | | |||||
| * | Added cjk build option to set ambiguous width to 2 at build time | 2024-02-18 | 1 | -1/+6 | |
| | | |||||
| * | display_width with table | 2024-02-16 | 1 | -0/+243 | |
| | | |||||
| * | Removed codegen/indic.zig | 2024-02-15 | 1 | -164/+0 | |
| | | |||||
| * | New single byte props table | 2024-02-15 | 3 | -236/+280 | |
| | | |||||
| * | emoji direct array access | 2024-02-14 | 1 | -13/+2 | |
| | | |||||
| * | gbp and indic direct array access | 2024-02-14 | 2 | -84/+8 | |
| | | |||||
| * | Passing Unicode 15.1.0 Grapheme Break Tests | 2024-02-14 | 1 | -0/+188 | |
| | | |||||
| * | Code reorg; Added UCD | 2024-02-14 | 2 | -0/+299 | |