path: root/codegen
Commit message | Author | Age | Files | Lines
* Add WordBreakPropertyData | Sam Atman | 2025-05-15 | 1 | -0/+146
    Passes some simple lookup tests.
* Unicode 16.0 | Sam Atman | 2025-04-30 | 1 | -0/+7
    Went smoothly, needed to add some scripts and adjust the magic numbers, but other than that, all set.
* Add c0 and c1 control width options | Sam Atman | 2025-03-20 | 1 | -8/+8
    This allows a build of DisplayWidth to give characters in those classes a width, for cases where they'll be printed with a substitute in the final display. It also raises the size of possible characters from an i3 to an i4, to accommodate printing C1s as e.g. <80> or \u{80}.
* FoldData: Minimize Changes_When_Casefolded data | Ryan Liptak | 2024-06-27 | 1 | -3/+18
    Only a few codepoints have a mapping in CaseFolding.txt but do not have the Changes_When_Casefolded property set. So, FoldData can just store a list of those particular codepoints and then re-use the encoded CaseFolding.txt data alongside it in order to implement changesWhenCaseFolded.

    This reduces the size of fold.bin.z from 4,387 bytes (4.28KiB) to 1,165 bytes (1.13KiB).

    This also seemingly introduced a very slight performance regression in zg_caseless.

    Before:
      zg CaseFold.compatCaselessMatch: result: 626, took: 258ns
      zg CaseFold.canonCaselessMatch: result: 626, took: 129ns

    After:
      zg CaseFold.compatCaselessMatch: result: 626, took: 263ns
      zg CaseFold.canonCaselessMatch: result: 626, took: 131ns
* Implements new case fold data encoding by @sqeek502 #8 | Jose Colon Rodriguez | 2024-06-26 | 1 | -5/+6
* Added changes when casefolded back | Jose Colon Rodriguez | 2024-06-26 | 1 | -31/+49
* Implemented sqeek502s case fold | Jose Colon Rodriguez | 2024-06-26 | 1 | -97/+192
* Updated build.zig b.path | Jose Colon Rodriguez | 2024-05-19 | 1 | -3/+0
* Removed emoji modifier display width = 0 case. | Jose Colon Rodriguez | 2024-05-03 | 1 | -1/+1
* Updated README | Jose Colon Rodriguez | 2024-03-31 | 16 | -48/+32
* PropsData and errdefers for init fns | Jose Colon Rodriguez | 2024-03-28 | 2 | -0/+275
* ScriptsData and made all Datas const | Jose Colon Rodriguez | 2024-03-27 | 1 | -0/+309
* Friendly general category methods | Jose Colon Rodriguez | 2024-03-27 | 1 | -30/+30
* Using diff for lowercase mapping | Jose Colon Rodriguez | 2024-03-26 | 1 | -7/+7
* Using diff for uppercase mapping | Jose Colon Rodriguez | 2024-03-26 | 1 | -7/+7
* Removed title case processing | Jose Colon Rodriguez | 2024-03-26 | 1 | -58/+0
* CaseData | Jose Colon Rodriguez | 2024-03-25 | 5 | -25/+330
* NumericData | Jose Colon Rodriguez | 2024-03-24 | 1 | -0/+141
* Moved case fold stuff to src/Caser.zig | Jose Colon Rodriguez | 2024-03-01 | 6 | -6/+6
* Changes when case folded check; 20ms faster | Jose Colon Rodriguez | 2024-03-01 | 1 | -3/+51
* Normalizer.eqlIgnoreCase compatibility caseless matching | Jose Colon Rodriguez | 2024-03-01 | 1 | -0/+76
* General Category with GenCatData | Jose Colon Rodriguez | 2024-02-27 | 1 | -0/+172
* Using NormPropsData in NormData; No Ziglyph deps in NOrmalizer | Jose Colon Rodriguez | 2024-02-27 | 1 | -0/+135
* Using HangulData in NormData | Jose Colon Rodriguez | 2024-02-27 | 1 | -0/+134
* Using NormData nfkd | Jose Colon Rodriguez | 2024-02-26 | 1 | -0/+65
* Using NormData and CanonData in Normalizer | Jose Colon Rodriguez | 2024-02-26 | 1 | -0/+68
* Using separate data struct model. | Jose Colon Rodriguez | 2024-02-26 | 3 | -76/+40
* Replaced ccc_map with table. 20ms faster | Jose Colon Rodriguez | 2024-02-20 | 1 | -0/+128
* Cleaned up directory structure | Jose Colon Rodriguez | 2024-02-19 | 2 | -5/+5
* Using argsWithAllocator for Windows | Jose Colon Rodriguez | 2024-02-18 | 2 | -2/+4
* Added cjk build option to set ambiguous width to 2 at build time | Jose Colon Rodriguez | 2024-02-18 | 1 | -1/+6
* display_width with table | Jose Colon Rodriguez | 2024-02-16 | 1 | -0/+243
* Removed codegen/indic.zig | Jose Colon Rodriguez | 2024-02-15 | 1 | -164/+0
* New single byte props table | Jose Colon Rodriguez | 2024-02-15 | 3 | -236/+280
* emoji direct array access | Jose Colon Rodriguez | 2024-02-14 | 1 | -13/+2
* gbp and indic direct array access | Jose Colon Rodriguez | 2024-02-14 | 2 | -84/+8
* Passing Unicode 15.1.0 Grapheme Break Tests | Jose Colon Rodriguez | 2024-02-14 | 1 | -0/+188
* Code reorg; Added UCD | Jose Colon Rodriguez | 2024-02-14 | 2 | -0/+299