zg - Mirror of https://codeberg.org/atman/zg/

	Commit message (Collapse)	Author	Age	Files	Lines
*	Various small iterator improvementswork-branch	Sam Atman	2025-05-13	1	-9/+46
\|
*	Add reverse CodePoint iterator	Sam Atman	2025-05-09	1	-6/+75
\|
*	Make DisplayWidth.setup publicv0.14.0-rc2	Sam Atman	2025-05-04	1	-1/+7
\| \| \| \|	Also adds setupWithGraphemes variant.
*	Remove inner setup from GeneralCategories	Sam Atman	2025-05-01	1	-10/+1
\| \| \| \| \|	It was one `try` block away from only returning Allocator.Error, so now there's no need to filter errors in an outer `catch`.
*	Update Unicode version in README.md	Sam Atman	2025-04-30	1	-0/+1
\| \| \| \| \| \| \|	Lets me slip these in: Closes #12 Closes #14
*	Unicode 16.0	Sam Atman	2025-04-30	1	-1/+7
\| \| \| \| \|	Went smoothly, needed to add some scripts and adjust the magic numbers, but other than that, all set.
*	Allocation Failure Tests	Sam Atman	2025-04-30	11	-91/+178
\| \| \| \| \| \| \| \| \| \|	These turned up an excessive amount of allocations in CanonData and CompatData, which have been reduced to two through the somewhat squirrely use of 'magic numbers'. There are now allocation tests for every allocated structure in the library, and they run to completion in a reasonable amount of time. So, that's nice.
*	Setup variants for all allocating modules	Sam Atman	2025-04-30	7	-146/+228
\| \| \| \| \| \| \| \|	This harmonizes the allocating modules in a couple of ways. All can now be constructed by pointer, and all treat various miscellaneous read failures as `unreachable`, which indeed they should be. The README has been updated to inform users of this option.
*	Update README.md to new API	Sam Atman	2025-04-30	1	-10/+10
\|
*	Rest of the Renamings	Sam Atman	2025-04-30	5	-0/+0
\| \| \| \|	These get different names, but don't otherwise change.
*	Remove FoldData, make CaseFolding	Sam Atman	2025-04-30	4	-167/+218
\| \| \| \| \|	CaseFolding now has the FoldData, and can be initialized with a copy of Normalize if wanted.
*	Merge NormData with Normalize	Sam Atman	2025-04-30	10	-278/+269
\|
*	grapheme now Graphemes, Data files gone	Sam Atman	2025-04-30	4	-193/+4
\|
*	Factor out 'Data' for grapheme and DisplayWidth	Sam Atman	2025-04-30	6	-119/+313
\| \| \| \| \|	In the process of refactoring the whole library, so that it doesn't expose anything called "Data" separately from user functionality.
*	Add general tests step	Sam Atman	2025-04-29	7	-44/+49
\| \| \| \| \| \|	After a considerable slog, all tests are reachable from the test step, and pass. Almost every failure was related to the change away from the inclusion of an allocator on this or that.
*	Add result.toOwned() to Normalize.zig	Sam Atman	2025-04-29	1	-0/+9
\| \| \| \| \| \|	Closes #29 The README is also updated to reflect this change.
*	All the std.mem.Allocators that were stored just for init and deinit	lch361	2025-04-29	15	-102/+86
\| \| \| \|	methods were removed, mem.Allocators were added to deinit as arguments.
*	Bump copyright year, isolate iterator tests	Sam Atman	2025-04-29	1	-13/+18
\|
*	Add c0 and c1 control width options	Sam Atman	2025-03-20	2	-32/+36
\| \| \| \| \| \| \|	This allows a build of DisplayWidth to give characters in those classes a width, for cases where they'll be printed with a substitute in the final display. It also raises the size of possible characters from an i3 to an i4, to accommodate printing C1s as e.g. <80> or \u{80}.
*	Fix leak of cwcf_exceptions in FoldData	Ryan Liptak	2024-12-04	1	-0/+2
\| \| \| \|	Closes #20
*	Add peek() to Grapheme.Iterator	Sam Atman	2024-11-02	2	-0/+95
\| \| \| \| \|	This does the expected thing: returns the next ?Grapheme without mutation of the iteration state.
*	Replace deprecated uses of std.mem.split	Sam Atman	2024-11-02	1	-8/+8
\|
*	WidthData: define error set as mem.Allocator.Error	Tim Culverhouse	2024-10-14	1	-5/+5
\| \| \| \| \| \| \|	The reader is a static embedded file. All of the reads are readInt. This function should not ever fail at runtime with a read error. Make all read errors unreachable, leaving only allocation errors as the error set.
*	GraphemeData: define error set as mem.Allocator.Error	Tim Culverhouse	2024-10-14	1	-7/+7
\| \| \| \| \| \| \|	The reader is a static embedded file. All of the reads are either a readInt or a readAll into a previously allocated buffer. This function should not ever fail at runtime with a read error. Make all read errors unreachable, leaving only allocation errors as the error set.
*	refactor CodePoint.Iterator into a reusable fn	Jonathan Raphaelson	2024-07-05	1	-57/+79
\| \| \| \| \| \|	without changing the algorithm at all, move the responsibility of decoding a u8 slice out of the iterator, and into a reusable function so that it can be used by consumers of the library
*	FoldData: Minimize Changes_When_Casefolded data	Ryan Liptak	2024-06-27	1	-5/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Only a few codepoints have a mapping in CaseFolding.txt but do not have the Changes_When_Casefolded property set. So, FoldData can just store a list of those particular codepoints and then re-use the encoded CaseFolding.txt data alongside it in order to implement changesWhenCaseFolded. This reduces the size of fold.bin.z from 4,387 bytes (4.28KiB) to 1,165 bytes (1.13KiB). This also seemingly introduced a very slight performance regression in zg_caseless. Before: zg CaseFold.compatCaselessMatch: result: 626, took: 258ns zg CaseFold.canonCaselessMatch: result: 626, took: 129ns After: zg CaseFold.compatCaselessMatch: result: 626, took: 263ns zg CaseFold.canonCaselessMatch: result: 626, took: 131ns
*	Removed all inlines	Jose Colon Rodriguez	2024-06-26	11	-33/+35
\|
*	Added changes when casefolded back	Jose Colon Rodriguez	2024-06-26	1	-2/+6
\|
*	Implemented sqeek502s case fold	Jose Colon Rodriguez	2024-06-26	2	-36/+53
\|
*	Normalize: Mark utf8Encode errors as unreachable, use explicit error sets	Ryan Liptak	2024-06-25	1	-11/+11
\| \| \| \|	These utf8Encode calls are converting normalized codepoints back into UTF-8, so the codepoints can be assumed to be valid.
*	codepoint: prevent panic when last cp too short	Tim Culverhouse	2024-06-10	1	-0/+11
\| \| \| \| \| \| \| \| \| \| \|	If the last codepoint in a byte slice is incomplete (IE has a length of 3 but there are only 2 bytes remaining), the iterator will panic. Instead of panicking, prefer to return a replacement character. This strategy is similar to that in the block just above which returns a replacement character if the first byte is not valid. In this latter block, we also consume only one byte and allow the iterator to continue. This allows for sections of text which may have a single byte incorrect near the end of the slice.
*	Merge pull request 'DisplayWidth: explicitly set width to 2 when VS16 is ↵	Jose Colon	2024-04-11	1	-0/+4
\|\ \| \| \| \| \| \| \| \| \| \|	found' (#3) from rockorager/zg:vs-16 into master Reviewed-on: https://codeberg.org/dude_the_builder/zg/pulls/3
\| *	DisplayWidth: explicitly set width to 2 when VS16 is found	Tim Culverhouse	2024-04-11	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Explicitly set the width of an emoji to two when the next codepoint is a VS16 selector. Add unit test for this case. This is essentially the same PR as https://codeberg.org/dude_the_builder/ziglyph/pulls/11
* \|	grapheme: export grapheme.State struct	Tim Culverhouse	2024-04-11	1	-1/+1
\|/ \| \| \| \| \|	The public function `graphemeBreak` requires a reference to a State struct, however this type is not exported. Export the type to allow users of zg to use this type and call graphemeBreak.
*	NormData init now takes pointer to uninitialized Self to avoid stack copy ↵	Jose Colon Rodriguez	2024-04-02	3	-14/+20
\| \| \| \|	issues.
*	Updated README	Jose Colon Rodriguez	2024-03-31	14	-87/+36
\|
*	Split out Unicode tests to separate file	Jose Colon Rodriguez	2024-03-28	3	-185/+195
\|
*	Merged NumericData into PropsData	Jose Colon Rodriguez	2024-03-28	2	-69/+44
\|
*	PropsData and errdefers for init fns	Jose Colon Rodriguez	2024-03-28	13	-22/+179
\|
*	ScriptsData and made all Datas const	Jose Colon Rodriguez	2024-03-27	17	-57/+283
\|
*	Friendly general category methods	Jose Colon Rodriguez	2024-03-27	1	-30/+116
\|
*	Rename DisplayWidthData	Jose Colon Rodriguez	2024-03-27	1	-7/+7
\|
*	rm src/main.zig	Jose Colon Rodriguez	2024-03-26	1	-93/+0
\|
*	GraphemeData and Normalize non-pub fns	Jose Colon Rodriguez	2024-03-26	2	-13/+13
\|
*	Using diff for lowercase mapping	Jose Colon Rodriguez	2024-03-26	1	-2/+3
\|
*	Using diff for uppercase mapping	Jose Colon Rodriguez	2024-03-26	1	-2/+3
\|
*	Removed title case processing	Jose Colon Rodriguez	2024-03-26	1	-35/+15
\|
*	CaseData	Jose Colon Rodriguez	2024-03-25	1	-0/+223
\|
*	NumericData	Jose Colon Rodriguez	2024-03-24	2	-12/+95
\|
*	Rename CaseFold and Normalize	Jose Colon Rodriguez	2024-03-23	3	-15/+15
\|