zg - Mirror of https://codeberg.org/atman/zg/

	Commit message (Collapse)	Author	Files	Lines
2025-07-08	Add Words.zig example to README	Sam Atman	1	-0/+3

2025-06-01	Add graphemeAtIndex + iterate before and after	Sam Atman	1	-2/+58
	That completes the set. I do think it's possible to bum a few more cycles from the implementation, but, I'm not going to. It passes the acceptance suite and that's what it needs to do.
2025-05-23	Make offset size configurable	Sam Atman	1	-7/+9
	Hopefully I can talk users out of taking advantage of this configuration but I'll have better luck with that if it's available.
2025-05-15	wordAtIndex passes conformance	Sam Atman	1	-1/+0
	I removed the initAtIndex functions from the public vocabulary, because the last couple of days of sweat and blood prove that it's hard to use correctly. That's probably it for WordBreak, now to fix the overlong bug on v0.14 and get this integrated with the new reverse grapheme iterator.
2025-05-15	Add format for CodePoint	Sam Atman	1	-2/+10

2025-05-15	Hooked up break test, some bugs squashed	Sam Atman	1	-10/+0
	The handling of ignorables is really different, because they 'adhere' to the future of the iteration, not the past.
2025-05-15	Reverse Word Iterator	Sam Atman	1	-1/+1
	Next up I hook it to the tests.
2025-05-15	Begin conformance test	Sam Atman	1	-0/+5
	I'm not sure the details of this strategy can actually be made to work. But, something can.
2025-05-15	Various small iterator improvements	Sam Atman	1	-4/+51

2025-05-15	Add reverse CodePoint iterator	Sam Atman	1	-1/+67

2025-05-15	Maximal Subparts tests	Sam Atman	1	-37/+114
	The decoder now properly returns substitution bytes according to Substitution of Maximal Subparts, with tests to prove it.
2025-05-15	Replace CodePoint Decoding with Hörhmann Method	Sam Atman	1	-59/+204
	This still needs a small barrage of tests to confirm that it correctly performs substitution of maximal subparts (Unicode 16.0.0 §3.9.6). I'm pretty sure this edition is 'overly maximal' actually, the name of the algorithm is somewhat misleading as to what it actually does.
2025-05-14	Add overlong test, which should fail	Sam Atman	1	-2/+15
	But does not.
2025-05-13	Various small iterator improvementswork-branch	Sam Atman	1	-9/+46

2025-05-09	Add reverse CodePoint iterator	Sam Atman	1	-6/+75

2024-07-05	refactor CodePoint.Iterator into a reusable fn	Jonathan Raphaelson	1	-57/+79
	without changing the algorithm at all, move the responsibility of decoding a u8 slice out of the iterator, and into a reusable function so that it can be used by consumers of the library
2024-06-10	codepoint: prevent panic when last cp too short	Tim Culverhouse	1	-0/+11
	If the last codepoint in a byte slice is incomplete (IE has a length of 3 but there are only 2 bytes remaining), the iterator will panic. Instead of panicking, prefer to return a replacement character. This strategy is similar to that in the block just above which returns a replacement character if the first byte is not valid. In this latter block, we also consume only one byte and allow the iterator to continue. This allows for sections of text which may have a single byte incorrect near the end of the slice.
2024-02-18	Back to zg code_point. 4ms faster than Ghostty's Utf8Decoder	Jose Colon Rodriguez	1	-29/+39

2024-02-18	Code point code is now a method not a field.	Jose Colon Rodriguez	1	-39/+29

2024-02-18	Code point and grapheme are now namespaces.	Jose Colon Rodriguez	1	-19/+20

2024-02-17	Fixed isAsciiOnly and CodePointIterator ASCII bugs	Jose Colon Rodriguez	1	-3/+3

2024-02-17	GraphemeIterator ASCII optimization 3x faster	Jose Colon Rodriguez	1	-12/+15

2024-02-14	Removed readCodePoint and StreamingGraphemeIterator	Jose Colon Rodriguez	1	-50/+0

2024-02-13	Removed unreachables from CodePointIterator	Jose Colon Rodriguez	1	-0/+131