UTF8 performance

Vasily Chekalkin bacek at bacek.com
Sat Jan 9 23:45:48 UTC 2010


Nick Wellnhofer wrote:
> On 06/01/10 23:12, Nick Wellnhofer wrote:
>> Here is a preliminary patch.
> 
> Here is bigger patch that makes the following changes:
> 
> - Move the function pointers from string_iterator_t to encoding_t
> - Remove now unneeded iter_init from encoding_t
> - Introduce new STRING_ITER_ macros
> - Add iter_regress_and_decode function to encoding_t
> - Change the string iterator PMC to actually use the string iterator API
> - Change Parrot_str_split to use iterators
> - Optimize utf8_set_position to also search backward

Looks pretty good. Unfortunately, afaiu, this version requires 
deprecation notice. I'll apply first version and put deprecation notice 
for ENCODING_ITER_INIT and iter_init so we can apply full version in 
about 2 weeks time.

-- 
Bacek


More information about the parrot-dev mailing list