[RFC] Merge charsets and encodings

NotFound julian.notfound at gmail.com
Wed Aug 25 13:05:58 UTC 2010


> My proposal is to merge all the charset and encoding functions into a single
> kind of string vtable eliminating duplicates like hash and find_cclass. I
> would keep the name "encoding", so there would be seven encodings:
>
> - ascii
> - iso-8859-1
> - binary
> - utf8
> - utf16
> - ucs2
> - ucs4

I fully agree with that proposal.

Don't forget the 'binary' encoding. It does nothing, but is a
convenient way to distinguish text strings from binary streams of
chars

We should also consider a cleanup of the encoding names, preferably
using the forms standarized by Unicode.

-- 
Salu2


More information about the parrot-dev mailing list