[RFC] Merge charsets and encodings
NotFound
julian.notfound at gmail.com
Wed Aug 25 13:05:58 UTC 2010
> My proposal is to merge all the charset and encoding functions into a single
> kind of string vtable eliminating duplicates like hash and find_cclass. I
> would keep the name "encoding", so there would be seven encodings:
>
> - ascii
> - iso-8859-1
> - binary
> - utf8
> - utf16
> - ucs2
> - ucs4
I fully agree with that proposal.
Don't forget the 'binary' encoding. It does nothing, but is a
convenient way to distinguish text strings from binary streams of
chars
We should also consider a cleanup of the encoding names, preferably
using the forms standarized by Unicode.
--
Salu2
More information about the parrot-dev
mailing list