[parrot/parrot] 7789e9: RT #122341 treat all of 0x0a-0x0d,\x85,\x2028,\x20...

GitHub noreply at github.com
Tue Jul 22 22:50:36 UTC 2014

  Branch: refs/heads/FROGGS/newlines2-gh1087
  Home:   https://github.com/parrot/parrot
  Commit: 7789e9f9cac33b2ec0bd128169a3d366651d46fe
  Author: Tobias Leich <email at froggs.de>
  Date:   2014-07-22 (Tue, 22 Jul 2014)

  Changed paths:
    M src/string/encoding/shared.c
    M src/string/encoding/tables.c

  Log Message:
  RT #122341 treat all of 0x0a-0x0d,\x85,\x2028,\x2029 as newlines

The script tools/dev/gen_charset_tables.pl was not used because it removes
character properties of chars in the range of 0x81 to 0xFF. Additionally,
u_iscclass now checks for characters with enum_cclass_newline property,
which it did not do at all before.

Signed-off-by: Reini Urban <rurban at cpanel.net>
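
For illustration only, the set of code points named in the log message above
can be written as a single predicate. This is a minimal sketch, not the code
in src/string/encoding/shared.c: the function name is_newline_codepoint is
hypothetical, and Parrot itself classifies characters through its cclass
tables and u_iscclass rather than a standalone helper like this one.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper (not part of Parrot): returns true for the code
 * points the commit message lists as newlines:
 *   0x0a-0x0d  LF, VT, FF, CR
 *   0x85       NEXT LINE (NEL)
 *   0x2028     LINE SEPARATOR
 *   0x2029     PARAGRAPH SEPARATOR
 */
bool is_newline_codepoint(uint32_t cp)
{
    if (cp >= 0x0a && cp <= 0x0d)
        return true;
    return cp == 0x85 || cp == 0x2028 || cp == 0x2029;
}
```

A table-driven lookup, as Parrot uses, trades this explicit comparison for a
per-character bitmask, which is why the generated tables.c had to change too.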

  Commit: c1d57d25fd0f33868c8e90c4a613a41aa647d2e6
  Author: Reini Urban <rurban at cpanel.net>
  Date:   2014-07-22 (Tue, 22 Jul 2014)

  Changed paths:
    M ChangeLog
    M src/string/encoding/tables.c
    M src/string/encoding/tables.h
    M tools/dev/gen_charset_tables.pl

  Log Message:
  regenerate src/string/encoding/tables.c

so that tools/dev/gen_charset_tables.pl is kept up-to-date (it was not).
Added \v to CCLASS_NEWLINE manually (confirmed).
\x85 and \xa0 are already in the whitespace cclass.
Removed the creation of the now defunct ascii table.

Beware: My new up-to-date libc removed several chars > 160 from the
[[:punct:]] class.
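
One way to see what a given libc reports for those characters is to count
how many values above 160 satisfy ispunct(). This is a rough sketch under
the assumption that the classification in question comes from the C
library's <ctype.h> functions in the active locale; the function name
count_high_punct is hypothetical and nothing here is taken from
gen_charset_tables.pl.

```c
#include <ctype.h>

/* Hypothetical diagnostic (not part of Parrot): counts the byte values
 * 161..255 that the C library classifies as punctuation in the currently
 * active locale. Call setlocale() beforehand to test a specific locale;
 * without it the program runs in the "C" locale.
 */
int count_high_punct(void)
{
    int count = 0;
    int c;

    for (c = 161; c <= 255; c++) {
        /* ispunct() accepts any value representable as unsigned char */
        if (ispunct(c))
            count++;
    }
    return count;
}
```

Comparing the result across two libc versions under the same locale would
show whether the punctuation class shrank between them.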

Compare: https://github.com/parrot/parrot/compare/7789e9f9cac3^...c1d57d25fd0f
