[parrot/parrot] b30b45: RT #122341 treat all of 0x0a-0x0d,\x85,\x2028,\x20...

GitHub noreply at github.com
Tue Oct 14 15:24:36 UTC 2014

  Branch: refs/heads/master
  Home:   https://github.com/parrot/parrot
  Commit: b30b4585f8d984691f5ff78d12a056a3796a23c6
  Author: Tobias Leich <email at froggs.de>
  Date:   2014-10-14 (Tue, 14 Oct 2014)

  Changed paths:
    M src/string/encoding/shared.c
    M src/string/encoding/tables.c

  Log Message:
  RT #122341 treat all of 0x0a-0x0d,\x85,\x2028,\x2029 as newlines

The script tools/dev/gen_charset_tables.pl was not used because it removes
character properties of chars in the range of 0x81 to 0xFF. Additionally,
u_iscclass now checks for characters with enum_cclass_newline property,
which it did not do at all before.
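
As an illustration of the newline set named in the log message above, here is a minimal sketch in C. It is not Parrot's actual code: the function name is_newline_codepoint is hypothetical, and it assumes the input is an already-decoded Unicode code point, with \x2028 and \x2029 read as U+2028 and U+2029.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper, not taken from Parrot's sources.
 * Returns true for the code points the commit subject lists as newlines. */
static bool
is_newline_codepoint(uint32_t cp)
{
    return (cp >= 0x0A && cp <= 0x0D)   /* LF, VT, FF, CR */
        || cp == 0x85                    /* NEL (next line) */
        || cp == 0x2028                  /* LINE SEPARATOR */
        || cp == 0x2029;                 /* PARAGRAPH SEPARATOR */
}
```

A range test plus three equality checks is enough here because the set is small and fixed; the table-based approach in tables.c covers many character classes at once, which a sketch like this does not attempt.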

  Commit: f62bc76ab20304eca76be5bc3bcb4f5bcb0ff444
  Author: Reini Urban <rurban at cpanel.net>
  Date:   2014-10-14 (Tue, 14 Oct 2014)

  Changed paths:
    M ChangeLog
    M config/gen/makefiles/root.in
    M src/string/encoding/tables.c
    M src/string/encoding/tables.h
    M tools/dev/gen_charset_tables.pl

  Log Message:
  [tools] typetables: fix gen_charset_tables.pl and regenerate

Removes the defunct Parrot_ascii_typetable, unused since 2010.
Adds \v to CCLASS_NEWLINE manually (confirmed).
\x85 and \xa0 are confirmed to be in the whitespace cclass now, but
several old systems fail the whitespace test for \xa0 (non-breaking
space).
Several chars in the range 160..191 are no longer in the [[:punct:]] class.

Added a bootstrap-tables make target that updates the tables automatically.
Improved the POD in src/string/encoding/tables.c.
Closes PR #1087
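
The typetable idea referred to in the log message above can be sketched as a 256-entry table of class flags indexed by byte value. This is only an illustration under stated assumptions: the names sketch_typetable, sketch_is_cclass and the SKETCH_CCLASS_* flag values are hypothetical and do not reflect the generated contents of tables.c or Parrot's real enum_cclass_* values. The entries shown follow the log messages (\v as a newline, \x85 and \xa0 as whitespace); all other bytes are left unclassified.

```c
#include <stdint.h>

/* Hypothetical flag bits; Parrot's real enum_cclass_* values may differ. */
enum {
    SKETCH_CCLASS_WHITESPACE = 1u << 0,
    SKETCH_CCLASS_NEWLINE    = 1u << 1
};

/* Hypothetical 8-bit table: index is the byte value, entry is a flag set.
 * Entries not listed default to 0 (no class). */
static const uint8_t sketch_typetable[256] = {
    [0x09] = SKETCH_CCLASS_WHITESPACE,                          /* \t */
    [0x0A] = SKETCH_CCLASS_WHITESPACE | SKETCH_CCLASS_NEWLINE,  /* \n */
    [0x0B] = SKETCH_CCLASS_WHITESPACE | SKETCH_CCLASS_NEWLINE,  /* \v */
    [0x0C] = SKETCH_CCLASS_WHITESPACE | SKETCH_CCLASS_NEWLINE,  /* \f */
    [0x0D] = SKETCH_CCLASS_WHITESPACE | SKETCH_CCLASS_NEWLINE,  /* \r */
    [0x20] = SKETCH_CCLASS_WHITESPACE,                          /* space */
    [0x85] = SKETCH_CCLASS_WHITESPACE | SKETCH_CCLASS_NEWLINE,  /* NEL */
    [0xA0] = SKETCH_CCLASS_WHITESPACE                           /* NBSP */
};

/* Returns nonzero if byte ch has any of the requested class flags. */
static int
sketch_is_cclass(unsigned flags, uint8_t ch)
{
    return (sketch_typetable[ch] & flags) != 0;
}
```

With a table like this, a class test is a single indexed load and a mask, which is why regenerating the table correctly matters more than the lookup code itself.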

Compare: https://github.com/parrot/parrot/compare/bbc34189ec5f...f62bc76ab203
