[parrot/parrot] 9a105b: RT #122341 treat all of 0x0a-0x0d,\x85,\x2028,\x20...

GitHub noreply at github.com
Sat Oct 4 16:16:07 UTC 2014

  Branch: refs/heads/smoke-me/newlines2-gh1087
  Home:   https://github.com/parrot/parrot
  Commit: 9a105b619c150eaf289fdf1ea124002ef63875be
  Author: Tobias Leich <email at froggs.de>
  Date:   2014-10-04 (Sat, 04 Oct 2014)

  Changed paths:
    M src/string/encoding/shared.c
    M src/string/encoding/tables.c

  Log Message:
  RT #122341 treat all of 0x0a-0x0d,\x85,\x2028,\x2029 as newlines

The script tools/dev/gen_charset_tables.pl was not used to regenerate the
tables because it removes the character properties of chars in the range
0x81 to 0xFF. Additionally, u_iscclass now checks for characters with the
enum_cclass_newline property, which it did not check at all before.
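
To make the set of code points named in the subject line concrete, here is a minimal sketch of a newline predicate. It is not Parrot's u_iscclass; the function name is_newline_codepoint is hypothetical and the code only mirrors the list 0x0a-0x0d, \x85, \x2028, \x2029 from the log message.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper (assumption, not part of Parrot): returns true for
 * the code points the log message lists as newlines. */
static bool is_newline_codepoint(uint32_t cp)
{
    /* 0x0A LF, 0x0B VT, 0x0C FF, 0x0D CR */
    if (cp >= 0x0A && cp <= 0x0D)
        return true;

    switch (cp) {
    case 0x85:    /* NEXT LINE (NEL) */
    case 0x2028:  /* LINE SEPARATOR */
    case 0x2029:  /* PARAGRAPH SEPARATOR */
        return true;
    default:
        return false;
    }
}
```

A range check plus a short switch keeps the sketch independent of any lookup table, which matters here because two of the listed code points lie outside the 8-bit range.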

  Commit: d815b6102dfc8b2e9170e8ebcb91ec0229737b7e
  Author: Reini Urban <rurban at cpanel.net>
  Date:   2014-10-04 (Sat, 04 Oct 2014)

  Changed paths:
    M ChangeLog
    M config/gen/makefiles/root.in
    M src/string/encoding/tables.c
    M src/string/encoding/tables.h
    M tools/dev/gen_charset_tables.pl

  Log Message:
  [tools] typetables: fix gen_charset_tables.pl and regenerate

Removes the defunct Parrot_ascii_typetable, unused since 2010.
Adds \v to CCLASS_NEWLINE manually (confirmed).
\x85 and \xa0 are confirmed to now be in the whitespace cclass, but
several old systems fail the \xa0 (non-breaking space) test for
whitespace.
Several chars in 160..191 are no longer in the [[:punct:]] class.
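
The class memberships described above can be pictured as a table-driven lookup. The following is a rough sketch only, not the contents of src/string/encoding/tables.c: the flag names SK_WHITESPACE and SK_NEWLINE, the table sketch_typetable, and the function sketch_is_cclass are all hypothetical, and the table holds just a handful of entries. Treating \x85 as both whitespace and newline is an assumption that combines the two log messages.

```c
#include <stdint.h>

/* Hypothetical class flags; Parrot's real enum values are not shown here. */
enum {
    SK_WHITESPACE = 1u << 0,
    SK_NEWLINE    = 1u << 1
};

/* Hypothetical 8-bit type table. Entries not listed default to 0. */
static const uint8_t sketch_typetable[256] = {
    [0x09] = SK_WHITESPACE,               /* \t */
    [0x0A] = SK_WHITESPACE | SK_NEWLINE,  /* \n */
    [0x0B] = SK_WHITESPACE | SK_NEWLINE,  /* \v, added to the newline class */
    [0x0C] = SK_WHITESPACE | SK_NEWLINE,  /* \f */
    [0x0D] = SK_WHITESPACE | SK_NEWLINE,  /* \r */
    [0x20] = SK_WHITESPACE,               /* space */
    [0x85] = SK_WHITESPACE | SK_NEWLINE,  /* NEL (assumed to be in both) */
    [0xA0] = SK_WHITESPACE                /* non-breaking space */
};

/* Returns nonzero if cp carries any of the requested class flags.
 * Code points above 0xFF are outside this sketch's table. */
static int sketch_is_cclass(unsigned flags, uint32_t cp)
{
    if (cp > 0xFF)
        return 0;
    return (sketch_typetable[cp] & flags) != 0;
}
```

With such a table, sketch_is_cclass(SK_NEWLINE, 0x0B) is nonzero while sketch_is_cclass(SK_NEWLINE, 0xA0) is zero. Code points such as \x2028 would need a separate check outside the 8-bit table.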

Added a bootstrap-tables make target that updates the tables automatically.
Improved the POD in src/string/encoding/tables.c.
Closes PR #1087

Compare: https://github.com/parrot/parrot/compare/9a105b619c15^...d815b6102dfc
