[parrot/parrot] 391b79: RT #122341 treat all of 0x0a-0x0d,\x85,\x2028,\x20...

GitHub noreply at github.com
Sun Oct 5 14:22:29 UTC 2014


  Branch: refs/heads/smoke-me/newlines2-gh1087
  Home:   https://github.com/parrot/parrot
  Commit: 391b79b63049edf4d1d6569221f08648613a39d3
      https://github.com/parrot/parrot/commit/391b79b63049edf4d1d6569221f08648613a39d3
  Author: Tobias Leich <email at froggs.de>
  Date:   2014-10-05 (Sun, 05 Oct 2014)

  Changed paths:
    M src/string/encoding/shared.c
    M src/string/encoding/tables.c

  Log Message:
  -----------
  RT #122341 treat all of 0x0a-0x0d,\x85,\x2028,\x2029 as newlines

The script tools/dev/gen_charset_tables.pl was not used because it removes
the character properties of chars in the range 0x81 to 0xFF. Additionally,
u_iscclass now checks for characters with the enum_cclass_newline property,
which it did not do at all before.
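
As a rough illustration of the newline set named in the subject line, the
check can be sketched as a plain predicate over code points. This is not
the code from shared.c or tables.c; the function name is_newline_codepoint
is hypothetical and only mirrors the list 0x0a-0x0d, 0x85, 0x2028, 0x2029.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper, NOT Parrot's actual implementation: returns true
 * for the code points the commit message lists as newlines. */
bool
is_newline_codepoint(uint32_t c)
{
    return (c >= 0x0a && c <= 0x0d)  /* LF, VT, FF, CR */
        || c == 0x85                 /* NEL (next line) */
        || c == 0x2028               /* LINE SEPARATOR */
        || c == 0x2029;              /* PARAGRAPH SEPARATOR */
}
```

A range test covers the four ASCII controls; the three remaining code
points are compared individually because they are not contiguous.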


  Commit: 146316774d57165f455b9eb13a40524a5dd1c108
      https://github.com/parrot/parrot/commit/146316774d57165f455b9eb13a40524a5dd1c108
  Author: Reini Urban <rurban at cpanel.net>
  Date:   2014-10-05 (Sun, 05 Oct 2014)

  Changed paths:
    M ChangeLog
    M config/gen/makefiles/root.in
    M src/string/encoding/tables.c
    M src/string/encoding/tables.h
    M tools/dev/gen_charset_tables.pl

  Log Message:
  -----------
  [tools] typetables: fix gen_charset_tables.pl and regenerate

Removes Parrot_ascii_typetable, which has been defunct and unused since 2010.
Adds \v to CCLASS_NEWLINE manually (confirmed).
\x85 and \xa0 are confirmed to now be in the whitespace cclass, but
several old systems fail the whitespace test for \xa0 (non-breaking
space).
Several chars in 160..191 are no longer in the [[:punct:]] class.
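
To make the idea of a generated type table with a manual fix-up concrete,
here is a minimal sketch. It assumes a 256-entry table filled from the C
library's <ctype.h> classification, followed by a hand-added newline flag
for \v. All names (DEMO_*, demo_typetable, demo_build_typetable,
demo_has_class) are hypothetical; this is not the layout of tables.c nor
the logic of gen_charset_tables.pl.

```c
#include <ctype.h>
#include <stdint.h>

/* Hypothetical class flags, for illustration only. */
enum {
    DEMO_WHITESPACE = 1 << 0,
    DEMO_PUNCT      = 1 << 1,
    DEMO_NEWLINE    = 1 << 2
};

static uint8_t demo_typetable[256];

/* Fill the table from <ctype.h>; results for 0x80..0xFF depend on the
 * active locale, which is one way such tables can differ by system. */
void
demo_build_typetable(void)
{
    for (int c = 0; c < 256; c++) {
        uint8_t flags = 0;
        if (isspace(c)) flags |= DEMO_WHITESPACE;
        if (ispunct(c)) flags |= DEMO_PUNCT;
        if (c == '\n' || c == '\r' || c == '\f') flags |= DEMO_NEWLINE;
        demo_typetable[c] = flags;
    }
    /* Manual fix-up after generation, analogous to adding \v by hand. */
    demo_typetable['\v'] |= DEMO_NEWLINE;
}

/* Returns nonzero if byte c carries any of the requested class flags. */
int
demo_has_class(unsigned char c, unsigned flags)
{
    return (demo_typetable[c] & flags) != 0;
}
```

After calling demo_build_typetable(), demo_has_class('\v', DEMO_NEWLINE)
is true, while the answer for 0xa0 and DEMO_WHITESPACE depends on the
platform's locale data.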

Added a bootstrap-tables make target that updates the tables automatically.
Improved the POD in src/string/encoding/tables.c.
Closes PR #1087


Compare: https://github.com/parrot/parrot/compare/391b79b63049^...146316774d57

