[parrot/parrot] 0ebc3b: RT #122341 treat all of 0x0a-0x0d,\x85,\x2028,\x20...

GitHub noreply at github.com
Sat Oct 4 14:40:48 UTC 2014


  Branch: refs/heads/smoke-me/newlines2-gh1087
  Home:   https://github.com/parrot/parrot
  Commit: 0ebc3b8280e8849bb87db6ee52fd814a791b0329
      https://github.com/parrot/parrot/commit/0ebc3b8280e8849bb87db6ee52fd814a791b0329
  Author: Tobias Leich <email at froggs.de>
  Date:   2014-10-04 (Sat, 04 Oct 2014)

  Changed paths:
    M src/string/encoding/shared.c
    M src/string/encoding/tables.c

  Log Message:
  -----------
  RT #122341 treat all of 0x0a-0x0d,\x85,\x2028,\x2029 as newlines

The script tools/dev/gen_charset_tables.pl was not used because it removes
character properties of chars in the range of 0x81 to 0xFF. Additionally,
u_iscclass now checks for characters with enum_cclass_newline property,
which it did not do at all before.


  Commit: 77808481b2717e149015a55a1493217cf542d8ed
      https://github.com/parrot/parrot/commit/77808481b2717e149015a55a1493217cf542d8ed
  Author: Reini Urban <rurban at cpanel.net>
  Date:   2014-10-04 (Sat, 04 Oct 2014)

  Changed paths:
    M ChangeLog
    M config/gen/makefiles/root.in
    M src/string/encoding/tables.c
    M src/string/encoding/tables.h
    M tools/dev/gen_charset_tables.pl

  Log Message:
  -----------
  [tools] typetables: fix gen_charset_tables.pl and regenerate

Removes defunct and since 2010 unused Parrot_ascii_typetable.
Adds \v to CCLASS_NEWLINE manually (confirmed),
\x85\xa0 confirmed to be now in the whitespace cclass, but
several old systems fail with the \xa0 (non-breaking whitespace)
test for whitespace.
Several chars 160..191 are not in the [[:punct:]] class anymore.

Added bootstrap-tables make target, update the tables automatically.
Improved src/string/encoding/tables.c pod.
Closes PR #1087


Compare: https://github.com/parrot/parrot/compare/0ebc3b8280e8^...77808481b271


More information about the parrot-commits mailing list