Add string length tests for supplementary Unicode code points #52

fge · 2014-04-08T00:06:23Z

That is, code points outside the Basic Multilingual Plane (U+0000 to U+FFFF).

This will test that validators correctly account for the number of code points
inside strings, as required by RFC 7159, section 1:

    A string is a sequence of zero or more Unicode characters [UNICODE].
    Note that this citation references the latest version of Unicode
    rather than a specific release. It is not expected that future
    changes in the UNICODE specification will impact the syntax of JSON.

In order to keep it "easy" for storage, the UTF-16 escape sequence (as defined
by RFC 7159, section 7) is used to represent these characters. The chosen code
point is U+1F4A9 (yes, Unicode defines that):

http://www.fileformat.info/info/unicode/char/1F4A9/index.htm
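
For readers who want to see the difference between counting code points and counting UTF-16 code units, here is a minimal sketch. It is not taken from the test suite itself: the variable names are illustrative, and it only assumes the standard-library `json` module, which combines an escaped surrogate pair into a single character when decoding.

```python
import json

# JSON text holding U+1F4A9 written as a UTF-16 surrogate pair escape
# (the \uXXXX\uXXXX form described in RFC 7159, section 7).
raw = '"\\ud83d\\udca9"'
value = json.loads(raw)

# Python's str length counts code points, which is what the tests expect.
code_points = len(value)

# A length based on UTF-16 code units would count the same string twice over.
utf16_units = len(value.encode("utf-16-le")) // 2

print(code_points, utf16_units)
```

A validator whose string type is UTF-16 based and that reports the code-unit count would see this one-character string as having length 2, which is the mistake these tests are meant to catch.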

geraintluff · 2014-04-08T12:42:50Z

Looks great. :)

I'm seeing a Travis error saying the test descriptions are too long (should be < 60 chars), though. Would you be able to shorten them?

fge · 2014-04-08T13:10:09Z

Uuuhuh.

Should mention that in the README before contributing... OK, at least the Travis report tells me how to run sanity checks.

But 60 chars? That's pretty short!

fge · 2014-04-08T13:19:12Z

OK, fixed!

Julian · 2014-04-13T23:04:12Z

Yeah, the 60 char thing is mostly since some people (well, me, and I'm sure other people :D) use the descriptions to construct names of the test methods on xUnit test cases, so keeping those <80 means keeping the descriptions short.
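
A rough sketch of the idea, for illustration only: the `method_name` helper, the `test_` prefix, and the sample description are hypothetical and are not code from any particular implementation or from the suite.

```python
import re

MAX_DESCRIPTION = 60  # description limit quoted in this thread
MAX_NAME = 80         # method-name limit mentioned above


def method_name(description: str) -> str:
    """Build a hypothetical xUnit-style method name from a test description."""
    # Collapse every run of non-alphanumeric characters into one underscore.
    slug = re.sub(r"[^0-9A-Za-z]+", "_", description).strip("_").lower()
    return "test_" + slug


# Illustrative description, not necessarily one used in the suite.
description = "one supplementary Unicode code point is not long enough"
name = method_name(description)

print(name, len(description), len(name))
```

With a short prefix like this, a description under 60 characters leaves room for the generated name to stay under 80.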

Although I've wanted to add a full description field once in a while for longer descriptions. Anyways, I'll fix and merge, thanks!

fge · 2014-04-13T23:08:11Z

Is it normal that Travis didn't pick up the latest branch? Or does it not like push -f?

Julian · 2014-04-13T23:11:33Z

Hm. Did the force push have changes in it? The commit I see is bc84af3 which still looks like the first one. Sorry about that either way though.

Julian merged commit bc84af3 into json-schema-org:develop Apr 13, 2014

karenetheridge mentioned this pull request Dec 1, 2023

Updates min/maxLenth tests to mention graphemes rather than unicode code points #710

Merged

micolous mentioned this pull request Mar 28, 2025

Extend treatment on Unicode and/or its security considerations json-schema-org/json-schema-spec#215

Open
