Message8046

Author amak
Recipients amak, oberstet, santa4nt
Date 2013-06-15.11:37:11
SpamBayes Score -1.0
Marked as misclassified Yes
Message-id <1371296232.66.0.431845954888.issue2061@psf.upfronthosting.co.za>
In-reply-to
Content
\ud800 is an unpaired surrogate, which is illegal in the UTF-16 representation used by Jython.

It is legal in CPython's UCS-2 representation, but it does not represent any real-world character.

It never appears in real-world data, only in CPython-specific tests, which break outside of CPython. Expecting these tests to pass on a platform that does not use UCS-2 is unreasonable.
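
For illustration, here is a minimal sketch of the UTF-16 pairing rule that makes a lone \ud800 ill-formed: a high surrogate (0xD800-0xDBFF) must be immediately followed by a low surrogate (0xDC00-0xDFFF), and a low surrogate must not appear on its own. The function name find_unpaired_surrogates is hypothetical and this is not Jython's actual implementation; it only checks a list of integer code units.

```python
def find_unpaired_surrogates(code_units):
    """Return the indices of unpaired surrogates in a list of UTF-16 code units.

    Hypothetical helper for illustration only; not part of Jython or CPython.
    """
    unpaired = []
    i = 0
    n = len(code_units)
    while i < n:
        unit = code_units[i]
        if 0xD800 <= unit <= 0xDBFF:
            # High surrogate: valid only if a low surrogate follows directly.
            if i + 1 < n and 0xDC00 <= code_units[i + 1] <= 0xDFFF:
                i += 2  # skip the well-formed pair
                continue
            unpaired.append(i)
        elif 0xDC00 <= unit <= 0xDFFF:
            # Low surrogate with no preceding high surrogate.
            unpaired.append(i)
        i += 1
    return unpaired
```

Under these assumptions, the single code unit 0xD800 is reported as unpaired, while a well-formed pair such as 0xD83D followed by 0xDE00 is not.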

Resolving as a duplicate of #2048

http://bugs.jython.org/issue2048

That issue is itself a duplicate of these bug reports:

Jython doesn't allow to use unmapped unicode codepoint
http://bugs.jython.org/issue1707

Invalid Unicode characters cause compile-time error (CPython divergence)
http://bugs.jython.org/issue1836
History
Date User Action Args
2013-06-15 11:37:12 amak set messageid: <1371296232.66.0.431845954888.issue2061@psf.upfronthosting.co.za>
2013-06-15 11:37:12 amak set recipients: + amak, oberstet, santa4nt
2013-06-15 11:37:12 amak link issue2061 messages
2013-06-15 11:37:11 amak create