Message6859

Author amak
Recipients amak, fwierzbicki, jeff.allen
Date 2012-03-19.20:25:53
SpamBayes Score 2.5187648e-05
Marked as misclassified No
Message-id <1332188753.86.0.943342726031.issue1836@psf.upfronthosting.co.za>
In-reply-to
Content
The character "\udef0" is in the range 0xD800-0xDFFF, i.e. it is an "unpaired surrogate".

http://en.wikipedia.org/wiki/UTF-16

Cpython accepts it, because cpython uses UCS-2, for which "\def0" is a valid character.

Java, and thus jython, uses UTF-16, which supports surrogate pairs for encoding characters outside the Basic Multilingual Plane.

If you retry your code with values outside the range oxD800-0xDFFF, it will work.

Or provide a proper surrogate pair to decode.

This bug should be closed as "invalid" or "wont fix".
History
Date User Action Args
2012-03-19 20:25:53amaksetmessageid: <1332188753.86.0.943342726031.issue1836@psf.upfronthosting.co.za>
2012-03-19 20:25:53amaksetrecipients: + amak, fwierzbicki, jeff.allen
2012-03-19 20:25:53amaklinkissue1836 messages
2012-03-19 20:25:53amakcreate