Issue2342

classification
Title: Raw strings get different exception for lone surrogates
Type: behaviour Severity: minor
Components: Core Versions: Jython 2.7
Milestone:
process
Status: open Resolution:
Dependencies: Superseder:
Assigned To: Nosy List: gsnedders
Priority: Keywords:

Created on 2015-04-26.01:00:49 by gsnedders, last changed 2015-04-26.01:01:26 by gsnedders.

Messages
msg9967 (view) Author: Geoffrey Sneddon (gsnedders) Date: 2015-04-26.01:00:48
gsnedders@vanveen:~$ cat foo.py 
u"\ud800
gsnedders@vanveen:~$ jython foo.py
UnicodeDecodeError: 'unicodeescape' codec can't decode bytes in position 2-8: illegal Unicode character
gsnedders@vanveen:~$ cat bar.py
ur"\ud800"
gsnedders@vanveen:~$ jython bar.py 
ValueError: unpaired surrogate 0xd800 at code unit 0

This… is not a behaviour I'd expect. Obviously low priority, and possibly soon to be obsolete from #2340, but would be nice to make them consistent!
msg9968 (view) Author: Geoffrey Sneddon (gsnedders) Date: 2015-04-26.01:01:26
(Somehow I lost a " from the end of foo.py when copying that. The obvious syntax error shouldn't be there.)
History
Date User Action Args
2015-04-26 01:01:26gsnedderssetmessages: + msg9968
2015-04-26 01:00:49gsnedderscreate