Issue1746957
Created on 2007-07-03.08:01:13 by pekka.klarck, last changed 2007-07-08.00:09:17 by cgroves.
| History | |||
|---|---|---|---|
| Date | User | Action | Args |
| 2007-07-03 08:01:13 | pekka.klarck | create | |
Created on 2007-07-03.08:01:13 by pekka.klarck, last changed 2007-07-08.00:09:17 by cgroves.
| Messages | |||
|---|---|---|---|
| msg1697 (view) | Author: Pekka Klärck (pekka.klarck) | Date: 2007-07-03.08:01:13 | |
At least when decoding invalid utf-8 byte to unicode you get out an unicode object which has a weird 'uu' prefix. This is illustrated by the example below.
Jython 2.2rc1 on java1.5.0_11
Type "copyright", "credits" or "license" for more information.
>>> u = '\xFF'.decode('utf-8', 'replace')
>>> u
uu'\uFFFD'
>>> type(u)
<type 'unicode'>
>>> print u
?
There's also some discussion about this at Jython users mailing list in the beginning of July 2007 as a sub thread of "character encoding issues" thread. Following link ought to point to my mail about this.
http://sourceforge.net/mailarchive/message.php?msg_name=f5f747f10707020428t479239cdsa139465fffdfc87%40mail.gmail.com
|
|||
| msg1698 (view) | Author: Charlie Groves (cgroves) | Date: 2007-07-08.00:09:17 | |
Fixed in r3285. |
|||
| History | |||
|---|---|---|---|
| Date | User | Action | Args |
| 2007-07-03 08:01:13 | pekka.klarck | create | |
Supported by Python Software Foundation,
Powered by Roundup