Message5594

Author morganwahl
Recipients morganwahl
Date 2010-03-26.16:10:02
SpamBayes Score 5.1417896e-06
Marked as misclassified No
Message-id <1269619803.53.0.891424394838.issue1583@psf.upfronthosting.co.za>
In-reply-to
Content
I'm not sure where to write the bug description, but here goes:

I'm parsing an xml file in utf-8 (declared). When i call Node.data on a text node that contains the character U+00C5 (capital A with ring above) it's returning a byte-string whose repr is: '\xc5' . 0xc5 is the ISO-8859-15 (and cp1252) encoding of U+00C5, but of course U+00C5 is undef in ASCII. Thus, I get an error when joining the byte-string returned by Node.data with a unicode string, since my default encoding is ascii.

I'm using jython 2.5.1
History
Date User Action Args
2010-03-26 16:10:03morganwahlsetmessageid: <1269619803.53.0.891424394838.issue1583@psf.upfronthosting.co.za>
2010-03-26 16:10:03morganwahlsetrecipients: + morganwahl
2010-03-26 16:10:03morganwahllinkissue1583 messages
2010-03-26 16:10:02morganwahlcreate