Re: [Python-Dev] surrogatepass - she's a witch, burn 'er! [was: Cleaning up ...]

From: Isaac Morland <ijmo...@uwaterloo.ca>

Fri, 29 Aug 2014 07:22:10 -0400 (EDT)

On Fri, 29 Aug 2014, M.-A. Lemburg wrote:

> On 29.08.2014 02:41, Stephen J. Turnbull wrote:> Since Python allows working with lone surrogates in Unicode (they> are valid code points) and we're using UTF-8 for marshal, we needed> a way to make sure that Python 3 also optionally supports working> with lone surrogates in such UTF-8 streams (nowadays called CESU-8:> http://en.wikipedia.org/wiki/CESU-8).

If I want that wouldn't I specify "cesu-8" as the encoding?

i.e., instead of .decode ('utf-8') I would use .decode ('cesu-8').  Right 
now, trying this I get that cesu-8 is an unknown encoding but that could 
be changed without affecting the behaviour of the utf-8 codec.

It seems to me that .decode ('utf-8') should decode exactly and only valid 
utf-8, including the non-use of surrogate pairs as an intermediate 
encoding step.

Isaac Morland			CSCF Web Guru
DC 2554C, x36650		WWW Software Specialist
_______________________________________________
Python-Dev mailing list
Pyth...@python.org
https://mail.python.org/mailman/listinfo/python-dev
Unsubscribe: https://mail.python.org/mailman/options/python-dev/python-dev-ml%40activestate.com

Recent Messages in this Thread
[Python-Dev] Cleaning up surrogate escaped strings (was Bytes path related qu...	Nick Coghlan	Aug 28, 2014 12:26 pm
[Python-Dev] Cleaning up surrogate escaped strings (was Bytes path related qu...	Stephen J. Turnbull	Aug 29, 2014 12:32 am
Re: [Python-Dev] Cleaning up surrogate escaped strings (was Bytes path relate...	Nick Coghlan	Aug 29, 2014 04:55 am
[Python-Dev] surrogatepass - she's a witch, burn 'er! [was: Cleaning up ...]	Stephen J. Turnbull	Aug 29, 2014 12:41 am
Re: [Python-Dev] surrogatepass - she's a witch, burn 'er! [was: Cleaning up ...	M.-A. Lemburg	Aug 29, 2014 07:48 am
Re: [Python-Dev] surrogatepass - she's a witch, burn 'er! [was: Cleaning up ...	Isaac Morland	Aug 29, 2014 11:22 am
Re: [Python-Dev] surrogatepass - she's a witch, burn 'er! [was: Cleaning up ...	M.-A. Lemburg	Aug 29, 2014 12:18 pm
Re: [Python-Dev] surrogatepass - she's a witch, burn 'er! [was: Cleaning up ...	Greg Ewing	Aug 29, 2014 11:37 pm
Re: [Python-Dev] surrogatepass - she's a witch, burn 'er!	Stephen J. Turnbull	Aug 30, 2014 04:21 am
Re: [Python-Dev] surrogatepass - she's a witch, burn 'er! [was: Cleaning up ...	M.-A. Lemburg	Aug 30, 2014 10:03 am

◄ Messages in this thread ►

Previous post: Re: [Python-Dev] Bytes path related questions for Guido

Next post: Re: [Python-Dev] surrogatepass - she's a witch, burn 'er! [was: Cleaning up ...]

Subscribe to the python-dev RSS feed

Accounts

List Archives

Feedback & Information

ActiveState

© 2019 ActiveState Software Inc. All rights reserved. ActiveState®, Komodo®, ActiveState Perl Dev Kit®, ActiveState Tcl Dev Kit®, ActivePerl®, ActivePython®, and ActiveTcl® are registered trademarks of ActiveState. All other marks are property of their respective owners.