Recipe 466293 revision 2

>>> len('a\\nb')
4
>>> len('a\\nb'.decode('string_escape'))
3
>>>

Or, for unicode strings:

>>> len(u'\N{euro sign}\\nB')
4
>>> len(u'\N{euro sign}\\nB'.encode('utf-8').decode('string_escape').decode('utf-8'))
3
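
The sessions above rely on the Python 2 `string_escape` codec, which Python 3 does not provide. As a rough sketch of a Python 3 counterpart, the helper below (the name `decode_escapes` is hypothetical, not part of the recipe) swaps in the `unicode_escape` codec. It first encodes to Latin-1 with `backslashreplace`, on the assumption that every character outside Latin-1 should survive the round trip as a `\uXXXX` escape.

```python
def decode_escapes(text):
    """Decode backslash escapes in a str (Python 3 sketch, hypothetical helper)."""
    # Characters outside Latin-1 become \uXXXX escapes here, so the
    # 'unicode_escape' decoder (which reads bytes as Latin-1) restores them.
    raw = text.encode('latin-1', 'backslashreplace')
    return raw.decode('unicode_escape')


print(len('a\\nb'))                              # 4
print(len(decode_escapes('a\\nb')))              # 3
print(len(decode_escapes('\N{euro sign}\\nB')))  # 3
```

Note that `unicode_escape` also understands `\uXXXX` and `\N{...}` escapes, so it is not an exact match for what `string_escape` accepted.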


Compare this with the naive approach of decoding character escapes by
writing your own scanner in pure Python. For example:

def decode(s):
    output = []
    iterator = iter(s)
    for c in iterator:
        if c == '\\':
            pass  # ...enter your state machine and decode...
        else:
            output.append(c)
    return ''.join(output)
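
The scanner outline above leaves the escape handling open. One way it might be filled in is sketched below, assuming only a handful of single-character escapes need support. The table `SIMPLE_ESCAPES` and the name `decode_scanner` are hypothetical, and the choice to pass unknown escapes through unchanged is an assumption rather than something the recipe specifies.

```python
# Hypothetical escape table: only a few single-character escapes.
SIMPLE_ESCAPES = {
    'n': '\n',
    't': '\t',
    'r': '\r',
    '\\': '\\',
    "'": "'",
    '"': '"',
}


def decode_scanner(s):
    """Decode a small set of backslash escapes, one character at a time."""
    output = []
    iterator = iter(s)
    for c in iterator:
        if c != '\\':
            output.append(c)
            continue
        # A backslash was seen: consume the next character, if there is one.
        nxt = next(iterator, None)
        if nxt is None:
            # Trailing lone backslash: keep it as-is.
            output.append('\\')
        elif nxt in SIMPLE_ESCAPES:
            output.append(SIMPLE_ESCAPES[nxt])
        else:
            # Unknown escape: leave both characters untouched.
            output.append('\\' + nxt)
    return ''.join(output)


print(len(decode_scanner('a\\nb')))  # 3
```

Octal, hex, and Unicode escapes are deliberately left out; supporting them is where the state machine grows.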

or

def decode(s):
    return (s
        .replace('\\n', '\n')
        .replace('\\t', '\t')
        # ...and so on for the few escapes supported...
    )

The naive approaches are expected to be much slower than the built-in codec.
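
Whether that expectation holds is easy to check with `timeit`. The sketch below assumes Python 3, so it uses the `unicode_escape` codec as a stand-in for `string_escape`. The function names and the sample string are hypothetical, and the replace chain covers only the two escapes shown above. Timings will vary by interpreter and input, so none are quoted here.

```python
import codecs
import timeit


def decode_replace(s):
    # Chained-replace approach, limited to the two escapes shown above.
    return s.replace('\\n', '\n').replace('\\t', '\t')


def decode_codec(s):
    # Built-in codec; 'unicode_escape' stands in for Python 2's 'string_escape'.
    return codecs.decode(s, 'unicode_escape')


# Hypothetical ASCII-only sample containing just \t and \n escapes.
sample = 'col1\\tcol2\\n' * 1000

# Both approaches should agree on this input before being timed.
assert decode_replace(sample) == decode_codec(sample)

for fn in (decode_codec, decode_replace):
    seconds = timeit.timeit(lambda: fn(sample), number=200)
    print('%-15s %.4f s' % (fn.__name__, seconds))
```

The comparison is only fair on input that both functions handle identically. For example, an escaped backslash followed by `n` would be decoded differently by the replace chain.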

History

revision 2 (18 years ago)