Python 3.4 decode bytes

Question

I am trying to write a file in python, and I can't find a way to decode a byte object before writing the file, basically, I am trying to decode this bytes string:

Les \xc3\x83\xc2\xa9vad\xc3\x83\xc2\xa9s

into this, which is the original text I'm trying to recover:

Les évadés

I tried using the .decode('utf-8') and encode('utf-8') but nothing seems to work...

I always get Les Ã©vadÃ©s as a result... I am using python 3.4.3

Anyone can help?

Python3 uses utf8 as default encoding. From where are you getting that string? — Bhargav Rao
– Bhargav Rao, Commented Jun 9, 2015 at 18:25
What you're showing is utf-8 being interpreted as if it were latin-1. My guess is that Python is producing the correct output, but whatever you're printing it with is set to expect latin-1 rather than utf-8. — Peter DeGlopper
– Peter DeGlopper, Commented Jun 9, 2015 at 18:33
I'm guessing you're on Windows? Then don't use UTF-8, Windows uses different encodings by default. — Mark Ransom
– Mark Ransom, Commented Jun 9, 2015 at 21:08

James Pringle · Accepted Answer · 2015-06-09 18:52:59Z

And if you want a Python 3 solution:

b = b'Les \xc3\x83\xc2\xa9vad\xc3\x83\xc2\xa9s' u = b.decode('utf-8').encode('latin-1').decode('utf-8') print(u) # Les évadés

holdenweb · Accepted Answer · 2015-06-09 19:08:15Z

-1

What you need to do is to decode and then encode:

s = "Les \xc3\x83\xc2\xa9vad\xc3\x83\xc2\xa9s" utf = s.decode('utf-8') latin = utf.encode("latin-1","ignore") print latin

--> Les évadés

edited Jun 9, 2015 at 19:08

holdenweb

37.8k7 gold badges62 silver badges80 bronze badges

answered Jun 9, 2015 at 18:48

E.T

1,1452 gold badges10 silver badges19 bronze badges

1 Comment

ShadowRanger Over a year ago

s is a str, so you can't decode it on Python 3, nor is encodeing the proper final step to convert to str. And error handler of "ignore" is basically saying "throw my data on the floor, I don't care". I can see why someone down-voted. You wrote incorrect Python 2 code, that can't even run on Python 3.

Collectives™ on Stack Overflow

Python 3.4 decode bytes

2 Answers 2

Comments

1 Comment

Linked

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Comments

1 Comment

Linked

Related