I have a string that contains Unicode characters and I want to convert it to UTF-8 in Python.
s = '\u0628\u06cc\u0633\u06a9\u0648\u06cc\u062a'
I want to convert s to UTF-8.
Add the u prefix to the string s, then encode it as UTF-8.
Your code will look like this:
s = u'\u0628\u06cc\u0633\u06a9\u0648\u06cc\u062a'
s_encoded = s.encode('utf-8')
print(s_encoded)
I hope this helps.
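To make it clearer what .encode('utf-8') actually returns, here is a minimal sketch, assuming Python 3. The name s_decoded is introduced only for illustration and does not appear in the answer itself.

```python
# Assumes Python 3, where str holds Unicode code points and bytes holds raw bytes.
s = '\u0628\u06cc\u0633\u06a9\u0648\u06cc\u062a'

s_encoded = s.encode('utf-8')           # str -> bytes (UTF-8)
s_decoded = s_encoded.decode('utf-8')   # bytes -> str (hypothetical name, for illustration)

print(type(s_encoded).__name__)   # bytes
print(len(s), len(s_encoded))     # 7 14 (each of these characters takes 2 bytes in UTF-8)
print(s_decoded == s)             # True
```

Printing s_encoded shows a bytes literal full of \x escapes rather than readable text; that is expected, because it is the encoded form and not the string.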
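The u prefix isn't necessary in Python 3, where every string literal is already Unicode; in Python 2 it is required for the \u escapes to be interpreted. But the .encode('utf8') is definitely right.
Add the line below at the top of your .py file.
# -*- coding: utf-8 -*-
It declares the encoding of the source file, which allows you to write non-ASCII characters directly in your Python script. In Python 3 the default source encoding is already UTF-8, so the line is optional there. For example:
# -*- coding: utf-8 -*-
s = '\u0628\u06cc\u0633\u06a9\u0648\u06cc\u062a'
print(s)
Output: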
بیسکویت
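As a rough illustration of what the coding line is for, the sketch below (assuming Python 3 and a source file saved as UTF-8) compares the same word written with literal characters and with \u escapes. The names s_literal and s_escaped are made up for this example.

```python
# -*- coding: utf-8 -*-
# Assumes this file is saved as UTF-8; the variable names are hypothetical.
s_literal = 'بیسکویت'                                      # characters typed directly in the source
s_escaped = '\u0628\u06cc\u0633\u06a9\u0648\u06cc\u062a'   # the same characters written as escapes

print(s_literal == s_escaped)
print(s_literal.encode('utf-8') == s_escaped.encode('utf-8'))
```

The escaped form is pure ASCII in the source file, so it works regardless of the declared source encoding; only the literal form depends on it.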