In order to clean some string, I have to remove some substring that contains some special UTF-8 characters.
example:
source = "Skoda" to_be_clean = "Škoda Rapid" I need to replace from to_be_clean the string source by nothing. Obviously, the to_be_clean string contains some special character. Is there a way to do this task simply. Here is how I am doing it today.
output = to_be_clean.replace(source + ' ', '') I was thinking about a regular expression but I need to list all the possible characters.
"Škoda"equal to"Skoda"so that you can then remove it? There are many questions about removing accents from Unicode; have you googled those?