I am having problem with decoding byte string that I have to send from one computer to another. File is format PDF. I get error that goes:
fileStrings[i] = fileStrings[i].decode() UnicodeDecodeError: 'utf-8' codec can't decode byte 0xda in position 648: invalid continuation byte Any ideas of how to remove b' ' marking? I need to compile file back up, but i also need to know its size in bytes before sending it and I figured I will know it by decoding each byte string (Works for txt files but not for pdf ones..)
Code is:
with open(inputne, "rb") as file: while 1: readBytes= file.read(dataMaxSize) fileStrings.append(readBytes) if not readBytes: break readBytes= '' filesize=0 for i in range(0, len(fileStrings)): fileStrings[i] = fileStrings[i].decode() filesize += len(fileStrings[i]) Edit: For anyone having same issue, parameter len() will give you size without b''.
∞is one symbol but 3 bytes:b'\xe2\x88\x9e', or 8 bytes in UTF32.