658 questions
2 votes
1 answer
92 views
vb6 : split unicode text file to smaller files of maximum specified size
I have a big file of text, lines by lines ended with a chr(13) in Windows. I want to store the text in the access mdb file memo field which has a max size limit of 64K only. The text file contain ...
0 votes
1 answer
59 views
What makes FileRead wrongly decode Unicode characters?
In file.txt I have a content with Unicode characters encoding in UTF-8. In script.ahk I FileRead it: #Requires AutoHotkey v2.0+ content := FileRead("file.txt"), "`n UTF-8" :*:a:: {...
0 votes
1 answer
76 views
Handling strings with high Unicode codepoints (above U+FFFF)
In Kotlin, how can I iterate over a string that contains Unicode characters above U+FFFF? Example code: val s = "Hëllø! € 😀" for (c in s) { println("$c ${c.code}") } Actual ...
0 votes
1 answer
120 views
Will php mb_strlen($str,‘utf8’) ever return a greater result than JavaScript .length?
I'm working on an Angular 17 reactive form where I send the form data to a PHP API on the server and store it in a database. I would like the user to be able to input emojis to the form so I have set ...
0 votes
1 answer
339 views
Is it safe to convert emails with other characters than a-Z to lower case?
Any modern email service provider treats emails as case insensitive meaning that in my application I should allows users to log in both using [email protected] and [email protected]. In terms of the ...
1 vote
1 answer
213 views
Convert raw string (having escape characters) to unicode/utf8 string [duplicate]
In Python 3, how to convert an ASCII raw-string (that includes escape characters) into a proper unicode string? As an example: a = "ä" # note the umlaut b = bytearray(...
1 vote
1 answer
505 views
How to add accents to letters in java
I am trying to combine alphabetical characters with accents in java. For example: Combining the letter "e" (\u0065) with a combing grave accent (\u0300). I have attempted numerous ways in ...
0 votes
1 answer
168 views
convert byte array to strings split by NUL character
I am sorry, if this is much of a dumb question. But I can't really figure this out, and I bet it has to be much simpler than I think. I have a byte[] array which contains several Unicode Strings, each ...