Linked Questions
84 questions linked to/from Why does modern Perl avoid UTF-8 by default?
4 votes
0 answers
232 views
Enabling non-ASCII characters in Perl [duplicate]
Possible Duplicate: Why does modern Perl avoid UTF-8 by default? I have a problem with an order script written in Perl, which is used to generate an order/email based on user input. The problem is ...
136 votes
6 answers
145k views
How can I output UTF-8 from Perl?
I am trying to write a Perl script using the utf8 pragma, and I'm getting unexpected results. I'm using Mac OS X 10.5 (Leopard), and I'm editing with TextMate. All of my settings for both my editor ...
15 votes
8 answers
2k views
Possible problems with String reversing using charAt method
I saw a comment here that all solutions with charAt are wrong. I could not quite understand it or find anything about charAt on the internet. Looking at the source code, it just returns an element from the ...
22 votes
8 answers
13k views
Match letter in any language
How can I match a letter from any language using a regex in Python 3? re.match([a-zA-Z]) will match the English-language characters, but I want all languages to be supported simultaneously. I don't ...
10 votes
6 answers
60k views
Java - Regex for Full Name
How can I validate a full name with a regex? I only want letters (no digits) and spaces for the regex. This is what I have done so far. Would you please help me fix the regex? Thank you very ...
16 votes
4 answers
21k views
perl: Uncaught exception: malformed UTF-8 character in JSON string
Related to this question and this answer (to another question) I am still unable to process UTF-8 with JSON. I have tried to make sure all the required voodoo is invoked based on recommendations from ...
10 votes
2 answers
11k views
Perl regular expression matching on large Unicode code points
I am trying to replace various characters with either a single quote or double quote. Here is my test file: # Replace all with double quotes " fullwidth “ left ” right „ low " normal # Replace all ...
6 votes
6 answers
5k views
Can I depend on the behavior of charCodeAt() and fromCharCode() to remain the same?
I have written a personal web app that uses charCodeAt() to convert text that is input by the user into the relevant character codes (for example ⊇ is converted to 8839 for storage), which is then ...
11 votes
3 answers
12k views
Regex for matching accent characters
Aim: I want to separate words to count their frequency in a document and then do some calculations on those frequencies. The words can begin/contain/end with any of the following: numbers alphabets (...
7 votes
6 answers
2k views
How well does your language support unicode in practice?
I'm looking into new languages, kind of craving for one where I no longer need to worry about charset problems amongst inordinate amounts of other niggles I have with PHP for a new project. I tend to ...
24 votes
2 answers
4k views
How to match string with diacritic in perl?
For example, match "Nation" in "Îñţérñåţîöñåļîžåţîöñ" without extra modules. Is it possible in new Perl versions (5.14, 5.15 etc)? I found an answer! Thanks to tchrist. Right solution with UCA ...
17 votes
2 answers
1k views
When do you need to `END { close STDOUT}` in Perl?
In tchrist's boilerplate I found this explicit closing of STDOUT in the END block: END { close STDOUT }. I know END and close, but I'm missing why it is needed. When I started searching about it, ...
23 votes
3 answers
2k views
How to make Mason2 UTF-8 clean?
Reformulating the question, because @optional told me it wasn't clear and linked an HTML::Mason-based solution, Four easy steps to make Mason UTF-8 Unicode clean with Apache, mod_perl, and DBI, what ...
10 votes
2 answers
3k views
Perl Unicode internals - mess with utf8
Before anyone tells me to RTFM, I must say that I have dug through: Why does modern Perl avoid UTF-8 by default? Checklist for going the Unicode way with Perl How to match string with diacritic ...
9 votes
3 answers
714 views
Perl: what's included in use 5.##.#?
I've seen several posts that state use 5.12.0; in Perl enables certain features/pragmas by default (e.g., use strict;). Another example is in UTF-8 and perl where it is stated that use 5.14.0; is ...