10 questions
2 votes
1 answer
135 views
Sort strings, treating hyphen, slash, and space as equal, using UCA collation
Problem I'm using Saxon-EE 11 and my platform's language is en-us. I'm attempting to implement custom sorting behavior for an <xsl:sort> instruction by specifying a UCA collation. Ignoring the ...
0 votes
0 answers
135 views
Adding a UCA Collation to a Unicode Character Set, why it is doesn't work?
In Unicode Locale Data Markup Language(LDML), since version 24, the element and its sub-elements is deprecated. But the MySQL example still uses deprecated element. The collation defined when I ...
0 votes
1 answer
140 views
How to make some punctuation characters indexable in MySQL FULLTEXT indexed field
I have a fulltext indexed field with charset utf8mb4 on MySQL 8.0 I need to be able to search for queries like "km/h" or "A-B" but with the current charset definition, slash and ...
1 vote
1 answer
90 views
icu (uca) support for frisian collation
In frisian the y is and i and sorts just after it, see http://download.mimer.com/pub/developer/charts/frisian.htm. I try to sort data using xquery processor saxonica using frisian language code, or ...
1 vote
0 answers
122 views
Custom MySQL Collation Not Working
My goal is to sort a few numbers the same as a handful of characters. ie: 4 sorts the same as A or a 3 sorts the same as E or e Why isn't this working? I've added the following to /usr/share/...
0 votes
0 answers
511 views
Implementing sample code for unicode collation algorithm
I have the following requirement in my project. I need to sort strings based on order of the characters provided by the client. For example: Order provided by the user:d,a,A,D,z,p,P,Z So if we have ...
3 votes
3 answers
3k views
What is the theory behind unicode collation sorting
What is the theory behind unicode sorting? I understand how it works, but I don't understand why they decided on this standard for collation sorting. It seems that when you have two strings to ...
4 votes
1 answer
531 views
Invert Unicode String Collation Keys
I'm have an index which stores text strings for search, both in their original form and their collated form (Collated form is used for searching the index, Original is displayed in the results). The ...
6 votes
1 answer
245 views
How does the handling of combining characters in the Unicode Collation Algorithm work?
I maintain an open-source, pure-Python implementation of the Unicode Collation Algorithm called pyuca. While it meets my needs in sorting Ancient Greek text (and seems to meet the needs of many other ...
3 votes
1 answer
894 views
Is there Unicode Collation Algorithm (UCA) code for Delphi?
Collation under the Unicode Technical Standard #10 (UCA), which is a separate thing from being Unicode Compliant, in case you were wondering about that, implies not only ordering/sorting but also ...