简而言之,如果您的应用程序是多语言的并且在同一个表中存储多种语言,那么您大部分都是搞砸了,并且应该担心在数据库之外进行排序/整理 – utf8_general_ci就像其他任何一样好.
如果它只支持单一语言,你可以通过在数据库级别设置正确的排序规则来做得很好 – 在你的情况下,确实是utf8_danish_ci,因为如果维基百科是任何东西,那么它与挪威语相同.
[H]ere are some of the ways languages vary in ordering strings:
The letters A-Z can be sorted in a different order than in English.
For example,in Lithuanian,“y” is sorted between “i” and “k”.
Combinations of letters can be treated as if they were one letter. For
example,in traditional Spanish “ch” is treated as a single letter,
and sorted between “c” and “d”.
Accented letters can be treated as minor variants of the unaccented
letter. For example,“é” can be treated equivalent to “e”.
Accented letters can be treated as distinct letters. For example,“?”
in Danish is treated as a separate letter that sorts just after “Z”.
Unaccented letters that are considered distinct in one language can be
indistinct in another. For example,the letters “v” and “w” are two
different letters according to English. However,“v” and “w” are
considered variant forms of the same letter in Swedish.
A letter can be treated as if it were two letters. For example,in
traditional German “?” is compared as if it were “ae”.
Thai requires that the order of certain letters be reversed.
French requires that letters sorted with accents at the end of the
string be sorted ahead of accents in the beginning of the string. For
example,the word “c?te” sorts before “coté” because the acute accent
on the final “e” is more significant than the circumflex on the “o”.
Sometimes lowercase letters sort before uppercase letters. The reverse
is required in other situations. For example,lowercase letters are
usually sorted before uppercase letters in English. Latvian letters
are the exact opposite.
Even in the same language,different applications might require
different sorting orders. For example,in German dictionaries,“?f”
would come before “of”. In phone books the situation is the exact
opposite.
Sorting orders can change over time due to government regulations or new characters/scripts in Unicode.