Magnus Hagander wrote:
>>>The following bug has been logged online:
>>>
>>>Bug reference: 2317
>>>Logged by: TomasKlockar
>>>Email address: tomask@omicron.se
>>>PostgreSQL version: 7.3.2/7.4.7
>>>Operating system: linux(fedora)
>>>Description: Wrong sorting order for (VW)
>>>Details:
>>>
>>>SELECT cname FROM clients ORDER BY cname;
>>>
>>>sorts W before V and that is a mistake.
>>>
>>>the result is
>>>
>>>V
>>>W
>>>WHI
>>>Vi
>>>Wi
>>>Volvo
>>>
>>>Correct order would be to place all vV before all wW.
>>>
>>>I think the database was initialized with UTF8 but it might
>>>
>>>
>>have been
>>
>>
>>>ISO_8859-1, however the error is easily repeatable and ill
>>>
>>>
>>be happy to
>>
>>
>>>provide a dump.
>>>
>>>I tested this on 7.3.2 and 7.4.7 and the error was repeatable.
>>>
>>>
>>I think the most important thing we'd need to know is what
>>locale the database was initialized with in order to try to reproduce.
>>
>>If you put similar data in a file and use the unix "sort"
>>command with the same locale, do you get the same order?
>>
>>
>
>That looks like the typical swedish locale, which sorts V and W as the
>same character. Yes, that can be very annoying for some of us :-), but
>that's the way it's defined.
>
>//Magnus
>
>
I would say that sorting V and W as the same character may work in a
dictionary or sometimes when sorting names, however in that case why
don't it sort C and K as the same character? and in some cases C and S
as the same.
The best sorting algorithm should sort all characters as separate
characters, and if you want fancy sorting you should need to turn it on
for a table.
Fancy sorting, like sorting Carlsson and Karlsson together since they
are equal in the same way as Viktor=Wiktor=Victor=Wictor. Cesar and
Sesar would also have to be sorted together since they are pronounced
the same in swedish, and if you look in a phonebook you would find them
at the same place.
Now I get the english words was and vas sorted together when I need them
separated.
Currently the sortingfunction in postgreSQL have disqualified itself
from beeing used, and I do the sorting in java which treat them as
separate caracters.
This is my 5 cent,
/Tomas