Re: BUG #6457: Regexp not processing word (with special characters on ends) correctly (UTF-8)

Поиск
Список
Период
Сортировка
От Duncan Rance
Тема Re: BUG #6457: Regexp not processing word (with special characters on ends) correctly (UTF-8)
Дата
Msg-id 4EFDC1C4-F0C7-42A3-859C-6E98867CC614@dunquino.com
обсуждение исходный текст
Ответ на Re: BUG #6457: Regexp not processing word (with special characters on ends) correctly (UTF-8)  (Tom Lane <tgl@sss.pgh.pa.us>)
Список pgsql-bugs
On 14 Feb 2012, at 18:28, Tom Lane wrote:
>
> Oh, I see the reason for this: the code in cclass() in regc_locale.c
> doesn't go further up than U+00FF, so no codes above that will be
> thought to be letters (or members of any other character class).
> Clearly we need to go further when we are dealing with UTF8.
> I'm not sure what a sane limit would be though.


The Basic Multilingual Plane goes up to FFFF:

https://en.wikipedia.org/wiki/Mapping_of_Unicode_characters#Planes

В списке pgsql-bugs по дате отправления:

Предыдущее
От: Duncan Rance
Дата:
Сообщение: Re: BUG #6457: Regexp not processing word (with special characters on ends) correctly (UTF-8)
Следующее
От: Félix GERZAGUET
Дата:
Сообщение: Re: BUG #6452: psql: can't change client encoding from the command line