Обсуждение: BUG #3525: Lithuanian characters doesn't work in queries with regular expressions

Поиск
Список
Период
Сортировка

BUG #3525: Lithuanian characters doesn't work in queries with regular expressions

От
"Rolandas Rudomanskis"
Дата:
The following bug has been logged online:

Bug reference:      3525
Logged by:          Rolandas Rudomanskis
Email address:      rolandasr@gmail.com
PostgreSQL version: 8.2.3
Operating system:   Debian Etch
Description:        Lithuanian characters doesn't work in queries with
regular expressions
Details:

The database is set to unicode. Debian locale is utf-8.

All lithuanian characters characters, except ė, ų, namely,
ą, č, ę, į, š, ū, ž in non-case sensitive queries specifying word
beginning, are ignored. SQL example:

select * from table1 where title ~* '\\m" + mytext + "';"

Re: BUG #3525: Lithuanian characters doesn't work in queries with regular expressions

От
Tom Lane
Дата:
"Rolandas Rudomanskis" <rolandasr@gmail.com> writes:
> Description:        Lithuanian characters doesn't work in queries with
> regular expressions

> The database is set to unicode. Debian locale is utf-8.

Yeah, the regex code doesn't handle localization issues (such as
case-insensitive matching) correctly for multibyte characters.
Can you use a single-byte encoding?  Alternatively, I believe
case-insensitive LIKE matching works for this.

            regards, tom lane