Обсуждение: Czech character broken?

Поиск
Список
Период
Сортировка

Czech character broken?

От
Magnus Hagander
Дата:
On our page at http://www.postgresql.org/community/international we have the following:
Informace o PostgreSQL v češtině

My validator complains that š is an invalid numeric entity in
UTF-8. Can someone who knows what it should be let us know what the
correct entity is?

//Magnus


Re: Czech character broken?

От
Michael Fuhr
Дата:
On Tue, Feb 13, 2007 at 10:06:47AM +0100, Magnus Hagander wrote:
> On our page at http://www.postgresql.org/community/international we have the following:
> Informace o PostgreSQL v češtině
> 
> My validator complains that š is an invalid numeric entity in
> UTF-8. Can someone who knows what it should be let us know what the
> correct entity is?

I don't know for sure since I don't speak Czech, but I'd guess the
correct entity is š <U+0161 LATIN SMALL LETTER S WITH CARON>.
The Windows-1250 mapping of that character is 154 (0x9a).

-- 
Michael Fuhr


Re: Czech character broken?

От
Magnus Hagander
Дата:
Michael Fuhr wrote:
> On Tue, Feb 13, 2007 at 10:06:47AM +0100, Magnus Hagander wrote:
>> On our page at http://www.postgresql.org/community/international we have the following:
>> Informace o PostgreSQL v češtině
>>
>> My validator complains that š is an invalid numeric entity in
>> UTF-8. Can someone who knows what it should be let us know what the
>> correct entity is?
> 
> I don't know for sure since I don't speak Czech, but I'd guess the
> correct entity is š <U+0161 LATIN SMALL LETTER S WITH CARON>.
> The Windows-1250 mapping of that character is 154 (0x9a).

Seems to be correct - at least my firefox on Windows still shows the
same character after fixing that.

Thanks.


//Magnus


Re: Czech character broken?

От
Michael Fuhr
Дата:
On Sun, Feb 18, 2007 at 06:30:14PM +0100, Magnus Hagander wrote:
> Michael Fuhr wrote:
> > I don't know for sure since I don't speak Czech, but I'd guess the
> > correct entity is š <U+0161 LATIN SMALL LETTER S WITH CARON>.
> > The Windows-1250 mapping of that character is 154 (0x9a).
> 
> Seems to be correct - at least my firefox on Windows still shows the
> same character after fixing that.

That character is 154 in Windows-1252 as well.  I'd guess that some
browsers, upon finding a C1 control character (128-159 / 0x80-0x9f),
assume that it's really a graphical character from one of the Windows
encodings and convert it appropriately.

-- 
Michael Fuhr