Re: [PATCH] json_lex_string: don't overread on bad UTF8

Поиск

Список

Период

Сортировка

От	Michael Paquier
Тема	Re: [PATCH] json_lex_string: don't overread on bad UTF8
Дата	2 мая 06:39:40
Msg-id	ZjMK_N0VokrEe1Ws@paquier.xyz обсуждение исходный текст
Ответ на	Re: [PATCH] json_lex_string: don't overread on bad UTF8 (Michael Paquier <michael@paquier.xyz>)
Ответы	Re: [PATCH] json_lex_string: don't overread on bad UTF8
Список	pgsql-hackers

Дерево обсуждения

On Thu, May 02, 2024 at 11:23:13AM +0900, Michael Paquier wrote:
> About the fact that we may finish by printing unfinished UTF-8
> sequences, I'd be curious to hear your thoughts.  Now, the information
> provided about the partial byte sequences can be also useful for
> debugging on top of having the error code, no?

By the way, as long as I have that in mind..  I am not sure that it is
worth spending cycles in detecting the unfinished sequences and make
these printable.  Wouldn't it be enough for more cases to adjust
token_error() to truncate the byte sequences we cannot print?

Another thing that I think would be nice would be to calculate the
location of what we're parsing on a given line, and provide that in
the error context.  That would not be backpatchable as it requires a
change in JsonLexContext, unfortunately, but it would help in making
more sense with an error if the incomplete byte sequence is at the
beginning of a token or after an expected character.
--
Michael

Вложения

signature.asc

В списке pgsql-hackers по дате отправления:

Предыдущее

От: Michael Paquier
Дата: 02 мая, 05:23:13
Сообщение: Re: [PATCH] json_lex_string: don't overread on bad UTF8

Следующее

От: Kashif Zeeshan
Дата: 02 мая, 06:42:41
Сообщение: Re: Document NULL

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: [PATCH] json_lex_string: don't overread on bad UTF8

Вложения

Предыдущее

Следующее