Re: could not read block XXXXX in file "base/YYYYY/ZZZZZZ": read only 160 of 8192 bytes

Поиск
Список
Период
Сортировка
От Kevin Grittner
Тема Re: could not read block XXXXX in file "base/YYYYY/ZZZZZZ": read only 160 of 8192 bytes
Дата
Msg-id 4DFB32F4020000250003E7E2@gw.wicourts.gov
обсуждение исходный текст
Ответ на Re: could not read block XXXXX in file "base/YYYYY/ZZZZZZ": read only 160 of 8192 bytes  (Антон Степаненко <zlobnynigga@yandex.ru>)
Ответы Re: could not read block XXXXX in file "base/YYYYY/ZZZZZZ": read only 160 of 8192 bytes  (Merlin Moncure <mmoncure@gmail.com>)
Список pgsql-bugs
***** **********<zlobnynigga@yandex.ru> wrote:
> 17.06.2011, 00:28, "Kevin Grittner" <Kevin.Grittner@wicourts.gov>:
>> ***** **********<zlobnynigga@yandex.ru>; wrote:
>>
>>>  [4-1] 2011-06-16 17:40:27 UTC LOG:  startup process (PID 15292)
>>>  was terminated by signal 7: Bus error
>>>  Signal 7 means  hardware problems. But all 10 replicas crashed
>>>  within 10 minutes, say from 13:35 to 13:45.
>>>  One important thing - all replicas and master are running on
>>>  openvz

>> On the face of it, the most likely cause would seem to be
>> hardware or the virtual environment.

> I noticed that crash takes place when shared buffers are almost
> full, i.e. SELECT SUM(size)  FROM adm.buffercache() returns 11670
> at about one minute before crash. Furthermore, last night I set
> buffers  to 11Gb, at it is working, no crash, all buffers are used
> (11120).

Well then, in a pinch you could always fall back to using what
works.

> I still do not believe that this is hardware problem.

How would an application cause a bus error?

> Each replica and master runs on dedicated server, no hardware is
> shared.

OK.  If they had been on the same blade chassis or something I would
have suspected hardware.

> There is only postgresql on each server, no any other
> software(just crond, zabbix, atop). Actually openvz is used only
> for portability(easily add new replicas or migrate one of them to
> new server).

Still, it emulates hardware, so you have to consider it a suspect
for any hardware problem -- at least if you want to solve that
problem.

> Master did not crash

Ah, that wasn't clear from the earlier post.  I'm not sure how
significant it is, but it's good to know.

> I think because it processes less SELECT queries, therefore his
> buffers do not reach limit.

In your shoes I would now be trying to construct a test program to
exercise progressively larger allocations of shared memory, and test
them both under openvz and without it.  Well, first I would probably
try loading the master with queries to drive it to use the full
shared_buffers space, *then* move on to the test program.

The relevant question here is why others can successfully use large
shared_buffers settings while you can't.  Something is different in
your environment.  What?

-Kevin

В списке pgsql-bugs по дате отправления:

Предыдущее
От: Tom Lane
Дата:
Сообщение: Re: Ident authentication fails due to bind error on server (8.4.8)
Следующее
От: Merlin Moncure
Дата:
Сообщение: Re: could not read block XXXXX in file "base/YYYYY/ZZZZZZ": read only 160 of 8192 bytes