Re: Optimising "in" queries

Поиск

Список

Период

Сортировка

От	Tom Lane
Тема	Re: Optimising "in" queries
Дата	24 августа 2007 г. 02:19:47
Msg-id	16318.1187921980@sss.pgh.pa.us обсуждение исходный текст
Ответ на	Re: Optimising "in" queries (Alvaro Herrera <alvherre@commandprompt.com>)
Список	pgsql-performance

Дерево обсуждения

Alvaro Herrera <alvherre@commandprompt.com> writes:
> Stephen Davies wrote:
>> While superficially equivalent, I have always believed that IN (a,b,c)
>> executed faster than =a or =b or =c. Am I wrong for PostgreSQL?

> Older versions of Postgres translated IN (a, b, c) into an OR'ed list of
> equalities.  Nowadays it is treated as an array; I think it's translated
> to = ANY ({a,b,c}), as you can see in the message you posted at the
> start of this thread.

If you're dealing with tables large enough that the index search work is
the dominant cost, all these variants ought to be exactly the same.
However, for smaller tables the planning time and executor startup time
are interesting, and on those measures the = ANY(array) formulation
should win because there's less "stuff" for the system to look at.
With "x=a OR x=b OR x=c" the planner actually has to deduce three times
that an indexscan on x is possible; with "x = ANY(ARRAY[a,b,c])" it
does that only once.  That's why I changed IN to expand to an array
construct instead of an OR tree.  I have to confess not having tried to
measure the consequences carefully, though.  I suspect it's not all
that interesting at only three items ... it's lists of hundreds or
thousands of items where this becomes a big deal.

            regards, tom lane

В списке pgsql-performance по дате отправления:

Предыдущее

От: "Steven Flatt"
Дата: 23 августа 2007 г., 22:25:31
Сообщение: Re: When/if to Reindex

Следующее

От: Tom Lane
Дата: 24 августа 2007 г., 03:59:21
Сообщение: Re: When/if to Reindex

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Re: Optimising "in" queries

Предыдущее

Следующее