TSearch2 vs. Apache Lucene

Поиск
Список
Период
Сортировка
От Joshua Kramer
Тема TSearch2 vs. Apache Lucene
Дата
Msg-id Pine.LNX.4.63.0512061143170.5133@localhost.localdomain
обсуждение исходный текст
Ответы Re: TSearch2 vs. Apache Lucene  (Michael Riess <mlriess@gmx.de>)
Re: TSearch2 vs. Apache Lucene  (Russell Garrett <russ@garrett.co.uk>)
Список pgsql-performance
Greetings all,

I'm going to do a performance comparison with DocMgr and PG81/TSearch2 on
one end, and Apache Lucene on the other end.

In order to do this, I'm going to create a derivative of the
docmgr-autoimport script so that I can specify one file to import at a
time.  I'll then create a Perl script which logs all details (such as
timing, etc.) as the test progresses.

As test data, I have approximately 9,000 text files from Project Gutenberg
ranging in size from a few hundred bytes to 4.5M.

I plan to test the speed of import of each file.  Then, I plan to write a
web-robot in Perl that will test the speed and number of results returned.

Can anyone think of a validation of this test, or how I should configure
PG to maximise import and search speed?  Can I maximise search speed and
import speed, or are those things mutually exclusive?  (Note that this
will be run on limited hardware - 900MHz Athlon with 512M of ram)

Has anyone ever compared TSearch2 to Lucene, as far as performance is
concerned?

Thanks,
-Josh

В списке pgsql-performance по дате отправления:

Предыдущее
От: Pailloncy Jean-Gerard
Дата:
Сообщение: Re: 8.1 count(*) distinct: IndexScan/SeqScan
Следующее
От: Tom Lane
Дата:
Сообщение: Re: Missed index opportunity for outer join?