Обсуждение: TSearch2 ... ignore word repetition for rank

Поиск

Список

Период

Сортировка

TSearch2 ... ignore word repetition for rank

От

Chris Gamache

Дата:

09 января 2004 г., 16:37:30

For my particular case, word repetition shouldn't be relevant in determining
the rank of a document. If I strip() the vector, I loose what relevance
proximity and weight add to the rank. It seems impossible, yet I ask anyway: Is
it possible to eliminate the second (third, fourth, fifth, etc.) occurrence of
any given word when its presence in the document is being scored, yet kept in
the equation for modifications to the score when proximity is being considered?

CG


__________________________________
Do you Yahoo!?
Yahoo! Hotjobs: Enter the "Signing Bonus" Sweepstakes
http://hotjobs.sweepstakes.yahoo.com/signingbonus

Re: TSearch2 ... ignore word repetition for rank

От

Teodor Sigaev

Дата:

09 января 2004 г., 16:55:39


Chris Gamache wrote:
> For my particular case, word repetition shouldn't be relevant in determining
> the rank of a document. If I strip() the vector, I loose what relevance
> proximity and weight add to the rank. It seems impossible, yet I ask anyway: Is
> it possible to eliminate the second (third, fourth, fifth, etc.) occurrence of
> any given word when its presence in the document is being scored, yet kept in
> the equation for modifications to the score when proximity is being considered?

I don't see the way except modify strip or rank functions...
--
Teodor Sigaev                                  E-mail: teodor@sigaev.ru

Вход в личный кабинет

Восстановление пароля

Подтверждение аккаунта

Изменение пароля

Обсуждение: TSearch2 ... ignore word repetition for rank

TSearch2 ... ignore word repetition for rank

Re: TSearch2 ... ignore word repetition for rank