List Info

Thread: RE: Phrase Query Performance Question and score threshold




RE: Phrase Query Performance Question and score threshold
country flaguser name
United States
2007-11-05 13:41:07
Hoss,
 
If I limit the documents returned based on a score threshold
(filter by score) will it be able to improve query
performance? My intuition is it won't be able to because you
will still have to calculate the score and then compare to
the threshold.
 
I know it may not be meaningful to do so based on the
following explanation
http://wiki.apache.
org/jakarta-lucene/LuceneFAQ#head-912c1f237bb002591853531829
48e5935f0c2f03
 
But it might work for me because the documents are not
"natural language" but constructed following
certain rules. If I really want to try this can you offer
advice on the best way to implement score threshold in SOLR
with minimum overhead? 
 
Appreciate if anyone can help
 
Thank you
Haishan
 
 
 



> Date: Fri, 2 Nov 2007 12:31:29 -0700> From:
hossman_lucenefucit.org> To: solr-userlucene.apache.org> Subject: Re: Phrase Query
Performance Question> > > : It still feels to me
that you are trying doing something unique with your> :
phrase queries. Unfortunately, you still haven't said what
you are trying to> : do in general terms, which makes it
very difficult for people to help you.> > Agreed. This
seems very special case, but we dont' know what the case
is.> > If there are specific phrases you know in
advance that you will care > about, and those phrases
occur as frequetnly as the individual >
"words", then the best way to deal with them is to
index each "phrase" as > a single Term (and
ignore the individual words)> > Speaking more
generally to mike's point...> > http
://people.apache.org/~hossman/#xyproblem> Your
question appears to be an "XY Problem" ... that
is: you are dealing> with "X", you are assuming
"Y" will help you, and you are asking about
"Y"> without giving more details about the
"X" so that we can understand the> full issue.
Perhaps the best solution doesn't involve "Y" at
all?> See Also: 
http://www.perlmonks.org/index.pl?node_id=542341>
> > > > > -Hoss> 
____________________________________________________________
_____
Windows Live Hotmail and Microsoft Office Outlook – together
at last.  Get it now.
http://office.microsoft.com/en-u
s/outlook/HA102225181033.aspx?pid=CL100626971033
Re: Phrase Query Performance Question and score threshold
user name
2007-11-05 13:55:21
On 11/5/07, Haishan Chen <haishanmsn.com> wrote:
> If I limit the documents returned based on a score
threshold (filter by score) will it be able to improve query
performance?

No.

Taking a different approach can really speed up queries
though.
To figure out what approach you should take, we need to know
what you
are trying to do.
As Hoss said: http://p
eople.apache.org/~hossman/#xyproblem


How many different phrase queries are you having performance
issues with?

-Yonik

[1-2]

about | contact  Other archives ( Real Estate discussion Medical topics )