Thread: Search Particulars




Search Particulars
user name
2006-02-23 18:51:17
	I'm not sure I understand what you're getting at.  In this case
I've added a comma-separated list of names of meta tags that I want to
index and search against.  I've written a parse filter, an index filter
and this query filter that all read in that list of meta tags from the
nutch-site.xml file.

	That much seems to work.  In the explain link I can see that the
fields are in the index and the ranking of pages is affected by them,
but if I search for a term which is in one of the meta tags, but not in
any other fields, I get 0 results.

Thanks,

Jake.

-----Original Message-----
From: Raghavendra Prabhu [mailto:rrprabhugmail.com] 
Sent: Thursday, February 23, 2006 1:19 PM
To: nutch-userlucene.apache.org
Subject: Re: Search Particulars

Okay, I am new to this topic.

But do you add the meta tags to a particular field?

If so, shouldn't that field also appear in the field path?

Maybe stock Nutch does not look at that field at all? Maybe this is
the reason?

Unless you give the metadata field and search for the keyword.

Rgds
Prabhu



Search Particulars
user name
2006-02-23 19:03:31
Let's say you have meta1,meta2 in the meta tags list in nutch-site.xml.

The code you have written attempts to find this metadata (either meta1
or meta2) in the Nutch fields.

When you index data, the index filter probably writes the metadata it
gets into some field. Do you write the metadata you gather into a
separate field?

For example, define a field called METADATAFIELD and store it there.

Then the query should be extended by a query-METADATAFIELD filter.

Try this out. Search as
METADATAFIELD:meta1 (this should fetch you some results)

If this works, the content is there in the field. (You can check it
this way to see whether your implementation is right.)

The BasicQueryFilter, I guess, looks in only four different fields:
URL, title, content and anchor.

I was wondering whether you should also add this field to the
BasicQueryFilter fields.

I do not know whether I am pointing you in the right direction, but
this is my view.

Hope this helps

Rgds
Prabhu
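
The check suggested above (search the custom field by name, then compare with a plain search) can be illustrated with a small toy model. This is only a sketch in plain Java and does not use the Nutch or Lucene API: the class name, the sample documents and the whole-word matching are assumptions made for illustration, and the list of default fields is simply the guess from the message above.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Toy model only: plain Java collections, not the Nutch or Lucene API.
class FieldScopedSearchDemo {

    // Default fields as guessed in the message above; the real list may differ.
    static final List<String> DEFAULT_FIELDS =
            Arrays.asList("url", "title", "content", "anchor");

    // True if the named field exists and contains the term as a whole word.
    static boolean fieldContains(Map<String, String> doc, String field, String term) {
        String value = doc.get(field);
        if (value == null) {
            return false;
        }
        List<String> words = Arrays.asList(value.toLowerCase(Locale.ROOT).split("\\W+"));
        return words.contains(term.toLowerCase(Locale.ROOT));
    }

    // Accepts "term" (searches the default fields) or "field:term" (that field only).
    static List<Integer> search(List<Map<String, String>> docs, String query) {
        int colon = query.indexOf(':');
        List<String> fields = colon < 0
                ? DEFAULT_FIELDS
                : Arrays.asList(query.substring(0, colon));
        String term = colon < 0 ? query : query.substring(colon + 1);

        List<Integer> hits = new ArrayList<>();
        for (int i = 0; i < docs.size(); i++) {
            for (String field : fields) {
                if (fieldContains(docs.get(i), field, term)) {
                    hits.add(i);
                    break; // one matching field is enough for this document
                }
            }
        }
        return hits;
    }

    // Two hypothetical documents; only the first has the custom field.
    static List<Map<String, String>> sampleDocs() {
        Map<String, String> first = new HashMap<>();
        first.put("url", "http://example.org/garden");
        first.put("title", "Gardening tips");
        first.put("content", "How to grow tomatoes");
        first.put("METADATAFIELD", "perennials");

        Map<String, String> second = new HashMap<>();
        second.put("url", "http://example.org/news");
        second.put("title", "Local news");
        second.put("content", "Nothing about plants here");

        return Arrays.asList(first, second);
    }

    public static void main(String[] args) {
        List<Map<String, String>> docs = sampleDocs();
        System.out.println(search(docs, "perennials"));
        System.out.println(search(docs, "METADATAFIELD:perennials"));
    }
}
```

In this toy, the plain query finds nothing because the term lives only in the custom field, while the field-qualified query returns the first document. If the real index behaves the same way, the data is in the field and the remaining problem is on the query side.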


Search Particulars
user name
2006-02-23 22:09:03
Vanderdray, Jacob wrote:
> 	I'm not sure I understand what you're getting at.  In this case
> I've added a comma-separated list of names of meta tags that I want to
> index and search against.  I've written a parse filter, an index filter
> and this query filter that all read in that list of meta tags from the
> nutch-site.xml file.
>
> 	That much seems to work.  In the explain link I can see that the
> fields are in the index and the ranking of pages is affected by them,
> but if I search for a term which is in one of the meta tags, but not in
> any other fields, I get 0 results.

Are you using RawFieldQueryFilter?  If so, are you specifying a non-zero
boost to the constructor?  RawFieldQueryFilter defaults to a zero boost.
Query terms with a zero boost are automatically converted into filters.
And filters cannot select documents, only remove them.

Doug
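
The effect described in this reply can be mimicked with a small stand-alone model. It is a sketch under stated assumptions and not Nutch code: the `Clause` class, the exact-value matching and the sample documents are hypothetical, and the only behaviour modelled is the one stated above, namely that a clause with zero boost acts as a filter and a filter can remove documents but never select them.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Toy model only: it imitates the behaviour described above and is not Nutch code.
class ZeroBoostDemo {

    // One query clause: an exact value expected in one field, with a boost.
    static final class Clause {
        final String field;
        final String value;
        final float boost;

        Clause(String field, String value, float boost) {
            this.field = field;
            this.value = value;
            this.boost = boost;
        }

        boolean matches(Map<String, String> doc) {
            return value.equals(doc.get(field));
        }
    }

    // Clauses with boost > 0 select documents. Clauses with boost == 0 are
    // treated as filters: they only remove documents that fail to match.
    static List<Integer> search(List<Map<String, String>> docs, List<Clause> clauses) {
        List<Integer> hits = new ArrayList<>();
        for (int i = 0; i < docs.size(); i++) {
            boolean selected = false;
            boolean removed = false;
            for (Clause clause : clauses) {
                boolean match = clause.matches(docs.get(i));
                if (clause.boost > 0f) {
                    selected = selected || match;
                } else {
                    removed = removed || !match;
                }
            }
            if (selected && !removed) {
                hits.add(i);
            }
        }
        return hits;
    }

    // Two hypothetical documents; only the first carries the meta field.
    static List<Map<String, String>> sampleDocs() {
        Map<String, String> first = new HashMap<>();
        first.put("content", "tomatoes");
        first.put("METADATAFIELD", "perennials");

        Map<String, String> second = new HashMap<>();
        second.put("content", "tomatoes");

        return Arrays.asList(first, second);
    }

    public static void main(String[] args) {
        List<Map<String, String>> docs = sampleDocs();
        // Zero boost: the clause is only a filter, so nothing is selected.
        System.out.println(search(docs,
                Arrays.asList(new Clause("METADATAFIELD", "perennials", 0f))));
        // Non-zero boost: the same clause now selects the first document.
        System.out.println(search(docs,
                Arrays.asList(new Clause("METADATAFIELD", "perennials", 1f))));
    }
}
```

With the zero-boost clause alone the model returns no documents, which matches the "0 results" symptom. Giving the same clause a non-zero boost makes it select the first document.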