List Info

Thread: search a subdirectory (New to Lucene)




search a subdirectory (New to Lucene)
user name
2006-02-22 18:00:09
I'm new to Lucene and was wondering what is the best way to
perform a search on a subdirectory or subdirectories within
the index?  My thought at this point is to build a query to
first search for files in the required directory(ies) and
then use that query to make a QueryFilter and use that
QueryFilter in the actual search.  Is there an easier way?
 
On an unrelated note, does anybody know of a way to get
results a the section level within a document?  For example,
could I find not just a document that matches my query, but
the paragraph within that document that best matches the
query?
 
thanks,
 
John
search a subdirectory (New to Lucene)
user name
2006-02-22 21:18:10
I presume by saying "subdirectory" you're
referring to filesystem  
directories and you're indexing a directory tree of files. 
 If you  
index the path (perhaps relative from the root is best) as a
keyword  
field (untokenized, but indexed) you could perform filtering
on a / 
path/subpath sort of way using PrefixQuery.

As for paragraphs - how you index a "document"
is entirely  
application dependent.  Maybe it makes sense to parse the
documents  
before handing them to Lucene such that you're creating a
Lucene  
Document for each paragraph rather than for each entire
file.   
Slicing the granularity of a domain into Documents is a
fascinating  
topic 

	Erik


On Feb 22, 2006, at 1:00 PM, John Hamilton wrote:

> I'm new to Lucene and was wondering what is the best
way to perform  
> a search on a subdirectory or subdirectories within the
index?  My  
> thought at this point is to build a query to first
search for files  
> in the required directory(ies) and then use that query
to make a  
> QueryFilter and use that QueryFilter in the actual
search.  Is  
> there an easier way?
>
> On an unrelated note, does anybody know of a way to get
results a  
> the section level within a document?  For example,
could I find not  
> just a document that matches my query, but the
paragraph within  
> that document that best matches the query?
>
> thanks,
>
> John


------------------------------------------------------------
---------
To unsubscribe, e-mail: java-user-unsubscribelucene.apache.org
For additional commands, e-mail: java-user-helplucene.apache.org

[1-2]

about | contact  Other archives ( Real Estate discussion Medical topics )