I presume by saying "subdirectory" you're
referring to filesystem
directories and you're indexing a directory tree of files.
If you
index the path (perhaps relative from the root is best) as a
keyword
field (untokenized, but indexed) you could perform filtering
on a /
path/subpath sort of way using PrefixQuery.
As for paragraphs - how you index a "document"
is entirely
application dependent. Maybe it makes sense to parse the
documents
before handing them to Lucene such that you're creating a
Lucene
Document for each paragraph rather than for each entire
file.
Slicing the granularity of a domain into Documents is a
fascinating
topic
Erik
On Feb 22, 2006, at 1:00 PM, John Hamilton wrote:
> I'm new to Lucene and was wondering what is the best
way to perform
> a search on a subdirectory or subdirectories within the
index? My
> thought at this point is to build a query to first
search for files
> in the required directory(ies) and then use that query
to make a
> QueryFilter and use that QueryFilter in the actual
search. Is
> there an easier way?
>
> On an unrelated note, does anybody know of a way to get
results a
> the section level within a document? For example,
could I find not
> just a document that matches my query, but the
paragraph within
> that document that best matches the query?
>
> thanks,
>
> John
------------------------------------------------------------
---------
To unsubscribe, e-mail: java-user-unsubscribe lucene.apache.org
For additional commands, e-mail: java-user-help lucene.apache.org
|