List Info

Thread: searching repeated and untokenized fields




searching repeated and untokenized fields
user name
2006-05-02 01:46:33
On Mon, 1 May 2006, Alf Eaton wrote:

>>> Secondly, it doesn't seem to be possible (in
PyLucene 1.9.1) to search an 
>>> untokenized field using a term that contains
spaces. For a document that 
>>> has a creator "Doe J", the query
>>> creator:"Doe J"
>>> doesn't return any results, and
>>> creatoroe J
>>> doesn't match what it needs to.
>> 
>> Again, please send in code that reproduces the
problem. If you can make 
>> sure that what you're trying to do work in Java
Lucene, that's a plus.
>> Ideally, your sample code would be organized as
unit tests.
>
> Good idea to do the tests: I realised that
StandardAnalyzer was converting 
> the search terms to lowercase when used in QueryParser,
but not when adding 
> untokenized fields to the document using IndexWriter,
so the two weren't 
> matching. Fixed now, thanks (and it's presumably not a
PyLucene problem).

Yeah, I don't think the StandardAnalyzer tokenizes
untokenized fields.
This is definitely a question for java-userlucene.apache.org.

Andi..
_______________________________________________
pylucene-dev mailing list
pylucene-devosafoundation.org
http://lists.osafoundation.org/mailman/listinfo/pylu
cene-dev
[1]

about | contact  Other archives ( Real Estate discussion Medical topics )