On Oct 22, 2007, at 4:06 PM, Erik Hatcher wrote:
> On Oct 22, 2007, at 3:45 PM, kshadkhast wrote:
>> I have a problem searching for accented words with a wildcard, although I
>> have configured the schema using
>> <filter class="solr.ISOLatin1AccentFilterFactory"/>
>> in both the index and query parts.
>> It works for q=chrétien, which finds documents with "chretien", but
>> searching for q=chré* does not work, while q=chre* works fine.
>> Is this a bug, or am I doing something wrong?
>
> It's a bit tricky here. Lucene's QueryParser, the heart of
> Solr's query parsing, does not analyze wildcard query parts.
> Consider stemmed words, for example, to see why that is a problem. In
> this case it does make sense to run it through a filter that
> normalizes diacritics on characters, but unfortunately Solr doesn't
> support what you need at this point.
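
To make the stemming concern concrete, here is a small Python sketch
(not Lucene code). The toy_stem function and its suffix list are
hypothetical stand-ins for a real stemmer. The point is only that a
stemmer expects whole words, so running it over a wildcard fragment can
change the prefix in ways the user did not intend.

```python
def toy_stem(token):
    """Hypothetical stand-in for a real stemmer: strips a few English suffixes."""
    for suffix in ("ing", "es", "s"):
        if token.endswith(suffix) and len(token) > len(suffix) + 2:
            return token[: -len(suffix)]
    return token


def prefix_matches(prefix, indexed_terms):
    """Indexed terms that start with the given prefix, as a prefix query would find."""
    return {term for term in indexed_terms if term.startswith(prefix)}


# Index time: whole words go through the stemmer.
indexed = {toy_stem(word) for word in ["passing", "passport", "pasta"]}

# Query "pass*" with the prefix left alone (what QueryParser does):
unanalyzed = prefix_matches("pass", indexed)

# Query "pass*" with the fragment pushed through the stemmer:
# "pass" is cut down to "pas", so the query now also picks up "pasta".
stemmed = prefix_matches(toy_stem("pass"), indexed)
```

In this toy setup the unanalyzed prefix finds "pass" and "passport",
while the stemmed prefix also drags in "pasta".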
Further on this, QueryParser does have some settings specific to
wildcard queries, such as lowercasing the prefix part.
Perhaps this is a case that Solr could address with a third analyzer
configuration (it already has "query" and "index" differentiation)
that could be incorporated for wildcard queries. Thoughts on that?
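
As a rough illustration of that idea, here is a Python sketch, not Solr
or Lucene code. It assumes a hypothetical wildcard chain that applies
only character-level normalization (lowercasing and accent folding) to
the prefix and skips anything that needs a whole word. The fold_accents
helper relies on Unicode decomposition, which only approximates what
ISOLatin1AccentFilterFactory does, and all function names are made up
for the example.

```python
import unicodedata


def fold_accents(text):
    """Approximate accent folding: decompose, then drop combining marks."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def index_analyzer(token):
    # Stand-in for the "index" chain: lowercase plus accent folding.
    return fold_accents(token.lower())


def wildcard_analyzer(prefix):
    # Hypothetical third chain: character-level filters only,
    # so it is safe to apply to a word fragment.
    return fold_accents(prefix.lower())


def prefix_query(prefix, indexed_terms, analyze=None):
    """Return indexed terms starting with the prefix, optionally analyzing it first."""
    if analyze is not None:
        prefix = analyze(prefix)
    return sorted(term for term in indexed_terms if term.startswith(prefix))


indexed = {index_analyzer(w) for w in ["Chrétien", "chrétienne", "chrome"]}

# Prefix left unanalyzed: "chré" matches nothing, since the index holds "chretien".
no_hits = prefix_query("chré", indexed)

# Prefix run through the wildcard chain: folded to "chre", so it matches.
hits = prefix_query("chré", indexed, wildcard_analyzer)
```

Here no_hits is empty, while hits contains "chretien" and "chretienne",
which is the behaviour the original poster was expecting from q=chré*.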
Erik