Hello,

I'm pulling data into Lucene several times an hour, approaching a total
document count of ~2 million. Sometimes I pull in brand new data, other
times I replace an existing document with an updated copy. The number of
documents that I update in Lucene will pretty much never be more than a
thousand or so.

I have a Solr interface exposed to another part of our system, and it's
basically sitting on top of Lucene as a read-only view into the index.
I can perform updates and optimizations in Lucene and Solr will keep
searching just fine, but I've discovered that changes in Lucene are not
visible through Solr.

For example, say there is no matching document in Lucene for id=1000.
If I query Solr using id:1000, I correctly find 0 matches. But then I
import new data into Lucene, pulling in a new document where id=1000.
At that point, the query for id:1000 should find one match, but it
still returns 0. If I restart Solr, I can see the results just fine.
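
To make the check above repeatable, it could be scripted roughly like
this. This is only a sketch: the base URL http://localhost:8983/solr is
a placeholder, the helper names are made up for illustration, and it
assumes the Solr instance accepts wt=json on its /select handler.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Placeholder address; replace with wherever the Solr instance lives.
SOLR_BASE = "http://localhost:8983/solr"


def build_select_url(base, field, value):
    """Build a /select URL for a single field:value query (hypothetical helper)."""
    # rows=0 because only the match count matters, not the documents.
    params = urlencode({"q": f"{field}:{value}", "rows": 0, "wt": "json"})
    return f"{base}/select?{params}"


def num_found(body):
    """Extract the match count from a Solr JSON response body."""
    return json.loads(body)["response"]["numFound"]


def count_matches(base, field, value):
    """Query Solr and return how many documents match field:value."""
    with urlopen(build_select_url(base, field, value)) as resp:
        return num_found(resp.read().decode("utf-8"))
```

Calling count_matches(SOLR_BASE, "id", 1000) before and after the
Lucene import would show whether the count ever moves from 0 to 1
without a restart.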

I thought it might be a caching issue, but I have all of the cache
options disabled in solrconfig.xml and the problem persists. I also ran
a Lucene optimization while Solr was running, but that didn't fix it
either. If anyone has any suggestions for configuring / poking Solr
somehow so that it will see new changes in Lucene, please let me know.
cheers,
Kaan