Hi! This is the ezmlm program. I'm managing the
java-user lucene.apache.org mailing list.
I'm working for my owner, who can be reached
at java-user-owner lucene.apache.org.
Messages to you from the java-user mailing list seem to
have been bouncing. I've attached a copy of the first
bounce
message I received.
If this message bounces too, I will send you a probe. If the
probe bounces,
I will remove your address from the java-user mailing list,
without further notice.
I've kept a list of which messages from the java-user
mailing list have
bounced from your address.
Copies of these messages may be in the archive.
To retrieve a set of messages 123-145 (a maximum of 100 per
request),
send a short message to:
<java-user-get.123_145 lucene.apache.org>
To receive a subject and author list for the last 100 or so
messages,
send a short message to:
<java-user-index lucene.apache.org>
Here are the message numbers:
30250
30251
30252
--- Enclosed is a copy of the bounce message I received.
Return-Path: <>
Received: (qmail 89082 invoked by uid 99); 13 Sep 2007
18:46:20 -0000
Received: from nike.apache.org (HELO nike.apache.org)
(192.87.106.230)
by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 13 Sep
2007 11:46:20 -0700
X-ASF-Spam-Status: No, hits=-0.0 required=10.0
tests=SPF_HELO_PASS
X-Spam-Check-By: apache.org
Received: from [66.98.192.98] (HELO starfire.yahoo.com)
(66.98.192.98)
by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 13 Sep
2007 18:47:55 +0000
Received: by starfire.yahoo.com (Postfix)
id 0366A15090C; Thu, 13 Sep 2007 13:45:53 -0500 (CDT)
Date: Thu, 13 Sep 2007 13:45:53 -0500 (CDT)
From: MAILER-DAEMON yahoo.com (Mail Delivery System)
Subject: Undelivered Mail Returned to Sender
To: java-user-return-30250-bond=yahoo.com lucene.apache.org
Auto-Submitted: auto-replied
MIME-Version: 1.0
Content-Type: multipart/report;
report-type=delivery-status;
boundary="90F92208061.1189709153/starfire.yahoo.com&qu
ot;
Content-Transfer-Encoding: 8bit
Message-Id: <20070913184553.0366A15090C starfire.yahoo.com>
X-Virus-Checked: Checked by ClamAV on apache.org
This is a MIME-encapsulated message.
--90F92208061.1189709153/starfire.yahoo.com
Content-Description: Notification
Content-Type: text/plain; charset=us-ascii
This is the mail system at host starfire.yahoo.com.
I'm sorry to have to inform you that your message could not
be delivered to one or more recipients. It's attached
below.
For further assistance, please send mail to postmaster.
If you do so, please include this problem report. You can
delete your own text from the attached returned message.
The mail system
<bond yahoo.com>: cannot update mailbox /var/mail/bond
for user bond. error
writing message: File too large
--90F92208061.1189709153/starfire.yahoo.com
Content-Description: Delivery report
Content-Type: message/delivery-status
Reporting-MTA: dns; starfire.yahoo.com
X-Postfix-Queue-ID: 90F92208061
X-Postfix-Sender: rfc822;
java-user-return-30250-bond=yahoo.com lucene.apache.org
Arrival-Date: Thu, 13 Sep 2007 13:45:52 -0500 (CDT)
Final-Recipient: rfc822; bond yahoo.com
Original-Recipient: rfc822;bond yahoo.com
Action: failed
Status: 5.2.2
Diagnostic-Code: x-unix; input/output error
--90F92208061.1189709153/starfire.yahoo.com
Content-Description: Undelivered Message
Content-Type: message/rfc822
Content-Transfer-Encoding: 8bit
X-Virus-Scanned: amavisd-new at yahoo.com
Received: from starfire.yahoo.com ([127.0.0.1])
by localhost (starfire.yahoo.com [127.0.0.1]) (amavisd-new,
port 10024)
with ESMTP id zluOz5xVsIEP for <bond yahoo.com>;
Thu, 13 Sep 2007 13:45:46 -0500 (CDT)
Received: from mail.apache.org (hermes.apache.org
[140.211.11.2])
by starfire.yahoo.com (Postfix) with SMTP id 95225150909
for <bond yahoo.com>; Thu, 13 Sep 2007 13:45:45
-0500 (CDT)
Received: (qmail 85441 invoked by uid 500); 13 Sep 2007
18:45:34 -0000
Mailing-List: contact java-user-help lucene.apache.org; run by
ezmlm
Precedence: bulk
List-Help: <mailto:java-user-help lucene.apache.org>
List-Unsubscribe: <mailto:java-user-unsubscribe lucene.apache.org>
List-Post: <mailto:java-user lucene.apache.org>
List-Id: <java-user.lucene.apache.org>
Reply-To: java-user lucene.apache.org
Delivered-To: mailing list java-user lucene.apache.org
Received: (qmail 85430 invoked by uid 99); 13 Sep 2007
18:45:34 -0000
Received: from athena.apache.org (HELO athena.apache.org)
(140.211.11.136)
by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 13 Sep
2007 11:45:34 -0700
X-ASF-Spam-Status: No, hits=2.0 required=10.0
tests=HTML_MESSAGE,SPF_PASS
X-Spam-Check-By: apache.org
Received-SPF: pass (athena.apache.org: domain of
erickerickson gmail.com designates 64.233.182.186 as
permitted sender)
Received: from [64.233.182.186] (HELO
nf-out-0910.google.com) (64.233.182.186)
by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 13 Sep
2007 18:45:29 +0000
Received: by nf-out-0910.google.com with SMTP id
d3so525985nfc
for <java-user lucene.apache.org>; Thu,
13 Sep 2007 11:45:07 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
d=gmail.com; s=beta;
h=domainkey-signature:received:received:message-id:date:from
:to:subject:in-reply-to:mime-version:content-type:references
;
bh=5YhaE70kLm8Dr8nSYNMimQ5zBNE+XccKr95DPFkXrvU=;
b=Xw3e2sAYaR9t+rbkuA+PMQ9yq4RRZwue5gp6u67r7Bgyj8la9CiEl4eGTZ
lt8lT87Kw4/iuoyvIxyEkBXvz/5bb9CrNu00nz1OeEDhQA/kxq9dK9is14XY
WclIS70lbMnM+y8e627a+vZbU2i80e7KkSlzCMvTRyHYtbMumz3Pc=
DomainKey-Signature: a=rsa-sha1; c=nofws;
d=gmail.com; s=beta;
h=received:message-id:date:from:to:subject:in-reply-to:mime-
version:content-type:references;
b=A/KmlKvE89SNkw6pgiBZPOPsaQ2jjglEUFDgG4yhOTWnq/yruZe6ARpcEK
CYYTq2jkpNrIq1kMeiQOs/9OUHTlBgoxuZcnAcr/GtVAPGxyaKUnYSC5/IgW
ymTimgl4THKlUejFFK2wIxX89w0oiSHv0pBgA/z5e0xF6Fzq458Hs=
Received: by 10.82.134.12 with SMTP id
h12mr1272628bud.1189709107391;
Thu, 13 Sep 2007 11:45:07 -0700 (PDT)
Received: by 10.82.190.14 with HTTP; Thu, 13 Sep 2007
11:45:07 -0700 (PDT)
Message-ID:
<359a92830709131145v5e5b43a0ha42576a546fb61dc mail.gmail.com>
Date: Thu, 13 Sep 2007 14:45:07 -0400
From: "Erick Erickson" <erickerickson gmail.com>
To: java-user lucene.apache.org
Subject: Re: regarding FieldSelector
In-Reply-To:
<34b8543c0709130714j465b1f44j71f904928ad7b2a4 mail.gmail.com>
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----=_Part_22295_1438996.1189709107363"
References:
<34b8543c0709120213m619a117aj1f988334890a4cd9 mail.gmail.com>
<6495BFAA-0337-4830-B672-AFCEFDD3A9F9 apache.org>
<34b8543c0709120440h4b5f1ecau397ed25d16b35a74 mail.gmail.com>
<359a92830709120653y162100fdv97aa5585350781d7 mail.gmail.com>
<34b8543c0709130150o2bf746beg995cfb00a6a97f68 mail.gmail.com>
<359a92830709130701m6a67d299k7f43936ee55c86f5 mail.gmail.com>
<34b8543c0709130714j465b1f44j71f904928ad7b2a4 mail.gmail.com>
X-Virus-Checked: Checked by ClamAV on apache.org
------=_Part_22295_1438996.1189709107363
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: 7bit
Content-Disposition: inline
I'm not entirely sure. So what I'd do if I were you is write
a
little test program and step through it in the debugger and
see <G>....
But, if you're only allowing the user to fetch a single
document
at a time, I don't think it matters enough to worry about.
If, on the
other hand, you're allowing the user to display some
combination
of, say, 5 fields for a *list* of documents, I'd make them
all lazy
and then you can write a HitCollector to get the list
"lazily".
Best
Erick
On 9/13/07, Mohammad Norouzi <mnrz57 gmail.com> wrote:
>
> well, actually, I have 5 index directory and it will
increase in future.
> and
> the thing is that each document about 20 fields on
average. considering
> many
> users may connect to the system (we anticipate 500
users at this time) I
> want to know whether this will make performance issue
or not.
>
> we provided a feature to select which fields they want
to be displayed so
> I
> know that only 5 or 6 fields are important to my users.
I don't know the
> way
> I stated in my last email, I mean
searcher.doc(doc_id).get("field_name"),
> make the Lucene to load all fields of the document or
only the given name?
> if yes, I mean if all the fields are loaded I think
it's better to make
> them
> lazy.
>
> what do you suggest?
>
> thanks
>
>
> On 9/13/07, Erick Erickson <erickerickson gmail.com> wrote:
> >
> > Do you have any evidence that you're having a
performance issue? If
> > not, I'd just do the simple thing and ignore the
rest. The performance
> > issues I found were because I was spinning through
many, many
> > documents. If you're only worrying about one
document at a time,
> > it may not be an issue.
> >
> > If you *are* having performance issues, I'd
*strongly* recommend
> > that you measure to find out where the problem is
before trying
> > a solution. Otherwise you'll optimize code that
isn't the problem.
> >
> > Best
> > Erick
> >
> > On 9/13/07, Mohammad Norouzi <mnrz57 gmail.com> wrote:
> > >
> > > Thanks
> > > as I saw the documents, we can only use this
great field selector in
> > > IndexReader.document() method the problem is
I have a Searcher in my
> > > result
> > > set structure and when the client calls
getString("a_field_name") at
> > that
> > > time I invoke the
searcher.doc(current_doc_id).get("a_field_name),
> > > I already collected the result IDs. so in my
case, I can't use
> > > FieldSelector.
> > >
> > > Do I have to revise the way of retrieving
documents in my code?
> > >
> > >
> > >
> > > On 9/12/07, Erick Erickson
<erickerickson gmail.com> wrote:
> > > >
> > > > Well, it depends on what "improve
the search process" means
> > > > in your context <G>..
> > > >
> > > > But I had a case similar to yours that I
wrote up in the Wiki where
> > > > my search times improved about 10X by
using lazy loading. You
> > > > might want to read that entry here...
> > > >
> > > > http://wiki.apache.org/lucene-java/FieldSelectorPerfo
rmance
> > > >
> > > > Note the peculiar characteristics of my
data set, I really suspect
> > > > that a 10x improvement in retrieval
speed is atypical...
> > > >
> > > > As for when lazily-loaded fields
actually get loaded, I didn't
> really
> > > > have to explore it very fully, but a
short experiment should do it
> > > > for you.....
> > > >
> > > > Best
> > > > Erick
> > > >
> > > > On 9/12/07, Mohammad Norouzi
<mnrz57 gmail.com> wrote:
> > > > >
> > > > > Hi Grant,
> > > > > Really thanks for your nice
document about advanced Lucene. it was
> > > very
> > > > > useful for me.
> > > > >
> > > > > as I understand, we can set some
large fields to be lazily
> loading,
> > > now
> > > > my
> > > > > question is when it will be loaded?
it make sense when we call
> > > > > doc.get("field_name")
> > > > > it will load from the index, Am I
right?
> > > > >
> > > > > in my application, I've provided a
result set structure to
> navigate
> > > > > between
> > > > > results and documents and provide a
get(String fieldname) method
> > just
> > > > like
> > > > > java.sql.ResultSet.getString()
method, and also this result set
> > > > implements
> > > > > HitCollector in order to collect my
own ID rather than Lucene's
> > > document
> > > > > id,
> > > > > so I think I can set my field ID to
be loaded always and the other
> > > > fields
> > > > > to
> > > > > be lazily loading, Does this
improve the search process?
> > > > >
> > > > > again, thank you very much indeed.
> > > > >
> > > > >
> > > > > On 9/12/07, Grant Ingersoll
<gsingers apache.org> wrote:
> > > > > >
> > > > > > Hi Mohammad,
> > > > > >
> > > > > > The typical use cases are:
> > > > > > 1. You have several small
fields used in a results display and
> one
> > > or
> > > > > > two large fields (i.e. the
original document) and you don't want
> > to
> > > > > > pay the cost of loading the
large fields for results display
> > because
> > > > > > most of them won't be chosen.
When a result is chosen, the
> lazily
> > > > > > loaded field will be
retrieved.
> > > > > >
> > > > > > 2. You only want to load
certain fields, or the first field, or
> > you
> > > > > > just want to know the size of
a field.
> > > > > >
> > > > > > Basically, it gives you
control over how fields are loaded from
> > disk
> > > > > > in Lucene.
> > > > > >
> > > > > > See my ApacheCon Europe
presentation
> > http://cnlp.org/presen
tations/
> > > > > > slides/AdvancedLuceneEU.pdf
for a few slides (towards the end)
> on
> > > > > > FieldSelector.
> > > > > >
> > > > > > On Sep 12, 2007, at 5:13 AM,
Mohammad Norouzi wrote:
> > > > > >
> > > > > > > Hi all,
> > > > > > >
> > > > > > > Can anyone explain what
is the FieldSelector and the usage or
> > > > > > > benefits of
> > > > > > > this structure? I read
the javadocs but I can't get for what
> > goal
> > > > > > > it is
> > > > > > > provided in Lucene.
> > > > > > >
> > > > > > > Thanks in advance
> > > > > > >
> > > > > > > --
> > > > > > > Regards,
> > > > > > > Mohammad
> > > > > > >
--------------------------
> > > > > > > see my blog: http://brainable.blogs
pot.com/
> > > > > > > another in Persian: http://fekre-mot
efavet.blogspot.com/
> > > > > >
> > > > > > --------------------------
> > > > > > Grant Ingersoll
> > > > > > http://lucene.granti
ngersoll.com
> > > > > >
> > > > > > Lucene Helpful Hints:
> > > > > > http://wiki.apache.org/lucene-java/BasicsOfPerformance
> > > > > > http://w
iki.apache.org/lucene-java/LuceneFAQ
> > > > > >
> > > > > >
> > > > > >
> > > > > >
> > >
------------------------------------------------------------
---------
> > > > > > To unsubscribe, e-mail:
java-user-unsubscribe lucene.apache.org
> > > > > > For additional commands,
e-mail:
> java-user-help lucene.apache.org
> > > > > >
> > > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Regards,
> > > > > Mohammad
> > > > > --------------------------
> > > > > see my blog: http://brainable.blogs
pot.com/
> > > > > another in Persian: http://fekre-mot
efavet.blogspot.com/
> > > > >
> > > >
> > >
> > >
> > >
> > > --
> > > Regards,
> > > Mohammad
> > > --------------------------
> > > see my blog: http://brainable.blogs
pot.com/
> > > another in Persian: http://fekre-mot
efavet.blogspot.com/
> > >
> >
>
>
>
> --
> Regards,
> Mohammad
> --------------------------
> see my blog: http://brainable.blogs
pot.com/
> another in Persian: http://fekre-mot
efavet.blogspot.com/
>
------=_Part_22295_1438996.1189709107363--
--90F92208061.1189709153/starfire.yahoo.com--
|