List Info

Thread: ezmlm warning




ezmlm warning
user name
2006-05-23 10:37:51
Hi! This is the ezmlm program. I'm managing the
nutch-devlucene.apache.org mailing list.

I'm working for my owner, who can be reached
at nutch-dev-ownerlucene.apache.org.


Messages to you from the nutch-dev mailing list seem to
have been bouncing. I've attached a copy of the first
bounce
message I received.

If this message bounces too, I will send you a probe. If the
probe bounces,
I will remove your address from the nutch-dev mailing list,
without further notice.


I've kept a list of which messages from the nutch-dev
mailing list have 
bounced from your address.

Copies of these messages may be in the archive.
To retrieve a set of messages 123-145 (a maximum of 100 per
request),
send an empty message to:
   <nutch-dev-get.123_145lucene.apache.org>

To receive a subject and author list for the last 100 or so
messages,
send an empty message to:
   <nutch-dev-indexlucene.apache.org>

Here are the message numbers:

   4870

--- Enclosed is a copy of the bounce message I received.

Return-Path: <>
Received: (qmail 60297 invoked by uid 99); 11 May 2006
14:28:54 -0000
Received: from asf.osuosl.org (HELO asf.osuosl.org)
(140.211.166.49)
    by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 May
2006 07:28:54 -0700
X-ASF-Spam-Status: No, hits=0.0 required=10.0
	tests=
X-Spam-Check-By: apache.org
Received: from [66.98.192.98] (HELO starfire.yahoo.com)
(66.98.192.98)
    by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 May
2006 07:28:53 -0700
Received: by starfire.yahoo.com (Postfix)
	id 4BAB75602AA; Thu, 11 May 2006 09:23:42 -0500 (CDT)
Date: Thu, 11 May 2006 09:23:42 -0500 (CDT)
From: MAILER-DAEMONyahoo.com (Mail Delivery System)
Subject: Undelivered Mail Returned to Sender
To: nutch-dev-return-4870-bond=yahoo.comlucene.apache.org
MIME-Version: 1.0
Content-Type: multipart/report; report-type=delivery-status;
	boundary="5DB895602AF.1147357422/starfire.yahoo.com&
quot;
Message-Id: <20060511142342.4BAB75602AAstarfire.yahoo.com>
X-Virus-Checked: Checked by ClamAV on apache.org

This is a MIME-encapsulated message.

--5DB895602AF.1147357422/starfire.yahoo.com
Content-Description: Notification
Content-Type: text/plain

This is the Postfix program at host starfire.yahoo.com.

I'm sorry to have to inform you that your message could not
be delivered to one or more recipients. It's attached
below.

For further assistance, please send mail to
<postmaster>

If you do so, please include this problem report. You can
delete your own text from the attached returned message.

			The Postfix program

<bondyahoo.com>: Command time limit exceeded:
"/usr/local/sbin/dbmail-smtp"

--5DB895602AF.1147357422/starfire.yahoo.com
Content-Description: Delivery report
Content-Type: message/delivery-status

Reporting-MTA: dns; starfire.yahoo.com
X-Postfix-Queue-ID: 5DB895602AF
X-Postfix-Sender: rfc822;
nutch-dev-return-4870-bond=yahoo.comlucene.apache.org
Arrival-Date: Thu, 11 May 2006 08:22:08 -0500 (CDT)

Final-Recipient: rfc822; bondyahoo.com
Action: failed
Status: 5.0.0
Diagnostic-Code: X-Postfix; Command time limit exceeded:
    "/usr/local/sbin/dbmail-smtp"

--5DB895602AF.1147357422/starfire.yahoo.com
Content-Description: Undelivered Message
Content-Type: message/rfc822

Received: from starfire.yahoo.com ([127.0.0.1])
 by localhost (starfire.yahoo.com [127.0.0.1]) (amavisd-new,
port 10024)
 with ESMTP id 01385-07 for <bondyahoo.com>;
 Thu, 11 May 2006 08:22:02 -0500 (CDT)
Received: from mail.apache.org (hermes.apache.org
[209.237.227.199])
	by starfire.yahoo.com (Postfix) with SMTP id 73D7E5602A0
	for <bondyahoo.com>; Thu, 11 May 2006 08:22:02
-0500 (CDT)
Received: (qmail 35142 invoked by uid 500); 11 May 2006
13:21:55 -0000
Mailing-List: contact nutch-dev-helplucene.apache.org; run by
ezmlm
Precedence: bulk
List-Help: <mailto:nutch-dev-helplucene.apache.org>
List-Unsubscribe: <mailto:nutch-dev-unsubscribelucene.apache.org>
List-Post: <mailto:nutch-devlucene.apache.org>
List-Id: <nutch-dev.lucene.apache.org>
Reply-To: nutch-devlucene.apache.org
Delivered-To: mailing list nutch-devlucene.apache.org
Received: (qmail 34999 invoked by uid 500); 11 May 2006
13:21:54 -0000
Delivered-To: apmail-incubator-nutch-devincubator.apache.org
Received: (qmail 34876 invoked by uid 99); 11 May 2006
13:21:53 -0000
Received: from asf.osuosl.org (HELO asf.osuosl.org)
(140.211.166.49)
    by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 May
2006 06:21:53 -0700
X-ASF-Spam-Status: No, hits=0.0 required=10.0
	tests=
X-Spam-Check-By: apache.org
Received: from [209.237.227.198] (HELO brutus.apache.org)
(209.237.227.198)
    by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 11 May
2006 06:21:52 -0700
Received: from brutus (localhost [127.0.0.1])
	by brutus.apache.org (Postfix) with ESMTP id 56B02714293
	for <nutch-devincubator.apache.org>; Thu, 11 May 2006
13:21:07 +0000 (GMT)
Message-ID: <4407842.1147353667352.JavaMail.jirabrutus>
Date: Thu, 11 May 2006 13:21:07 +0000 (GMT+00:00)
From: "Andrzej Bialecki  (JIRA)" <jiraapache.org>
To: nutch-devincubator.apache.org
Subject: [jira] Commented: (NUTCH-267) Indexer doesn't
consider linkdb when
 calculating boost value
In-Reply-To: <18782617.1147139902532.JavaMail.jirabrutus>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
X-Virus-Checked: Checked by ClamAV on apache.org
X-Virus-Scanned: amavisd-new at yahoo.com

    [ http://issues.apache.org/jira/browse
/NUTCH-267?page=comments#action_12379072 ] 

Andrzej Bialecki  commented on NUTCH-267:
-----------------------------------------

Hmm, resetting the score to 0 is also dubious - it's as if
we didn't want it to be re-crawled if we can't find any
inlinks to it... I believe it should be reset to the
following value:

    newScore = initialScore - sum(distributedScoreM) +
sum(incomingScoreN)

where initialScore is the score we got from previous
iterations (or injectedScore), sum(distributedScoreM) is
what we have distributed to M outlinks from that page, and
sum(incomingScoreN) is what is contributed by N inlinks.
Current formula omits the sum(distributedScoreM); it also
doesn't provide any way to "sponsor" pages with
no incoming links so that they won't get broke (the concept
of "virtual nodes" I mentioned above).

Re: summing logs: yes, but then why use "sqrt(opic) *
docSimilarity" instead of "log(opic *
docSimilarity)"?

> Indexer doesn't consider linkdb when calculating boost
value
>
------------------------------------------------------------
>
>          Key: NUTCH-267
>          URL: http:/
/issues.apache.org/jira/browse/NUTCH-267
>      Project: Nutch
>         Type: Bug

>   Components: indexer
>     Versions: 0.8-dev
>     Reporter: Chris Schneider
>     Priority: Minor

>
> Before OPIC was implemented (Nutch 0.7, very early
Nutch 0.8-dev), if indexer.boost.by.link.count was true, the
indexer boost value was scaled based on the log of the # of
inbound links:
>     if (boostByLinkCount)
>       res *= (float)Math.log(Math.E + linkCount);
> This is no longer true (even before Andrzej implemented
scoring filters). Instead, the boost value is just the
square root (or some other scorePower) of the page score.
Shouldn't the invertlinks command, which creates the
linkdb, have some affect on the boost value calculated
during indexing (either via the OPICScoringFilter or some
other built-in filter)?

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the
administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa

-
For more information on JIRA, see:
   http://www.atl
assian.com/software/jira


--5DB895602AF.1147357422/starfire.yahoo.com--
[1]

about | contact  Other archives ( Real Estate discussion Medical topics )