Thread: Created: (NUTCH-318) log4j not proper configured, readdb doesnt give any information
|
|
| Created: (NUTCH-318) log4j not proper configured, readdb doesnt give any information |

|
2006-07-10 19:07:29 |
log4j not proper configured, readdb doesnt give any information
---------------------------------------------------------------
Key: NUTCH-318
URL: http://issues.apache.org/jira/browse/NUTCH-318
Project: Nutch
Type: Bug
Versions: 0.8-dev
Reporter: Stefan Groschupf
Priority: Critical
Fix For: 0.8-dev
In the latest 0.8 sources the readdb command no longer dumps
any information.
This is related to the misconfigured log4j.properties
file.
Changing:
log4j.rootLogger=INFO,DRFA
to:
log4j.rootLogger=INFO,DRFA,stdout
dumps the information to the console, but not in a nice way.
What makes me wonder is that this information should also
be in the log file, but it isn't, so there may be problems
there as well.
Also, what is the difference between
hadoop-XXX-jobtracker-XXX.out and
hadoop-XXX-jobtracker-XXX.log? Shouldn't there be just one
of them?
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the
administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira
|
|
| Updated: (NUTCH-318) log4j not proper configured, readdb doesnt give any information |

|
2006-07-25 19:40:23 |
[ http://issues.apache.org/jira/browse/NUTCH-318?page=all ]
Sami Siren updated NUTCH-318:
-----------------------------
Fix Version/s: 0.9-dev
(was: 0.8-dev)
|
|
| Commented: (NUTCH-318) log4j not proper configured, readdb doesnt give any information |

|
2006-07-25 20:16:15 |
[ http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423433 ]
Stefan Groschupf commented on NUTCH-318:
----------------------------------------
Shouldn't that be fixed in 0.8, since as of today this tool
simply produces no output?!
|
|
| Commented: (NUTCH-318) log4j not proper configured, readdb doesnt give any information |

|
2006-07-26 05:59:14 |
[ http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423531 ]
Sami Siren commented on NUTCH-318:
----------------------------------
Perhaps this is happening in a distributed setup? In a
single-machine setup the output goes to the log file; see NUTCH-315.
|
|
| Commented: (NUTCH-318) log4j not proper configured, readdb doesnt give any information |

|
2006-07-26 06:27:17 |
[ http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423539 ]
Stefan Groschupf commented on NUTCH-318:
----------------------------------------
Yes, this happens only in a distributed environment. Please
also see my last mail on the hadoop dev list. I think there
are more general logging problems that only occur in a
distributed environment, so you will not track them down
using the local runner.
|
|
| Commented: (NUTCH-318) log4j not proper configured, readdb doesnt give any information |

|
2006-07-26 06:43:14 |
[ http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423546 ]
Sami Siren commented on NUTCH-318:
----------------------------------
I agree, so the next thing to do is change readdb -stats to
print to stdout; I'll go ahead and do that. Have any other
commands been found that need to be changed in a similar way?
|
|
| Commented: (NUTCH-318) log4j not proper configured, readdb doesnt give any information |

|
2006-07-26 06:37:19 |
[ http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423542 ]
Andrzej Bialecki commented on NUTCH-318:
-----------------------------------------
I also think that producing no output on the console is
confusing to new users, especially in the "local" mode. It's
not immediately obvious where to look for the results of your
commands, especially commands like 'readdb -stats', which
users naturally expect to produce some output on the console.
|
|
| Commented: (NUTCH-318) log4j not proper configured, readdb doesnt give any information |

|
2006-07-26 07:13:14 |
[ http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423557 ]
Sami Siren commented on NUTCH-318:
----------------------------------
Could this be solved by just adding the following line to
conf/log4j.properties?
log4j.logger.org.apache.nutch.crawl.CrawlDbReader=INFO,stdout
For me it produces the following output to stdout:
bin/nutch readdb ../nutch-0.8-release/crawl/crawldb -stats
2006-07-26 10:09:28,839 INFO crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(210)) - CrawlDb statistics start: ../nutch-0.8-release/crawl/crawldb
2006-07-26 10:09:31,203 INFO crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(261)) - Statistics for CrawlDb: ../nutch-0.8-release/crawl/crawldb
2006-07-26 10:09:31,204 INFO crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(277)) - TOTAL urls: 60
2006-07-26 10:09:31,206 INFO crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(272)) - avg score: 1.015
2006-07-26 10:09:31,206 INFO crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(270)) - max score: 1.103
2006-07-26 10:09:31,208 INFO crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(268)) - min score: 1.012
2006-07-26 10:09:31,209 INFO crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(277)) - retry 0: 60
2006-07-26 10:09:31,209 INFO crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(276)) - status 1 (DB_unfetched): 59
2006-07-26 10:09:31,211 INFO crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(276)) - status 2 (DB_fetched): 1
2006-07-26 10:09:31,212 INFO crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(282)) - CrawlDb statistics: done
Of course it would look nicer if we also created a separate
format for such cases (perhaps removing some unnecessary
info).
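The per-logger line above could be paired with a dedicated console appender that uses a shorter pattern. The fragment below is only a sketch for log4j 1.x: the appender name `cmdstdout` and the `%m%n` pattern are assumptions chosen for illustration, not the configuration that was committed to Nutch.

```properties
# Hypothetical console appender for command-line tools
# (the name "cmdstdout" is an assumption, not taken from Nutch)
log4j.appender.cmdstdout=org.apache.log4j.ConsoleAppender
log4j.appender.cmdstdout.layout=org.apache.log4j.PatternLayout
# Print only the message and a newline, dropping the timestamp,
# level, and source-location prefix
log4j.appender.cmdstdout.layout.ConversionPattern=%m%n

# Route CrawlDbReader INFO messages to that appender. Logger additivity
# defaults to true, so the same messages still reach the root logger's
# appenders (for example DRFA) as well.
log4j.logger.org.apache.nutch.crawl.CrawlDbReader=INFO,cmdstdout
```

With a pattern like this, each statistics message would appear on the console without the date and `(CrawlDbReader.java:...)` prefix, while the log file keeps whatever format its own appender defines.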
|
|
| Commented: (NUTCH-318) log4j not proper configured, readdb doesnt give any information |

|
2006-07-26 06:58:15 |
[ http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423555 ]
Andrzej Bialecki commented on NUTCH-318:
-----------------------------------------
Ok, go ahead, it makes sense in this case. However, there
are many places where displaying INFO messages on the
console also makes sense, again - especially in the local
mode. Otherwise you have to start a new console, and do a
'tail -f logs/hadoop.log', which seems like an awkward and
complicated way to see if your command makes any progress or
is stuck.
So, I would vote for adding stdout for now. I know it
produces messy output, and a lot of it - perhaps we could
exclude some stuff from stdout, like plugin repository
loading, config parsing etc. But my feeling is that at least
some info should be displayed on the console.
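One way to add stdout while limiting the noise is to raise the level of the chattiest loggers. This is a sketch for log4j 1.x under stated assumptions: the `stdout` appender definition, its pattern, and the two logger names picked as "noisy" are illustrative guesses, and the `DRFA` appender is assumed to be defined elsewhere in the same file. Note that a logger-level setting like this suppresses those INFO messages in the log file too, not only on the console.

```properties
# Root logger writes to the rolling file appender and to the console
log4j.rootLogger=INFO,DRFA,stdout

# Assumed console appender definition
log4j.appender.stdout=org.apache.log4j.ConsoleAppender
log4j.appender.stdout.layout=org.apache.log4j.PatternLayout
log4j.appender.stdout.layout.ConversionPattern=%d{ISO8601} %-5p %c{2} - %m%n

# Assumed noisy loggers (plugin loading, config parsing): only WARN and
# above are logged for these, on every appender
log4j.logger.org.apache.nutch.plugin=WARN
log4j.logger.org.apache.hadoop.conf=WARN
```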
|
|
| Commented: (NUTCH-318) log4j not proper configured, readdb doesnt give any information |

|
2006-07-26 08:48:16 |
[ http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423579 ]
Sami Siren commented on NUTCH-318:
----------------------------------
I just committed some changes to the log4j configuration for
some command line tools to trunk. Is this a satisfactory
solution to this problem from Nutch's side?
http://svn.apache.org/viewvc/lucene/nutch/trunk/conf/log4j.properties?r1=416100&r2=425675&diff_format=h
|