Thread: Created: (NUTCH-318) log4j not proper configured, readdb doesnt give any information




Created: (NUTCH-318) log4j not proper configured, readdb doesnt give any information
user name
2006-07-10 19:07:29
log4j not proper configured, readdb doesnt give any information
---------------------------------------------------------------

         Key: NUTCH-318
         URL: http://issues.apache.org/jira/browse/NUTCH-318
     Project: Nutch
        Type: Bug

    Versions: 0.8-dev    
    Reporter: Stefan Groschupf
    Priority: Critical
     Fix For: 0.8-dev


In the latest 0.8 sources the readdb command no longer dumps
any information.
This is related to the misconfigured log4j.properties file.
Changing:
log4j.rootLogger=INFO,DRFA
to:
log4j.rootLogger=INFO,DRFA,stdout
dumps the information to the console, but not in a nice way.
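
For reference, a root logger that names `stdout` only works if an appender with that name is defined in the same file. The following is a minimal sketch, assuming the standard log4j 1.x ConsoleAppender and PatternLayout classes and assuming DRFA is already defined elsewhere in log4j.properties. The conversion pattern is an illustrative choice, not the one shipped with Nutch.

```properties
# Root logger: INFO and above go to the existing DRFA file appender
# and to a console appender named "stdout".
log4j.rootLogger=INFO,DRFA,stdout

# Console appender; ConsoleAppender writes to System.out by default.
log4j.appender.stdout=org.apache.log4j.ConsoleAppender
log4j.appender.stdout.layout=org.apache.log4j.PatternLayout
# Assumed pattern: timestamp, level, shortened logger name, message.
log4j.appender.stdout.layout.ConversionPattern=%d{ISO8601} %-5p %c{2} - %m%n
```

Because the appender hangs off the root logger, every INFO message from every class reaches the console, which would explain output that is complete but noisy.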


What makes me wonder is that this information should also be
in the log file, but it isn't, so there may be problems here
as well.
Also, what is the difference between
hadoop-XXX-jobtracker-XXX.out and
hadoop-XXX-jobtracker-XXX.log? Shouldn't there be just one
of them?


-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the
administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa

-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira

Updated: (NUTCH-318) log4j not proper configured, readdb doesnt give any information
user name
2006-07-25 19:40:23
     [ http://issues.apache.org/jira/browse/NUTCH-318?page=all ]

Sami Siren updated NUTCH-318:
-----------------------------

    Fix Version/s: 0.9-dev
                       (was: 0.8-dev)

Commented: (NUTCH-318) log4j not proper configured, readdb doesnt give any information
user name
2006-07-25 20:16:15
    [ http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423433 ]
            
Stefan Groschupf commented on NUTCH-318:
----------------------------------------

Shouldn't that be fixed in 0.8, since as of today this tool
just produces no output?!


Commented: (NUTCH-318) log4j not proper configured, readdb doesnt give any information
user name
2006-07-26 05:59:14
    [ http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423531 ]
            
Sami Siren commented on NUTCH-318:
----------------------------------

Perhaps this is happening only in a distributed setup? In a
single-machine setup the output goes to the log file; see
NUTCH-315.

Commented: (NUTCH-318) log4j not proper configured, readdb doesnt give any information
user name
2006-07-26 06:27:17
    [ http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423539 ]
            
Stefan Groschupf commented on NUTCH-318:
----------------------------------------

Yes, this happens only in a distributed environment. Please
also see my last mail on the hadoop dev list. I think there
are more general logging problems that occur only in a
distributed environment, so you will not track them down
using the local runner.

Commented: (NUTCH-318) log4j not proper configured, readdb doesnt give any information
user name
2006-07-26 06:43:14
    [ http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423546 ]
            
Sami Siren commented on NUTCH-318:
----------------------------------

I agree, so the next thing to do is to change readdb -stats
to print to stdout; I'll go ahead and do that. Have any other
commands been found that need to be changed in a similar way?

Commented: (NUTCH-318) log4j not proper configured, readdb doesnt give any information
user name
2006-07-26 06:37:19
    [ http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423542 ]
            
Andrzej Bialecki  commented on NUTCH-318:
-----------------------------------------

I think also that producing no output on the console is
confusing to new users, especially in the
"local" mode. It's not immediately obvious
where to look for results of your commands, especially 
commands like 'readdb -stats', which users naturally
expect to produce some output on the console.

Commented: (NUTCH-318) log4j not proper configured, readdb doesnt give any information
user name
2006-07-26 07:13:14
    [ http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423557 ]
            
Sami Siren commented on NUTCH-318:
----------------------------------

Could this be solved by just adding the following line to
conf/log4j.properties?

log4j.logger.org.apache.nutch.crawl.CrawlDbReader=INFO,stdout

For me it produces the following output on stdout:
bin/nutch readdb ../nutch-0.8-release/crawl/crawldb -stats
2006-07-26 10:09:28,839 INFO  crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(210)) - CrawlDb statistics start: ../nutch-0.8-release/crawl/crawldb
2006-07-26 10:09:31,203 INFO  crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(261)) - Statistics for CrawlDb: ../nutch-0.8-release/crawl/crawldb
2006-07-26 10:09:31,204 INFO  crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(277)) - TOTAL urls:      60
2006-07-26 10:09:31,206 INFO  crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(272)) - avg score: 1.015
2006-07-26 10:09:31,206 INFO  crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(270)) - max score: 1.103
2006-07-26 10:09:31,208 INFO  crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(268)) - min score: 1.012
2006-07-26 10:09:31,209 INFO  crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(277)) - retry 0:   60
2006-07-26 10:09:31,209 INFO  crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(276)) - status 1 (DB_unfetched):  59
2006-07-26 10:09:31,211 INFO  crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(276)) - status 2 (DB_fetched):    1
2006-07-26 10:09:31,212 INFO  crawl.CrawlDbReader (CrawlDbReader.java:processStatJob(282)) - CrawlDb statistics: done

Of course it would look nicer if we also created another
format for such cases (perhaps removing some unnecessary
info).
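
One way such a separate format might look is a second console appender with a message-only layout, attached just to the CrawlDbReader logger. This is a sketch under assumptions: the appender name `cmdstdout` is hypothetical and does not come from the thread or the Nutch sources, and the classes are the standard log4j 1.x ConsoleAppender and PatternLayout.

```properties
# Hypothetical appender "cmdstdout": console output without the
# timestamp, level, and source-location prefix.
log4j.appender.cmdstdout=org.apache.log4j.ConsoleAppender
log4j.appender.cmdstdout.layout=org.apache.log4j.PatternLayout
# %m is the message text, %n the platform line separator.
log4j.appender.cmdstdout.layout.ConversionPattern=%m%n

# Send CrawlDbReader messages at INFO and above to the plain console appender.
log4j.logger.org.apache.nutch.crawl.CrawlDbReader=INFO,cmdstdout
```

With log4j's default additivity, these messages would still be passed on to the root logger's appenders as well, so they would continue to appear in the log file with the full prefix.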


Commented: (NUTCH-318) log4j not proper configured, readdb doesnt give any information
user name
2006-07-26 06:58:15
    [ http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423555 ]
            
Andrzej Bialecki  commented on NUTCH-318:
-----------------------------------------

Ok, go ahead, it makes sense in this case. However, there
are many places where displaying INFO messages on the
console also makes sense, again - especially in the local
mode. Otherwise you have to start a new console, and do a
'tail -f logs/hadoop.log', which seems like an awkward and
complicated way to see if your command makes any progress or
is stuck.

So, I would vote for adding stdout for now. I know it
produces messy output, and a lot of it - perhaps we could
exclude some stuff from stdout, like plugin repository
loading, config parsing etc. But my feeling is that at least
some info should be displayed on the console.
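
Keeping some categories off the console while still logging them to the file could be sketched with log4j 1.x logger additivity. The logger names below (`org.apache.nutch.plugin`, `org.apache.hadoop.conf`) are assumptions about where the plugin-loading and config-parsing messages originate, and DRFA and stdout are assumed to be defined elsewhere in the file; this is an illustration, not the change that was committed.

```properties
# Root logger: everything at INFO and above goes to the file and the console.
log4j.rootLogger=INFO,DRFA,stdout

# Assumed noisy categories: attach the file appender directly and
# switch additivity off so events are not passed up to the root
# logger's appenders (which include stdout).
log4j.logger.org.apache.nutch.plugin=INFO,DRFA
log4j.additivity.org.apache.nutch.plugin=false

log4j.logger.org.apache.hadoop.conf=INFO,DRFA
log4j.additivity.org.apache.hadoop.conf=false
```

Attaching DRFA directly matters here: with additivity off, a logger that has no appender of its own would drop these messages from the log file too.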

Commented: (NUTCH-318) log4j not proper configured, readdb doesnt give any information
user name
2006-07-26 08:48:16
    [ http://issues.apache.org/jira/browse/NUTCH-318?page=comments#action_12423579 ]
            
Sami Siren commented on NUTCH-318:
----------------------------------

I just committed some changes to the log4j configuration for
some command-line tools to trunk. Is this a satisfactory
solution to this problem from Nutch's side?

http://svn.apache.org/viewvc/lucene/nutch/trunk/conf/log4j.properties?r1=416100&r2=425675&diff_format=h

