Hi John,
I had a similar problem on Hadoop 0.12.0, where my jobs were getting hung
in the reduce copy phase. I suspect it is a bug in 0.12.0 (I assume that
release is not stable yet), for which a patch has been released on the dev
forum (Devraj Das released the patch). I moved back to 0.11.2 to solve the
problem.
thanks,
gaurav
Ion Badita wrote:
>
> Hi,
>
> I have a 17-computer cluster running Hadoop 0.12.0 with JDK 6.
>
> I ran a job with 14 maps and 3 reducers. On the job's status page, the map
> phase is 100% completed with 0 failures. When I click the 0 link (for the
> map), it shows these stack traces:
>
> Map output lost, rescheduling: getMapOutput(task_0003_m_000009_0,2) failed :
> java.io.FileNotFoundException: /dfs/mapred/local/task_0003_m_000009_0/file.out
>     at org.apache.hadoop.fs.ChecksumFileSystem.open(ChecksumFileSystem.java:324)
>     at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:245)
>     at org.apache.hadoop.mapred.TaskTracker$MapOutputServlet.doGet(TaskTracker.java:1654)
>     at javax.servlet.http.HttpServlet.service(HttpServlet.java:689)
>     at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
>     at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
>     at org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
>     at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
>     at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
>     at org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
>     at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
>     at org.mortbay.http.HttpServer.service(HttpServer.java:954)
>     at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
>     at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
>     at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
>     at org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
>     at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
>     at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)
>
> -------
> Map output lost, rescheduling: getMapOutput(task_0003_m_000009_0,2) failed :
> java.io.FileNotFoundException: /dfs/mapred/local/task_0003_m_000009_0/file.out
>     at org.apache.hadoop.fs.ChecksumFileSystem.open(ChecksumFileSystem.java:324)
>     at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:245)
>     at org.apache.hadoop.mapred.TaskTracker$MapOutputServlet.doGet(TaskTracker.java:1654)
>     at javax.servlet.http.HttpServlet.service(HttpServlet.java:689)
>     at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
>     at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
>     at org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
>     at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
>     at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
>     at org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
>     at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
>     at org.mortbay.http.HttpServer.service(HttpServer.java:954)
>     at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
>     at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
>     at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
>     at org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
>     at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
>     at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)
>
> -------
>
>
>
> On the reduce task list, 2 reduces are finished and one seems to be hung
> in the copy phase with this status: reduce > copy (13 of 14 at 0.06 MB/s) >
>
> Any help will be appreciated!
> Thanks
> John.
>
>
>
>
--
View this message in context: http://www.nabble.com/Unreported-Exception-tf3380081.html#a9413104
Sent from the Hadoop Users mailing list archive at Nabble.com.