List Info

Thread: Created: (HADOOP-2052) distcp mapper's status report misleading




Created: (HADOOP-2052) distcp mapper's status report misleading
country flaguser name
United States
2007-10-13 12:38:50
distcp mapper's status report misleading
----------------------------------------

                 Key: HADOOP-2052
                 URL: htt
ps://issues.apache.org/jira/browse/HADOOP-2052
             Project: Hadoop
          Issue Type: Bug
            Reporter: Runping Qi



When the mappers of distcp finish, the status page in the
web gui reports the data copied.
However, the reported number is far away from the real
number, which is very misleading.
For example, a particular mapper reported: 


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue
online.


Updated: (HADOOP-2052) distcp mapper's status report misleading
country flaguser name
United States
2007-10-13 12:40:50
     [ https://issues.apache.org/jira/browse/HADOOP-2052?page=co
m.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Runping Qi updated HADOOP-2052:
-------------------------------

    Component/s: mapred
    Description: 
When the mappers of distcp finish, the status page in the
web gui reports the data copied.
However, the reported number is far away from the real
number, which is very misleading.
For example, a particular mapper reported: 

Finished. Bytes copied: 4.3g

However, the actual file size is about 9GB.



  was:

When the mappers of distcp finish, the status page in the
web gui reports the data copied.
However, the reported number is far away from the real
number, which is very misleading.
For example, a particular mapper reported: 



> distcp mapper's status report misleading
> ----------------------------------------
>
>                 Key: HADOOP-2052
>                 URL: htt
ps://issues.apache.org/jira/browse/HADOOP-2052
>             Project: Hadoop
>          Issue Type: Bug
>          Components: mapred
>            Reporter: Runping Qi
>
> When the mappers of distcp finish, the status page in
the web gui reports the data copied.
> However, the reported number is far away from the real
number, which is very misleading.
> For example, a particular mapper reported: 
> Finished. Bytes copied: 4.3g
> However, the actual file size is about 9GB.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue
online.


Updated: (HADOOP-2052) distcp mapper's status report misleading
country flaguser name
United States
2007-10-13 12:51:50
     [ https://issues.apache.org/jira/browse/HADOOP-2052?page=co
m.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Runping Qi updated HADOOP-2052:
-------------------------------

    Description: 
When the mappers of distcp finish, the status page in the
web gui reports the data copied.
However, the reported number is far away from the real
number, which is very misleading.
For example, a particular mapper
task_200710131713_0001_m_000000_0  reported: 

Finished. Bytes copied: 4.3g

However, it does not say which file.
I thought it was for part-00000. But the file size of
part-00000
is about 9GB.

It will be much clearer if the status report  say something
like:

Finished copy file-xxxx: 4.3g
That way, I can easily check whether the size is correct.

 



  was:
When the mappers of distcp finish, the status page in the
web gui reports the data copied.
However, the reported number is far away from the real
number, which is very misleading.
For example, a particular mapper reported: 

Finished. Bytes copied: 4.3g

However, the actual file size is about 9GB.




> distcp mapper's status report misleading
> ----------------------------------------
>
>                 Key: HADOOP-2052
>                 URL: htt
ps://issues.apache.org/jira/browse/HADOOP-2052
>             Project: Hadoop
>          Issue Type: Bug
>          Components: mapred
>            Reporter: Runping Qi
>
> When the mappers of distcp finish, the status page in
the web gui reports the data copied.
> However, the reported number is far away from the real
number, which is very misleading.
> For example, a particular mapper
task_200710131713_0001_m_000000_0  reported: 
> Finished. Bytes copied: 4.3g
> However, it does not say which file.
> I thought it was for part-00000. But the file size of
part-00000
> is about 9GB.
> It will be much clearer if the status report  say
something like:
> Finished copy file-xxxx: 4.3g
> That way, I can easily check whether the size is
correct.
>  

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue
online.


Assigned: (HADOOP-2052) distcp mapper's status report misleading
country flaguser name
United States
2007-10-15 16:19:50
     [ https://issues.apache.org/jira/browse/HADOOP-2052?page=co
m.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris Douglas reassigned HADOOP-2052:
-------------------------------------

    Assignee: Chris Douglas

> distcp mapper's status report misleading
> ----------------------------------------
>
>                 Key: HADOOP-2052
>                 URL: htt
ps://issues.apache.org/jira/browse/HADOOP-2052
>             Project: Hadoop
>          Issue Type: Bug
>          Components: mapred
>            Reporter: Runping Qi
>            Assignee: Chris Douglas
>
> When the mappers of distcp finish, the status page in
the web gui reports the data copied.
> However, the reported number is far away from the real
number, which is very misleading.
> For example, a particular mapper
task_200710131713_0001_m_000000_0  reported: 
> Finished. Bytes copied: 4.3g
> However, it does not say which file.
> I thought it was for part-00000. But the file size of
part-00000
> is about 9GB.
> It will be much clearer if the status report  say
something like:
> Finished copy file-xxxx: 4.3g
> That way, I can easily check whether the size is
correct.
>  

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue
online.


Resolved: (HADOOP-2052) distcp mapper's status report misleading
country flaguser name
United States
2007-10-24 13:18:50
     [ https://issues.apache.org/jira/browse/HADOOP-2052?page=co
m.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Owen O'Malley resolved HADOOP-2052.
-----------------------------------

       Resolution: Duplicate
    Fix Version/s: 0.15.0

This was fixed by HADOOP-2048.

> distcp mapper's status report misleading
> ----------------------------------------
>
>                 Key: HADOOP-2052
>                 URL: htt
ps://issues.apache.org/jira/browse/HADOOP-2052
>             Project: Hadoop
>          Issue Type: Bug
>          Components: mapred
>            Reporter: Runping Qi
>            Assignee: Chris Douglas
>             Fix For: 0.15.0
>
>
> When the mappers of distcp finish, the status page in
the web gui reports the data copied.
> However, the reported number is far away from the real
number, which is very misleading.
> For example, a particular mapper
task_200710131713_0001_m_000000_0  reported: 
> Finished. Bytes copied: 4.3g
> However, it does not say which file.
> I thought it was for part-00000. But the file size of
part-00000
> is about 9GB.
> It will be much clearer if the status report  say
something like:
> Finished copy file-xxxx: 4.3g
> That way, I can easily check whether the size is
correct.
>  

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue
online.


[1-5]

about | contact  Other archives ( Real Estate discussion Medical topics )