Thread: Created: (HADOOP-1648) Add contrib jars to general hadoop CLASSPATH




Created: (HADOOP-1648) Add contrib jars to general hadoop CLASSPATH
2007-07-23 11:59:31
Add contrib jars to general hadoop CLASSPATH
--------------------------------------------

                 Key: HADOOP-1648
                 URL: https://issues.apache.org/jira/browse/HADOOP-1648
             Project: Hadoop
          Issue Type: Improvement
            Reporter: stack
            Priority: Minor
         Attachments: hadoop1648.patch

A mapreduce job that depends on any of the hadoop contrib
jars must bundle the contrib jar into its job jar or copy
the contrib jar to the lib dir across the cluster, because
hadoop contribs are not on the general hadoop CLASSPATH.  It
would be an improvement if jobs such as the included hbase
mapreduce tasks did not require this extra step.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue
online.


Updated: (HADOOP-1648) Add contrib jars to general hadoop CLASSPATH
2007-07-23 11:59:31
     [ https://issues.apache.org/jira/browse/HADOOP-1648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

stack updated HADOOP-1648:
--------------------------

    Attachment: hadoop1648.patch

First cut at a patch for bin/hadoop
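
For illustration only, the idea of putting contrib jars on the
CLASSPATH might be sketched as below.  This is not the attached
patch: bin/hadoop is a shell script, while the sketch is written
in Python, and the contrib/<name>/*.jar layout and the helper
name build_classpath are assumptions made for the example.

```python
import glob
import os


def build_classpath(hadoop_home, base_entries=()):
    """Return a CLASSPATH string that also lists contrib jars.

    Assumed (hypothetical) layout: HADOOP_HOME/contrib/<name>/*.jar
    """
    entries = list(base_entries)
    pattern = os.path.join(hadoop_home, "contrib", "*", "*.jar")
    # Sort so the resulting classpath is the same on every run.
    entries.extend(sorted(glob.glob(pattern)))
    return os.pathsep.join(entries)
```

A caller would pass the usual entries (conf dir, core jar, lib
jars) as base_entries and export the returned string as
CLASSPATH before starting the JVM.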



Updated: (HADOOP-1648) Add contrib jars to general hadoop CLASSPATH
2007-07-23 12:21:31
     [ https://issues.apache.org/jira/browse/HADOOP-1648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

stack updated HADOOP-1648:
--------------------------

    Status: Patch Available  (was: Open)

Builds and passes all tests locally



Commented: (HADOOP-1648) Add contrib jars to general hadoop CLASSPATH
2007-07-23 16:43:31
    [ https://issues.apache.org/jira/browse/HADOOP-1648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12514791 ]

Hadoop QA commented on HADOOP-1648:
-----------------------------------

+1

http://issues.apache.org/jira/secure/attachment/12362356/hadoop1648.patch
applied and successfully tested against trunk revision r558831.

Test results:   http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/452/testReport/
Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/452/console



Updated: (HADOOP-1648) Add contrib jars to general hadoop CLASSPATH
2007-07-23 23:26:31
     [ https://issues.apache.org/jira/browse/HADOOP-1648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

stack updated HADOOP-1648:
--------------------------

    Status: In Progress  (was: Patch Available)

Putting this aside for the moment, as adding contrib jars
will not be sufficient to solve this item.  We would also
need to add contrib conf directories to the CLASSPATH but, at
least in hbase, the conf directory location can be set with
an hbase-specific environment variable (HBASE_CONF_DIR).
Finding alternate contrib conf directories would require
polluting the top-level hadoop script with contrib
specifics.
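
To make the concern concrete, here is a rough Python sketch of
what a generic launcher would need in order to locate contrib
conf directories.  The lookup table, the function name and the
default contrib/<name>/conf location are all assumptions for
the example; only HBASE_CONF_DIR comes from the comment above.

```python
import os

# Hypothetical table: every contrib would need its own entry here, which
# is exactly the contrib-specific knowledge a top-level script should not hold.
CONTRIB_CONF_ENV = {"hbase": "HBASE_CONF_DIR"}


def contrib_conf_dirs(hadoop_home, environ=None):
    """Return one conf directory per known contrib, honouring overrides."""
    environ = os.environ if environ is None else environ
    dirs = []
    for name, env_var in sorted(CONTRIB_CONF_ENV.items()):
        # Assumed default location; the real layout may differ.
        default = os.path.join(hadoop_home, "contrib", name, "conf")
        dirs.append(environ.get(env_var, default))
    return dirs
```

Each new contrib with its own variable would mean another entry
in the table, so the generic script grows with every contrib.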



Commented: (HADOOP-1648) Add contrib jars to general hadoop CLASSPATH
2007-07-25 21:01:31
    [ https://issues.apache.org/jira/browse/HADOOP-1648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515475 ]

Dennis Kubes commented on HADOOP-1648:
--------------------------------------

I believe HADOOP-1622 will solve this issue without
polluting hadoop scripts with contrib specifics.  The
current patch for 1622 allows multiple resources, including
directories, jars, files, etc., to be included in a map
reduce job.  All resources are jarred on the fly into a
master job.jar that is deployed to the hadoop cluster.
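
As a rough illustration of jarring resources on the fly, the
Python sketch below packs directories and files into a single
jar (a zip archive).  It is not the HADOOP-1622 patch: the
function name build_job_jar and the choice to place dependency
jars under lib/ are assumptions for the example, and no
manifest is written.

```python
import os
import zipfile


def build_job_jar(jar_path, resources):
    """Pack files and directories into one jar; return sorted entry names."""
    written = []
    with zipfile.ZipFile(jar_path, "w") as jar:
        for resource in resources:
            if os.path.isdir(resource):
                # Store directory contents relative to the directory root.
                for root, _dirs, files in os.walk(resource):
                    for name in files:
                        full = os.path.join(root, name)
                        rel = os.path.relpath(full, resource)
                        arcname = rel.replace(os.sep, "/")
                        jar.write(full, arcname)
                        written.append(arcname)
            else:
                base = os.path.basename(resource)
                # Assumed convention: dependency jars go under lib/,
                # any other single file goes at the top level.
                arcname = "lib/" + base if base.endswith(".jar") else base
                jar.write(resource, arcname)
                written.append(arcname)
    return sorted(written)
```

For example, build_job_jar("job.jar", ["conf", "hadoop-hbase.jar"])
would place the files from conf at the top level of job.jar and
the hbase jar under lib/.  The output jar should live outside
any directory being packed.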



Commented: (HADOOP-1648) Add contrib jars to general hadoop CLASSPATH
2007-07-26 00:40:31
    [ https://issues.apache.org/jira/browse/HADOOP-1648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12515515 ]

stack commented on HADOOP-1648:
-------------------------------

Thanks for the suggestion, Dennis.  Indeed, HADOOP-1622 looks
promising.  I could add a 'job' argument to our hbase master
script that would internally invoke bin/hadoop with the
appropriate arguments to bundle HBASE_CONF_DIR,
hadoop-hbase.jar, etc.



Commented: (HADOOP-1648) Add contrib jars to general hadoop CLASSPATH
2007-11-07 00:12:50
    [ https://issues.apache.org/jira/browse/HADOOP-1648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12540676 ]

stack commented on HADOOP-1648:
-------------------------------

Thinking about this issue more (brought on by how to run MR
jobs from the hbase shell -- HADOOP-1608), my comment above
of 23/Jul/07 09:25 PM confuses what HBASE_CONF_DIR is for;
its content is used for starting and stopping hbase clusters,
not as config. for a MR job w/ hbase dependencies.

Config. for a MR job -- an hbase-site.xml w/ MR config. --
can either be copied across the cluster into
HADOOP_HOME/conf or bundled up into the job jar
(HADOOP-1622).

So, having contrib jars added to the hadoop CLASSPATH will
make it so things like the hbase TableOutput/InputFormats
are generally available and hbase doesn't have to be added
to every job jar that has a hbase dependency.

But what about contrib libs?  In hbase there is at least a
lucene jar that an hbase MR job might conceivably use
(indexing content).





Commented: (HADOOP-1648) Add contrib jars to general hadoop CLASSPATH
2007-11-07 00:14:50
    [ https://issues.apache.org/jira/browse/HADOOP-1648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12540678 ]

stack commented on HADOOP-1648:
-------------------------------

TODO: Test that an hbase-site.xml added into
HADOOP_HOME/conf gets picked up by a running job (it should).
Also figure out how to get contrib libs into the mix.


