Thread: Re: Updated: (HADOOP-1180) NNbench test should be able to test the checksumfilesystem as well




Re: Updated: (HADOOP-1180) NNbench test should be able to test the checksumfilesystem as well
2007-03-29 13:46:46
So shouldn't fixing this test to conform to the new model in
HADOOP-1134 be the concern of the patch for HADOOP-1134?  As it
stands, I can't run NNBench at scale without using a raw file system,
which is what this patch is intended to allow.  HADOOP-928 caused
this test to use a ChecksumFileSystem and subsequently we saw our
"read" TPS metric plummet from 20,000 to a couple hundred.

Let's get our current benchmark back on track before we commit
HADOOP-1134 (which will likely take a while before it is "Patch
Available").

On Mar 29, 2007, at 11:29 AM, Doug Cutting (JIRA) wrote:

>
>      [ https://issues.apache.org/jira/browse/HADOOP-1180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
>
> Doug Cutting updated HADOOP-1180:
> ---------------------------------
>
>     Status: Open  (was: Patch Available)
>
> -1 This patch may be rendered obsolete by HADOOP-1134.  And, the
> way it is written, the 'useChecksum=false' mode will silently fail
> to work once HADOOP-1134 is completed.  So, if we feel we'll want
> to continue to support this feature after HADOOP-1134, then we
> should add an explicit way of constructing an HDFS FileSystem that
> does not perform checksumming, rather than relying on 'instanceof
> ChecksumFileSystem'.
>
>> NNbench test should be able to test the checksumfilesystem as well
>> as the raw filesystem
>> ------------------------------------------------------------------
>>
>>                 Key: HADOOP-1180
>>                 URL: https://issues.apache.org/jira/browse/HADOOP-1180
>>             Project: Hadoop
>>          Issue Type: Bug
>>          Components: dfs
>>            Reporter: dhruba borthakur
>>         Assigned To: dhruba borthakur
>>         Attachments: nnbench.patch
>>
>>
>> The NNbench test should have the option of testing a file system
>> with checksums turned on and with checksums turned off. The
>> original behaviour of nnbench test was to test hdfs without
>> checksums.
>
> -- 
> This message is automatically generated by JIRA.
> -
> You can reply to this email to add a comment to the issue online.
>


Re: Updated: (HADOOP-1180) NNbench test should be able to test the checksumfilesystem as well
2007-03-29 14:07:01
Nigel Daley wrote:
> So shouldn't fixing this test to conform to the new model in HADOOP-1134
> be the concern of the patch for HADOOP-1134?

Yes, but, as it stands, this patch would silently stop working correctly
once HADOOP-1134 is committed.  It should instead be written in a more
robust way, that can survive expected changes.  Relying on HDFS using
ChecksumFileSystem isn't as reliable as an explicit constructor that
says "I want an unchecksummed FileSystem."

> As it stands, I can't run
> NNBench at scale without using a raw file system, which is what this
> patch is intended to allow.

It seems strange to disable things in an undocumented and unsupported
way in order to get a benchmark to complete.  How does that prove
scalability?  Rather, leaving NNBench alone seems like a strong argument
for implementing HADOOP-1134 sooner.

Still, if you want to be able to disable checksums, for benchmarks or
whatever, we can permit that, but should do so explicitly.

> HADOOP-928 caused this test to use a
> ChecksumFileSystem and subsequently we saw our "read" TPS metric plummet
> from 20,000 to a couple hundred.

Ah, NNBench used the 'raw' methods before, which was kind of sneaky on
its part, since it didn't benchmark the typical user experience.
Although the namenode performance should only halve at worst with
checksums as currently implemented, no?

> Let's get our current benchmark back on track before we commit
> HADOOP-1134 (which will likely take a while before it is "Patch
> Available").

I'd argue that we should fix the benchmark to accurately reflect what
users see, so that we see real improvement when HADOOP-1134 is
committed.  That would make it a more useful and realistic benchmark.
However if you believe that a checksum-free benchmark is still useful, I
think it should be more future-proof.

Doug
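
The difference between inferring checksumming from 'instanceof
ChecksumFileSystem' and asking for it explicitly can be sketched in a
few lines.  This is only an illustration and not part of nnbench.patch
or of Hadoop: Fs, RawFs, WrapperFs, InlineFs, disableByInstanceof and
openExplicit are all hypothetical names.  WrapperFs stands in for a
checksumming wrapper around a raw file system, and InlineFs for a file
system that handles checksums internally, which is assumed here to be
roughly the shape things take after HADOOP-1134.

```java
public class ChecksumModeSketch {
    /** Hypothetical stand-in for a file system; not the Hadoop API. */
    interface Fs {
        boolean checksums();
    }

    /** Raw file system: never checksums. */
    static final class RawFs implements Fs {
        public boolean checksums() { return false; }
    }

    /** Checksumming implemented as a wrapper around a raw file system. */
    static final class WrapperFs implements Fs {
        final Fs raw = new RawFs();
        public boolean checksums() { return true; }
    }

    /** Checksumming handled inside the file system itself (assumed future model). */
    static final class InlineFs implements Fs {
        private final boolean checksums;
        InlineFs(boolean checksums) { this.checksums = checksums; }
        public boolean checksums() { return checksums; }
    }

    /** Fragile: only disables checksums if the wrapper class happens to be in use. */
    static Fs disableByInstanceof(Fs fs) {
        return (fs instanceof WrapperFs) ? ((WrapperFs) fs).raw : fs;
    }

    /** Explicit: the caller states which mode it wants. */
    static Fs openExplicit(boolean wantChecksums) {
        return new InlineFs(wantChecksums);
    }

    public static void main(String[] args) {
        // Wrapper model: unwrapping works, checksums are off.
        System.out.println(disableByInstanceof(new WrapperFs()).checksums());    // false
        // Inline model: the instanceof test no longer matches, so the
        // "checksums off" request is silently ignored.
        System.out.println(disableByInstanceof(new InlineFs(true)).checksums()); // true
        // Explicit request keeps working regardless of the implementation.
        System.out.println(openExplicit(false).checksums());                     // false
    }
}
```

The second call shows the failure mode under discussion: nothing
throws, the benchmark simply keeps running with checksums on.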

Re: Updated: (HADOOP-1180) NNbench test should be able to test the checksumfilesystem as well
2007-03-29 16:48:35
Nigel Daley wrote:
> So shouldn't fixing this test to conform to the new model in HADOOP-1134
> be the concern of the patch for HADOOP-1134?  As it stands, I can't run
> NNBench at scale without using a raw file system, which is what this
> patch is intended to allow.  HADOOP-928 caused this test to use a
> ChecksumFileSystem and subsequently we saw our "read" TPS metric plummet
> from 20,000 to a couple hundred.

Wow! This would be a good test for 1134. I didn't expect the TPS to be
so different. I would expect TPS to remain closer to 20000 with CRCs
with 1134.

Raghu.

> Let's get our current benchmark back on track before we commit
> HADOOP-1134 (which will likely take a while before it is "Patch
> Available").
> 
> On Mar 29, 2007, at 11:29 AM, Doug Cutting (JIRA) wrote:
> 
>>
>>      [ 
>> https://issues.apache.org/jira/browse/HADOOP-1180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
>> ]
>>
>> Doug Cutting updated HADOOP-1180:
>> ---------------------------------
>>
>>     Status: Open  (was: Patch Available)
>>
>> -1 This patch may be rendered obsolete by HADOOP-1134.  And, the way
>> it is written, the 'useChecksum=false' mode will silently fail to work
>> once HADOOP-1134 is completed.  So, if we feel we'll want to continue
>> to support this feature after HADOOP-1134, then we should add an
>> explicit way of constructing an HDFS FileSystem that does not perform
>> checksumming, rather than relying on 'instanceof ChecksumFileSystem'.
>>
>>> NNbench test should be able to test the checksumfilesystem as well as
>>> the raw filesystem
>>> ----------------------------------------------------------------------
>>>
>>>
>>>                 Key: HADOOP-1180
>>>                 URL: https://issues.apache.org/jira/browse/HADOOP-1180
>>>             Project: Hadoop
>>>          Issue Type: Bug
>>>          Components: dfs
>>>            Reporter: dhruba borthakur
>>>         Assigned To: dhruba borthakur
>>>         Attachments: nnbench.patch
>>>
>>>
>>> The NNbench test should have the option of testing a file system with
>>> checksums turned on and with checksums turned off. The original
>>> behaviour of nnbench test was to test hdfs without checksums.
>>
>> --
>> This message is automatically generated by JIRA.
>> -
>> You can reply to this email to add a comment to the issue online.
>>
> 

