[
HTTPS://ISSUES.APACHE.ORG/JIRA/BROWSE/NUTCH-506?PAGE=COM.ATL
ASSIAN.JIRA.PLUGIN.SYSTEM.ISSUETABPANELS:ALL-TABPANEL ]
DO?ACAN GüNEY RESOLVED NUTCH-506.
---------------------------------
RESOLUTION: FIXED
ASSIGNEE: DO?ACAN GüNEY
COMMITTED IN REV. 556946.
> NUTCH SHOULD DELEGATE COMPRESSION TO HADOOP
> -------------------------------------------
>
> KEY: NUTCH-506
> URL:
HTTPS://ISSUES.APACHE.ORG/JIRA/BROWSE/NUTCH-506
> PROJECT: NUTCH
> ISSUE TYPE: IMPROVEMENT
> REPORTER: DO?ACAN GüNEY
> ASSIGNEE: DO?ACAN GüNEY
> FIX FOR: 1.0.0
>
> ATTACHMENTS: COMPRESS.PATCH, NUTCH-506.PATCH
>
>
> SOME DATA STRUCTURES WITHIN NUTCH (SUCH AS CONTENT,
PARSETEXT) HANDLE THEIR OWN COMPRESSION. WE SHOULD DELEGATE
ALL COMPRESSIONS TO HADOOP.
> ALSO, NUTCH SHOULD RESPECT IO.SEQFILE.COMPRESSION.TYPE
SETTING. CURRENTLY EVEN IF IO.SEQFILE.COMPRESSION.TYPE IS
BLOCK OR RECORD, NUTCH OVERRIDES IT FOR SOME STRUCTURES AND
SETS IT TO NONE (HOWEVER, IMO, PARSETEXT SHOULD ALWAYS BE
COMPRESSED AS RECORD BECAUSE OF PERFORMANCE REASONS).
--
THIS MESSAGE IS AUTOMATICALLY GENERATED BY JIRA.
-
YOU CAN REPLY TO THIS EMAIL TO ADD A COMMENT TO THE ISSUE
ONLINE.
|