"Nutch" list archive


linkdb bug | 2006-12-30 19:19:08 | 3
Created: (NUTCH-423) Add other index-basic fields as query plugins | 2006-12-29 00:48:30 | 1
Issue with Boosting Fields | 2006-12-28 13:39:27 | 0
Closed: (NUTCH-274) Empty row in/at end of URL-list results in error | 2006-12-28 00:22:23 | 0
Closed: (NUTCH-273) When a page is redirected, the original url is NOT updated. | 2006-12-28 00:18:31 | 0
Closed: (NUTCH-322) Fetcher discards ProtocolStatus, doesn't store redirected pa | 2006-12-28 00:18:24 | 0
Created: (NUTCH-416) CrawlDatum status and CrawlDbReducer refactoring | 2006-12-28 00:14:22 | 3
Created: (NUTCH-415) Generate should mark selected records in crawlDB | 2006-12-28 00:10:22 | 3
Created: (NUTCH-419) unavailable robots.txt kills fetch | 2006-12-24 13:26:22 | 4
Updated: (NUTCH-273) When a page is redirected, the original url is NOT updated. | 2006-12-24 07:34:37 | 1
Extracting title from XHTML pages | 2006-12-21 13:01:45 | 4
crawl null pointer | 2006-12-21 10:22:08 | 0
implement thai language indexing and search | 2006-12-21 08:30:37 | 10
Updated: (NUTCH-272) Max. pages to crawl/fetch per site (emergency limit) | 2006-12-21 05:10:22 | 0
difference between intranet and internet crawling | 2006-12-20 16:47:59 | 0
Warning: set speculative execution to false | 2006-12-15 15:05:06 | 0
hi all: | 2006-12-14 14:28:58 | 0
NUTCH 0.8.1: Difficulties with Analyzers | 2006-12-13 16:21:54 | 0
Fetching problem and FileProtocol bug in Nutch 0.8.1 | 2006-12-12 16:08:54 | 0
Created: (NUTCH-414) parse-mp3 plugin concatenating previous tags for text field | 2006-12-12 15:29:20 | 0
parse-mp3 plugin concatenating previous tags for text field | 2006-12-12 15:13:33 | 1
Commented: (NUTCH-248) add support for internationalized domain names | 2006-12-11 18:54:22 | 0
include hadoop native libs to nutch? | 2006-12-11 16:26:44 | 0
Changing NutchConf params at Runtime. | 2006-12-11 15:39:09 | 0
Fetching problem and FileProtocol bug in Nutch 0.8.1 | 2006-12-10 21:16:00 | 0
Porn sites' link at the download page | 2006-12-10 19:45:00 | 2
svn commit: r485076 - in /lucene/nutch/trunk/src: java/org/apache/nutch/metadata | 2006-12-10 16:01:54 | 1
hi all: | 2006-12-10 05:28:44 | 1
svn commit: r485076 - in /lucene/nutch/trunk/src: java/org/apache/nutch/metadata | 2006-12-10 00:05:56 | 1
svn commit: r485076 - in /lucene/nutch/trunk/src: java/org/apache/nutch/metadata | 2006-12-09 22:56:11 | 0
hi all: | 2006-12-09 07:59:04 | 0
What's the status of Nutch-GUI? | 2006-12-08 20:35:15 | 16
Brochure for Nutch | 2006-12-08 20:26:11 | 2
Created: (NUTCH-413) Fetcher ignores -noParsing command line option | 2006-12-08 20:16:24 | 3
Want some idea abt distributed searching behind Nutch | 2006-12-08 16:46:44 | 0
Nutch site crawling | 2006-12-07 10:47:20 | 0
Full List of Metadata Fields | 2006-12-06 23:43:27 | 2
Indexing and Re-crawling site | 2006-12-05 11:10:27 | 0
Indexing and Re-crawling site | 2006-12-05 09:18:31 | 0
Created: (NUTCH-412) plugin to parse the feed-url (rss/atom) of a blog | 2006-12-03 08:00:23 | 2
Phrase query analysis-fr | 2006-12-02 22:45:22 | 0
Commented: (NUTCH-224) Nutch doesn't handle Korean text at all | 2006-12-02 01:47:22 | 0
Commented: (NUTCH-224) Nutch doesn't handle Korean text at all | 2006-12-02 01:41:22 | 0
Protocol.secure | 2006-12-01 14:32:09 | 0
