"Nutch Lucene based search engine" list archive


NutchBean (and mergecrawl.sh) | 2007-07-31 20:58:07 | 0
fetching stops for one hour | 2007-07-31 18:02:20 | 0
NutchBean (and mergecrawl.sh) | 2007-07-31 16:25:05 | 0
Tomcat without Apache | 2007-07-31 13:33:14 | 2
Really big indexing and timeouts? | 2007-07-31 12:07:52 | 2
spliting an index | 2007-07-31 12:06:14 | 0
Error with Nutch 0.9 | 2007-07-31 11:13:25 | 1
Re: slow generate process | 2007-07-31 06:08:02 | 5
hung threads - NullPointerException in getPos(FSDataInputStream.java:87) | 2007-07-30 21:04:44 | 1
How to create a wiki account for nutch-user | 2007-07-30 18:00:15 | 1
Re: cygwin - Input path doesnt exist | 2007-07-30 11:24:49 | 0
Re: eliminating almost duplicate URLs | 2007-07-30 09:54:05 | 2
MergeSegs | 2007-07-30 07:28:05 | 0
How do I remove ShowAllHits | 2007-07-30 04:32:53 | 2
online indexing? | 2007-07-30 02:17:46 | 2
Map ouput | 2007-07-29 19:05:40 | 2
Fetching HTTPS behind Proxy fails - Patch exists, but is not included in 0.9 | 2007-07-29 10:11:48 | 0
Problems running crawl with cygwin, JAVA_HOME not set | 2007-07-28 15:59:31 | 2
Re: cygwin - Input path doesnt exist | 2007-07-28 12:06:06 | 1
How to determine the number of pages in the index? | 2007-07-28 05:59:00 | 2
cygwin and nightly builds | 2007-07-27 21:35:27 | 1
cygwin - Input path doesnt exist | 2007-07-27 08:20:56 | 1
Pull out a page from already processed pages, re-parse and replace | 2007-07-27 08:06:31 | 2
Re: cygwin - Input path doesnt exist | 2007-07-27 02:56:27 | 1
Pages in UTF-16 | 2007-07-27 02:24:32 | 2
DownloadingNutch - svn co nutch nightly | 2007-07-27 01:00:11 | 1
Re: NullPointerException fetching some sites with temp redirects | 2007-07-27 00:52:50 | 3
eliminating almost duplicate URLs | 2007-07-26 22:58:50 | 0
Multiple Nutch Instances | 2007-07-26 20:04:25 | 0
Re: Redirected-to pages and not-there pages are fetched multiple times | 2007-07-26 19:05:49 | 0
NullPointerException fetching some sites with temp redirects | 2007-07-26 18:21:07 | 4
Redirected-to pages and not-there pages are fetched multiple times | 2007-07-26 18:17:56 | 2
Re: Point of Note to Windows Users | 2007-07-26 12:28:26 | 1
RE: RE : Nutch overhead to Lucene (or: why is Nutch 4 times slower than Lucene ? | 2007-07-26 11:17:02 | 1
unable to open nutch index using IndexReader | 2007-07-26 11:15:15 | 0
Point of Note to Windows Users | 2007-07-26 05:24:44 | 1
Re: IllegalArgumentException: plugin.folders is not defined | 2007-07-25 16:02:12 | 0
Lock obtain timed out | 2007-07-25 15:38:45 | 0
documents fetched but not indexed (Nutch 0.9) | 2007-07-25 13:49:19 | 1
CrawlDbReader TopN | 2007-07-25 10:33:23 | 1
RE: RE : Nutch overhead to Lucene (or: why is Nutch 4 times slower than Lucene ? | 2007-07-25 10:06:43 | 1
RE : Nutch overhead to Lucene (or: why is Nutch 4 times slower than Lucene ?) | 2007-07-25 07:44:18 | 1
Re: slow generate process | 2007-07-25 07:36:07 | 1
slow generate process | 2007-07-25 06:14:28 | 2
Bad version number in .class file when injecting | 2007-07-25 05:55:40 | 1
Writing ScoringFilter plugins | 2007-07-25 05:35:57 | 0
Nutch error /conf/masters: No such file or directory | 2007-07-25 02:02:22 | 0
Search on Date range | 2007-07-25 01:50:36 | 6
Recrawling is not working in Nutch 0.9 | 2007-07-25 01:48:19 | 0
getting document link graph | 2007-07-25 01:21:24 | 2