Hi
There are several posts about the difference between
regex-urlfilter.txt and crawl-urlfilter.txt,
e.g. http://www.mail-archive.com/nutch-user lucene.apache.org/msg06318.html
or
http://mail-archives.apache.org/mod_mbox/lucene-nutch-user/200503.mbox/%3c1815d86605033100396d330be mail.gmail.com%3e
This might be a stupid question, but what do you mean by intranet and
internet crawling?
In the end both of them are just URLs ... right? It seems to me that I
completely misunderstand something.
Thanks for any hints
Michi
--
Michael Wechner
Wyona - Open Source Content Management - Apache Lenya
http://www.wyona.com
http://lenya.apache.org
michael.wechner wyona.com michi apache.org
+41 44 272 91 61