Hello,
With the help of a couple of posters and the Nutch 0.8 tutorial,
I am able to crawl [Intranet mode] using the bin/nutch script from
the command line. I verified the created index using Luke [my thanks
to its author] and by running the search engine through the provided
search UI [Javascript/Tomcat 5.5].
My next step is to learn more about the innards of Nutch by stepping
through the crawling process in debug mode. However, the build.xml
file does not provide Ant targets for debugging or running Nutch from
an IDE [I am using Nutch 0.9.12-dev and the NetBeans 5.0 IDE].
Can anyone please help me with this? This would be a tremendous help
to me and - I am sure - to the general Nutch developer community as
well.
Nutch being as comprehensive as it is, it is hard [for me at least]
to understand the details just by turning on logging. Stepping
through the executing code is the best way I can think of...
Thanks a lot!
Regards,
jp