Hi, I am a newbie to Nutch but I need to build a small
'vertical' search
engine with Nutch. I tried to read some documents but didnt
find what I
need.
Basically, what I want to know is how to make sure only web
pages with
certain keywords are indexed. For example, for a vertical
search engine for
cars, I only want to index pages with keywords like 'car' or
'automobile',
etc.
Is there a configuration option in Nutch? Is there a plug-in
that I can use
or do I have to write my own plug-in?
Thanks a lot!!
--
View this message in context: http:
//www.nabble.com/Screening-of-web-pages-in-Nutch-indexing-fo
r-vertical-search-tf4644684.html#a13267360
Sent from the Nutch - User mailing list archive at
Nabble.com.
|