Thread: writing urls to xml files




writing urls to xml files
2007-03-19 10:45:56
Hi,

I am new to Nutch. I have only just managed to run the Nutch
tutorials.
My requirement is to crawl only .htm files from my intranet, and
the crawl should ignore session IDs.

After that I want to put all the crawled URLs into an XML file. I
want to write the URLs in sitemap format so that the file can later
be submitted to Google.

Is there any way I can achieve this? If so, please provide the
solution as soon as possible.

Please help me.

cheers,
utsavi


-- 
View this message in context: http://www.nabble.com/writing-urls-to-xml-files-tf3427891.html#a9554639
Sent from the Nutch - User mailing list archive at Nabble.com.


Re: writing urls to xml files
2007-03-20 04:59:41
Hey utsavi,

My requirements are almost the same as yours. I am still working
on the same issue and will definitely let you know once I have
solved it.

Keep in touch.

Cheers,
cha

utsavi wrote:
> Hi,
>
> I am new to Nutch. I have only just managed to run the Nutch
> tutorials.
> My requirement is to crawl only .htm files from my intranet, and
> the crawl should ignore session IDs.
>
> After that I want to put all the crawled URLs into an XML file. I
> want to write the URLs in sitemap format so that the file can later
> be submitted to Google.
>
> Is there any way I can achieve this? If so, please provide the
> solution as soon as possible.
>
> Please help me.
>
> cheers,
> utsavi

-- 
View this message in context: http://www.nabble.com/writing-urls-to-xml-files-tf3427891.html#a9569244
Sent from the Nutch - User mailing list archive at Nabble.com.
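
For anyone landing on this thread with the same requirement, here is a
minimal sketch of the post-processing step, not a Nutch feature. It
assumes the crawled URLs have already been exported to a plain list of
strings (how that list is obtained from Nutch is not covered here). The
session parameter names in SESSION_PARAMS and the function names are
assumptions made for illustration; adjust them to whatever the intranet
actually uses. The sketch strips session IDs, keeps only URLs whose path
ends in .htm, and builds a sitemap document in the sitemaps.org 0.9
format.

```python
import re
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode
from xml.etree import ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"

# Assumed session parameter names (hypothetical; match them to your site).
SESSION_PARAMS = {"jsessionid", "sessionid", "sid", "phpsessid"}


def strip_session_id(url):
    """Return the URL without session IDs in the path or query string."""
    parts = urlsplit(url)
    # Remove ";jsessionid=..." style path parameters.
    path = re.sub(r";jsessionid=[^/?#]*", "", parts.path, flags=re.IGNORECASE)
    # Remove query parameters whose name looks like a session ID.
    kept = [(k, v) for k, v in parse_qsl(parts.query, keep_blank_values=True)
            if k.lower() not in SESSION_PARAMS]
    # The fragment is dropped as well, since it never reaches the server.
    return urlunsplit((parts.scheme, parts.netloc, path, urlencode(kept), ""))


def is_htm(url):
    """True only for paths ending in .htm (so .html is excluded)."""
    return urlsplit(url).path.lower().endswith(".htm")


def build_sitemap(urls):
    """Build a sitemap ElementTree from an iterable of raw URLs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    seen = set()
    for raw in urls:
        url = strip_session_id(raw.strip())
        if not url or url in seen or not is_htm(url):
            continue
        seen.add(url)  # skip duplicates that differed only by session ID
        ET.SubElement(ET.SubElement(urlset, "url"), "loc").text = url
    return ET.ElementTree(urlset)


def write_sitemap(url_file, out_file):
    """Read one URL per line from url_file and write the sitemap to out_file."""
    with open(url_file, encoding="utf-8") as handle:
        tree = build_sitemap(handle)
    tree.write(out_file, encoding="utf-8", xml_declaration=True)
```

Calling write_sitemap("urls.txt", "sitemap.xml") would produce the file
to submit, where urls.txt is a hypothetical file with one URL per line.
Note that the sitemap protocol limits a single file to 50,000 URLs, so a
larger intranet would need the output split across several files.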

