Daniel,
You are correct that this is not the normal operation,
however htdig can
easily be tuned to run off a list of links, from an external
file, and
not to spider your site.
One of my main sites actually does both of these operations,
in order to
build an index of my entire site, plus selected external
pages.
Regards,
Mike
> -----Original Message-----
> From: htdig-general-bounces lists.sourceforge.net
> [mailto:htdig-general-bounces lists.sourceforge.net] On
> Behalf Of Daniel D Jones
> Sent: Wednesday, May 30, 2007 12:32 AM
> To: htdig-general lists.sourceforge.net
> Subject: [htdig] Indexing disparate web pages.
>
> I'm looking for a tool that will create a single index
from
> individual web
> pages across multiple domains (without spidering the
entire
> site.) Can
> someone confirm my understanding that, as written,
htdig does
> not operate in
> this manner and if I want to use it so, I'll need to
start
> hacking the
> source?
>
> Thanks,
>
>
>
------------------------------------------------------------
--
> -----------
> This SF.net email is sponsored by DB2 Express
> Download DB2 Express C - the FREE version of DB2
express and take
> control of your XML. No limits. Just data. Click to get
it now.
> http://sourcefor
ge.net/powerbar/db2/
> _______________________________________________
> ht://Dig general mailing list: <htdig-general lists.sourceforge.net>
> ht://Dig FAQ: http://htdig.so
urceforge.net/FAQ.html
> List information (subscribe/unsubscribe, etc.)
> https://lists.sourceforge.net/lists/listinfo/htdig-gen
eral
>
------------------------------------------------------------
-------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and
take
control of your XML. No limits. Just data. Click to get it
now.
http://sourcefor
ge.net/powerbar/db2/
_______________________________________________
ht://Dig general mailing list: <htdig-general lists.sourceforge.net>
ht://Dig FAQ: http://htdig.so
urceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-gen
eral
|