
Thread: Indexing disparate web pages.




Indexing disparate web pages.
2007-05-29 18:31:43
I'm looking for a tool that will create a single index from individual
web pages across multiple domains (without spidering the entire site).
Can someone confirm my understanding that, as written, htdig does not
operate in this manner, and that if I want to use it this way, I'll need
to start hacking the source?

Thanks,


-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
ht://Dig general mailing list: <htdig-generallists.sourceforge.net>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Re: Indexing disparate web pages.
United Kingdom
2007-05-30 03:49:12
Daniel,

You are correct that this is not the normal mode of operation; however,
htdig can easily be configured to work from a list of links in an
external file and not to spider your site.

One of my main sites actually does both: it builds an index of my
entire site, plus selected external pages.

Regards,
Mike
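
For anyone wanting to try the approach described above, here is a minimal sketch in Python that writes a URL list file and a matching ht://Dig config fragment. It is not Mike's actual setup, which the message does not show. It assumes the documented ht://Dig attributes `start_url` and `max_hop_count`, and the backquote syntax for pulling an attribute value from a file; check these against the attribute reference for your htdig version. The function name, the file names `urls.txt` and `pages.conf`, and the example URLs are all made up for illustration.

```python
from pathlib import Path


def write_htdig_url_list_config(urls, workdir):
    """Write a URL list and a config fragment that points htdig at it.

    Hypothetical helper: file names and layout are illustrative only.
    Returns the path of the generated config fragment.
    """
    workdir = Path(workdir)
    workdir.mkdir(parents=True, exist_ok=True)

    # One URL per line, so the list is easy to maintain by hand.
    url_file = workdir / "urls.txt"
    url_file.write_text("\n".join(urls) + "\n")

    conf_file = workdir / "pages.conf"
    conf_file.write_text(
        # Backquotes ask ht://Dig to read the value from the named file
        # (assumption: supported by your htdig version).
        f"start_url: `{url_file}`\n"
        # Zero hops: index only the listed pages, follow no links.
        "max_hop_count: 0\n"
    )
    return conf_file


if __name__ == "__main__":
    conf = write_htdig_url_list_config(
        ["http://www.example.com/page1.html",
         "http://www.example.org/docs/page2.html"],
        "htdig_sketch",
    )
    print(conf.read_text())
```

The generated fragment still needs the usual settings (such as the database location) from your existing config before running something like `htdig -i -c pages.conf`. To index a whole site plus selected external pages, as Mike mentions, a hop limit of zero would not suit the site itself, so that setup presumably needs a different arrangement than this sketch.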

> -----Original Message-----
> From: htdig-general-bounceslists.sourceforge.net 
> [mailto:htdig-general-bounceslists.sourceforge.net] On 
> Behalf Of Daniel D Jones
> Sent: Wednesday, May 30, 2007 12:32 AM
> To: htdig-generallists.sourceforge.net
> Subject: [htdig] Indexing disparate web pages.
> 
> I'm looking for a tool that will create a single index from
> individual web pages across multiple domains (without spidering the
> entire site.)  Can someone confirm my understanding that, as written,
> htdig does not operate in this manner and if I want to use it so,
> I'll need to start hacking the source?
> 
> Thanks,
> 
> 

