
Thread: Re: doc2html - indexed but no hits




United Kingdom
2007-05-10 08:14:59
Can you tell if doc2html is actually being called by htdig? Just because
htdig is downloading the document does not guarantee that it is being
passed on for conversion to an indexable format. It might be worth
decreasing the number of -v's you are using by one or two so that you can
see what is being found in each document. Also, I'm not sure whether you
have 'statistics' turned on?
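
One way to check whether the converter is being called at all is to put a small logging wrapper in front of it. What follows is only a rough sketch, not a tested setup: the function name log_and_convert, the variables REAL_PARSER and CALL_LOG, and both default paths are made up for illustration, so adjust them to wherever doc2html lives on your system.

```shell
#!/bin/sh
# Hypothetical wrapper around the converter; both default paths are assumptions.
REAL_PARSER="${REAL_PARSER:-/usr/local/bin/doc2html.pl}"
CALL_LOG="${CALL_LOG:-/tmp/doc2html-calls.log}"

log_and_convert() {
    # Record the time and the arguments, so the log shows every invocation.
    printf '%s called with: %s\n' "$(date)" "$*" >> "$CALL_LOG"
    # Pass all arguments through to the real converter unchanged.
    "$REAL_PARSER" "$@"
}

# When this file is installed as the wrapper script, uncomment the next
# line so that the arguments htdig supplies are forwarded:
# log_and_convert "$@"
```

If you point the external_parsers entry in your htdig.conf at the wrapper instead of at doc2html directly and rerun the dig, an empty or missing log file would suggest htdig never hands the documents over for conversion.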

Regards,
Mike

> -----Original Message-----
> From: htdig-general-bounces@lists.sourceforge.net
> [mailto:htdig-general-bounces@lists.sourceforge.net] On
> Behalf Of CHUN KI SHIN
> Sent: Thursday, May 10, 2007 1:43 PM
> To: htdig-general@lists.sourceforge.net
> Subject: [htdig] doc2html - indexed but no hits
> 
> I've been trying to index .pdf and .doc documents in v. 3.2.0b with
> doc2html/catdoc/pdf2html.
> I can see both types indexed fine (though I'm not sure why log
> doesn't tell which words and tags have been indexed). See below:
> 

_______________________________________________
ht://Dig general mailing list: <htdig-general@lists.sourceforge.net>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general
