Can you tell if doc2html is actually being called by htdig?
Just
because htdig is downloading the document, it does not
guarantee that it
is being passed over for conversion to an indexable format.
It might be worth decreasing the number of v's you are
using by one or
two so that you can see what is being found in each
document. Not sure
if you have the 'statistics' turned on?
Regards,
Mike
> -----Original Message-----
> From: htdig-general-bounces lists.sourceforge.net
> [mailto:htdig-general-bounces lists.sourceforge.net] On
> Behalf Of CHUN KI SHIN
> Sent: Thursday, May 10, 2007 1:43 PM
> To: htdig-general lists.sourceforge.net
> Subject: [htdig] doc2html - indexed but no hits
>
> I've been trying to index .pdf and .doc documents in v.
3.2.0b with
> doc2html/catdoc/pdf2html.
> I can see both types indexed fine (though I'm not sure
why
> log doesn't tell
> which words and tags have been indexed). See below:
>
------------------------------------------------------------
-------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and
take
control of your XML. No limits. Just data. Click to get it
now.
http://sourcefor
ge.net/powerbar/db2/
_______________________________________________
ht://Dig general mailing list: <htdig-general lists.sourceforge.net>
ht://Dig FAQ: http://htdig.so
urceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-gen
eral
|