Thread: running rundig on basic authentication protected sites, PDF files




running rundig on basic authentication protected sites, PDF files
2007-04-09 15:57:55
Hello,

Is there a way to get rundig to update its index within htaccess
protected sites, or is the only way to manually run an htdig -u
user:pass, followed by an htmerge?

Also, are there any particular configuration changes necessary to get
htdig to fully index text in PDFs? Are all PDFs supported?


--
Joe Auty
UITS Messaging
Indiana University
jautyindiana.edu


_______________________________________________
ht://Dig general mailing list: <htdig-generallists.sourceforge.net>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Re: running rundig on basic authentication protected sites, PDF files
2007-04-12 11:21:25
Any ideas here?



Joe Auty wrote:
> Hello,
>
> Is there a way to get rundig to update its index within htaccess
> protected sites, or is the only way to manually run an htdig -u
> user:pass, followed by an htmerge?
>
> Also, are there any particular configuration changes necessary to
> get htdig to fully index text in PDFs? Are all PDFs supported?

--
Joe Auty
UITS Messaging
Indiana University
jautyindiana.edu




Re: running rundig on basic authentication protected sites, PDF files
2007-04-13 15:07:14
On Apr 9, 2007, at 2:57 PM, Joe Auty wrote:

> Is there a way to get rundig to update its index within htaccess
> protected sites, or is the only way to manually run an htdig -u
> user:pass, followed by an htmerge?

You can set the username and password in your config file.

   http://www.htdig.org/attrs.html#authorization
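
For illustration only: as far as I understand the attribute documented at that link, its value has the same username:password form as the -u option and is sent with each request using HTTP Basic authentication. The short Python sketch below is not part of ht://Dig. It only shows what such a header value looks like, and the function name and the credentials are made up for the example.

```python
import base64


def basic_auth_header(credentials: str) -> str:
    """Build the value of an HTTP Basic 'Authorization' header.

    `credentials` is a 'username:password' string, the same form
    that the -u option in the question takes.
    """
    if ":" not in credentials:
        raise ValueError("expected 'username:password'")
    # Basic auth is just base64 over the raw 'username:password' bytes.
    token = base64.b64encode(credentials.encode("utf-8")).decode("ascii")
    return "Basic " + token


if __name__ == "__main__":
    # Hypothetical credentials, for illustration only.
    print(basic_auth_header("user:pass"))
```

Base64 is an encoding, not encryption, so a config file holding these credentials should be readable only by the account that runs the indexer.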

> Also, are there any particular configuration changes necessary to get
> htdig to fully index text in PDFs? Are all PDFs supported?

   http://www.htdig.org/FAQ.html#q4.9

The pdftotext program is well maintained, and I would expect it to
handle most valid PDFs. However, encoding, protection schemes, etc.
may in some cases limit what you can actually extract and index with
htdig.
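
One way to check a particular PDF before indexing is to run pdftotext on it by hand and see whether any text comes out. The Python sketch below does that, assuming pdftotext is installed and on the PATH. It is not how htdig itself calls the converter (the FAQ entry above covers that configuration), and the helper names are made up for the example.

```python
import shutil
import subprocess
import sys
from typing import List, Optional


def pdftotext_command(pdf_path: str) -> List[str]:
    """Command line asking pdftotext to write UTF-8 text to stdout ('-')."""
    return ["pdftotext", "-q", "-enc", "UTF-8", pdf_path, "-"]


def extract_text(pdf_path: str) -> Optional[str]:
    """Return the extracted text, or None if pdftotext is missing or fails."""
    if shutil.which("pdftotext") is None:
        return None
    result = subprocess.run(
        pdftotext_command(pdf_path),
        capture_output=True,
        text=True,
        encoding="utf-8",
        errors="replace",
    )
    if result.returncode != 0:
        return None
    return result.stdout


def has_indexable_text(text: Optional[str]) -> bool:
    """True if extraction produced at least one non-whitespace character."""
    return bool(text and text.strip())


if __name__ == "__main__":
    for path in sys.argv[1:]:
        status = "text found" if has_indexable_text(extract_text(path)) else "no text"
        print(path + ": " + status)
```

A PDF that reports "no text" here (a scanned image, or a file whose protection settings block extraction) is unlikely to contribute any searchable words to the index.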

Jim


