Re: [aseek-users] MaxDocSize not working?

From: kir (no email)
Date: Wed Oct 30 2002 - 19:00:11 EST


Daniel Winter wrote on Tue, 29 October 2002 23:47
> But in the docs from aspseek is written:
>
> "MaxDocSize bytes
> Sets the maximum document size in bytes, so pages with size more that
> bytes will not be processed. Default value is 1048576 bytes (1Mb)."
>
> That is wrong. I would be happy, if it were right.

Documentation is fixed in CVS (will be available in 1.2.11), thanx for the catching.

As for your problem, fixing it requires changing ASPseek's crawler strategy: it should first send HEAD request to web server, than check the size and, if allowed, send GET request. This can really slow down indexing, so in general it should be avoided. But, as ASPseek is free and open source software, you can easily modify the source code to achieve what you want, or ask/hire somebody to do it.

--
-- kir







Hosted Email Solutions

Invaluement Anti-Spam DNSBLs



Powered By FreeBSD   Powered By FreeBSD