Re: [aseek-users] crawl .docs?

From: Kir Kolyshkin (no email)
Date: Tue Feb 26 2002 - 13:22:45 EST


Diego Montalvo wrote:
>
> Hello,
>
> I have downloaded the external converter Word2x for
> ".doc" files. I have also added the following
> commands to
> "aspseek.conf":
>
> AddType application/doc \.doc$

No need for AddType

> Converter application/doc text/plain doc2text $in $out

1. Replace 'doc2text' with '/usr/local/bin/word2x'.

2. Also, check what content type your web server returns, and
put it instead of application/doc.

3. Also, check that

$ word2x some.doc some.html

actually produces a sane some.html.
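
Points 1 and 2 above can be sketched from a shell. This is only a
rough sketch, assuming curl is installed. content_type and
converter_line are helper names made up for illustration and are not
part of ASPseek or word2x. The printed line simply reuses the
Converter format quoted above with the word2x path from point 1.

```shell
# Hypothetical helper: read HTTP response headers on stdin and print
# the media type from the Content-Type header, without parameters
# such as "; charset=...".
content_type() {
  tr -d '\r' | awk -F': *' '
    tolower($1) == "content-type" { sub(/;.*/, "", $2); print $2; exit }'
}

# Hypothetical helper: print a Converter line for aspseek.conf using
# the given content type. $in and $out are printed literally.
converter_line() {
  printf 'Converter %s text/plain /usr/local/bin/word2x $in $out\n' "$1"
}
```

Usage, with a placeholder URL standing in for one of your own .doc
files:

    $ curl -sI http://example.com/some.doc | content_type
    $ converter_line application/msword

The first command prints whatever type your server actually sends.
application/msword in the second command is only an example value, so
substitute the type reported for your server.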

>
> I am not sure what commands to use to index the ".doc"
> format. I have "word2x" at "/usr/local/bin/word2x".
>
> When I try "./index -a", nothing is indexed. What am
> I doing wrong? Am I missing something?
>
> Diego
>

-- 
  http://kir.vtx.ru/    ICQ 7551596  Phone +7 903 6722750
Hi, I'm a signature virus: copy me to your .signature to help me spread!
--






