From: Kir Kolyshkin (no email)
Date: Wed Feb 27 2002 - 02:48:15 EST
Diego Montalvo wrote:
>
> Kir,
>
> I have the following in "aspseek.conf"
>
> ############################################################################
> #AddType <mime type> <regexp> [<regexp>..]
> # This command associates filename extensions with mime types
> # (for services that don't automatically include them - like file://)
> #
> AddType text/plain \.pl$ \.js$ \.txt$ \.h$ \.c$ \.pm$ \.e$
> AddType text/html \.html$ \.htm$
> AddType image/x-xpixmap \.xpm$
> AddType image/x-xbitmap \.xbm$
> AddType image/gif \.gif$
> AddType application/unknown .*
> Converter text/plain /usr/local/bin/word2x $in $out
>
> I am not able to index the following address.
>
> http://www.geocities.com/dmontal2/Computer_Services.doc
>
> Apache is set to serve it as "text/plain".
> I also checked word2x, and it works fine.
>
> What can be wrong?
>
> Diego
>
>
>
> --- Kir Kolyshkin <> wrote:
> > Diego Montalvo wrote:
> > >
> > > Hello,
> > >
> > > I have downloaded the external converter Word2x for
> > > ".doc" files. I have also added the following
> > > commands to "aspseek.conf":
> > >
> > > AddType application/doc \.doc$
> >
> > No need for AddType
> >
> > > Converter application/doc text/plain doc2text $in $out
> >
> > 1. Replace 'doc2text' with '/usr/local/bin/word2x'.
> >
> > 2. Also, check what content type your web server returns, and
> > put it instead of application/doc.
> >
> > 3. Also, check that
> >
> > $ word2x some.doc some.html
> >
> > actually produces sane some.html.
> >
> > >
> > > I am not sure what commands to use to index the ".doc"
> > > format. I have "word2x" at "/usr/local/bin/word2x".
> > >
> > > When I try "./index -a", nothing is indexed. What am
> > > I doing wrong? Am I missing something?
The Converter line needs both the source and the target content type.
You should have the following line:
Converter application/msword text/plain word2x $in $out
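
To see which content type the web server really returns (step 2 in the
quoted message), something like the following may help. It is only a
sketch: it assumes curl and a POSIX awk are installed, and content_type
and converter_line are helper names made up here for illustration, not
part of ASPseek. The word2x path is the one given earlier in this thread.

```shell
#!/bin/sh
# Hypothetical helper: read HTTP response headers on stdin and print the
# bare MIME type from the Content-Type header (charset etc. is dropped).
content_type() {
    tr -d '\r' | awk -F': *' '
        tolower($1) == "content-type" {
            split($2, parts, ";")          # cut off "; charset=..." if present
            gsub(/[ \t]+$/, "", parts[1])  # trim trailing whitespace
            print parts[1]
            exit
        }'
}

# Hypothetical helper: print a Converter line for the given source type,
# using the word2x path from this thread and text/plain as the target.
converter_line() {
    printf 'Converter %s text/plain /usr/local/bin/word2x $in $out\n' "$1"
}

# Offline demo with made-up sample headers, so no network is needed:
converter_line "$(printf 'HTTP/1.1 200 OK\r\nContent-Type: application/msword\r\n\r\n' | content_type)"

# Against a live server (assumes curl is installed):
#   curl -sI http://www.geocities.com/dmontal2/Computer_Services.doc | content_type
```

Whatever type the live check prints is the one to put first on the
Converter line.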
--
http://kir.vtx.ru/ ICQ 7551596 Phone +7 903 6722750
Hi, I'm a signature virus: copy me to your .signature to help me spread!