From: Diego Montalvo (no email)
Date: Thu Feb 21 2002 - 10:54:31 EST
Kir,
I am somewhat confused, so ASPSeek will crawl and
index .PDF and such files, but will not present them
as .html? Therefore I need a external converter?
Or does an external converter first convert, then I
run ASPSeek?
example: I want to index "www.crazy.com/beer.pdf" i
simply use ASPSeek, to retreive words from "beer.pdf"
but then I mst use an external program to view in
html?
do you have a link to such a search engine using
ASPSeek with external converters?
Diego
--- Kir Kolyshkin <> wrote:
> Diego Montalvo wrote:
> >
> > Hello,
> >
> > In the ASPSeek Manual pages there is a mention
> that
> > ASPSeek understands PDF, RTF formats with help of
> an
> > external program, what program is that? I would
> like
> > to embed it into ASPSeek.
>
> There's no need to embed. Manual talks about
> External Converters,
> described in
> http://www.aspseek.org/man/aspseek.conf.5.html#lbAM
> So as long as you have program that can convert,
> say, pdf to html,
> you can index pdf documents with aspseek.
>
> Good ps to text (or html) converter is here:
> http://www.nzdl.org/html/prescript.html
> There are also links to other such tools.
>
> As for converter from rtf or doc format, I know of
> word2x: http://word2x.alcom.co.uk/
> antiword: http://www.winfield.demon.nl/index.html
> unrtf: http://www.geocities.com/tuorfa/unrtf.html
> --
> http://kir.vtx.ru/ ICQ 7551596
> Phone +7 903 6722750
> Hi, I'm a signature virus: copy me to your
> .signature to help me spread!
> --
__________________________________________________
Do You Yahoo!?
Yahoo! Sports - Coverage of the 2002 Olympic Games
http://sports.yahoo.com
|
|
|