Re: AW: [aseek-users] ASPSeek - PDF / RTF

From: Diego Montalvo (no email)
Date: Thu Feb 21 2002 - 13:53:25 EST


Kir or Markus,

What are the proper steps to get this type of
functionality? I assume I must first download the
converters; then how do I configure ASPSeek to use
the external converters?

Diego

--- Kir Kolyshkin <> wrote:
> ASPSeek will also present a "text version" of beer.pdf to be viewed
> (in the place where the "cached" link usually is), much as Google
> does, so you can see the result of the conversion. Excerpts are also
> supported.
>
> > wrote:
> >
> > no no,
> >
> > the external converter is started by aspseek during the indexing
> > process when aspseek finds a pdf file.
> > so in your case:
> >
> > when aspseek indexes www.crazy.com and finds beer.pdf, it starts the
> > converter. the converter reads the pdf document and converts it to
> > txt/html. aspseek then indexes this converted output.
> >
> > now your users can also search in pdf documents. so when "beer" is in
> > beer.pdf, aspseek will list the link to beer.pdf as a result and even
> > display a short extract. your users can then click on the link, and
> > acrobat reader opens to display the pdf file.
> >
> > so an external converter is a helper program that lets aspseek index
> > pdf documents.
> >
> > Markus Rietzler
> > * kommunikation & online service
> > * RZF NRW
> > * Tel: 0211.4572-130
> >
> > -----Original Message-----
> > From: Diego Montalvo [mailto:]
> > Sent: Thursday, 21 February 2002 16:55
> > To:
> > Subject: Re: [aseek-users] ASPSeek - PDF / RTF
> >
> > Kir,
> >
> > I am somewhat confused. So ASPSeek will crawl and index .PDF and
> > similar files, but will not present them as .html? Is that why I
> > need an external converter?
> >
> > Or does an external converter convert the files first, and then I
> > run ASPSeek?
> >
> > Example: I want to index "www.crazy.com/beer.pdf". Do I simply use
> > ASPSeek to retrieve words from "beer.pdf", but then have to use an
> > external program to view it in html?
> >
> > Do you have a link to a search engine that uses ASPSeek with
> > external converters?
> >
> > Diego
> >
> > --- Kir Kolyshkin <> wrote:
> > > Diego Montalvo wrote:
> > > >
> > > > Hello,
> > > >
> > > > The ASPSeek manual pages mention that ASPSeek understands PDF
> > > > and RTF formats with the help of an external program. What
> > > > program is that? I would like to embed it into ASPSeek.
> > >
> > > There's no need to embed anything. The manual talks about
> > > External Converters, described in
> > > http://www.aspseek.org/man/aspseek.conf.5.html#lbAM
> > > So as long as you have a program that can convert, say, pdf to
> > > html, you can index pdf documents with aspseek.
> > >
> > > A good ps to text (or html) converter is here:
> > > http://www.nzdl.org/html/prescript.html
> > > There are also links to other such tools.
> > >
> > > As for converters from rtf or doc format, I know of
> > > word2x: http://word2x.alcom.co.uk/
> > > antiword: http://www.winfield.demon.nl/index.html
> > > unrtf: http://www.geocities.com/tuorfa/unrtf.html
> > > --
> > > http://kir.vtx.ru/ ICQ 7551596
> > > Phone +7 903 6722750
> > > Hi, I'm a signature virus: copy me to your
> > > .signature to help me spread!
> > > --
>
> --
> http://kir.vtx.ru/ ICQ 7551596
> Phone +7 903 6722750
> Hi, I'm a signature virus: copy me to your
> .signature to help me spread!
> --
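
The flow described in the quoted replies above (the indexer finds a PDF, starts
a helper program, then indexes the converted text) can be sketched roughly as
follows. This is only an illustration and not ASPSeek's actual code or
configuration syntax: the CONVERTERS table, the {src}/{dst} placeholders, and
the function names are all made up for this sketch, and pdftotext is assumed as
the converter although the thread does not name it. The real setup is the
External Converters section of the aspseek.conf man page linked above.

```python
import os
import shlex
import subprocess
import tempfile

# Hypothetical table: input MIME type -> (output MIME type, command template).
# {src} and {dst} are placeholders invented for this sketch; pdftotext is an
# assumed converter, not one named in the thread.
CONVERTERS = {
    "application/pdf": ("text/plain", "pdftotext {src} {dst}"),
}


def convert(data, mime, converters=CONVERTERS):
    """Return (mime, text); run an external converter if one is registered."""
    if mime not in converters:
        # No helper needed: treat the document as text already.
        return mime, data.decode("utf-8", errors="replace")
    out_mime, template = converters[mime]
    with tempfile.TemporaryDirectory() as tmp:
        src = os.path.join(tmp, "in")
        dst = os.path.join(tmp, "out")
        with open(src, "wb") as f:
            f.write(data)
        # Split the template first, then fill in the file names, so paths
        # are never re-parsed by a shell.
        argv = [part.format(src=src, dst=dst) for part in shlex.split(template)]
        subprocess.run(argv, check=True)
        with open(dst, "rb") as f:
            return out_mime, f.read().decode("utf-8", errors="replace")


def index_document(index, url, data, mime, converters=CONVERTERS):
    """Add every word of the (converted) document to a word -> URLs index."""
    _, text = convert(data, mime, converters)
    for word in text.lower().split():
        index.setdefault(word, set()).add(url)
```

Note that the index stores the URL of the original document, so a search hit
for a word found in beer.pdf still links to beer.pdf itself, as described
above.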

__________________________________________________
Do You Yahoo!?
Yahoo! Sports - Coverage of the 2002 Olympic Games
http://sports.yahoo.com







