Re: [aseek-users] charset problems

From: yayaivan (no email)
Date: Sat Jul 06 2002 - 11:48:38 EDT


Hi Kirill,

>Have you supplied cs parameter to s.cgi?
yes
as I said, other site displayed correctly in 1251 codepage... even on the same result page

>> I have sometimes strange problems with codepages on few sites. In 99% everything is ok, but few sites indexed incorrect. At the result search page they have "?" symbols instead of cyrillic charecters, when other site on this result page looks ok.
>> It is looks like this:
>>
>> ----------
>> 1. somesite.ru
>> ...0, ? ????? ??? ? ????? ??????? ????? ????????? ???? ?? ???????. 300 ???????, ? ??? ????????? ????? ???????? ?????? ?? ??????????????? ?????. 2002 © ???????. somesite .ru ... ...
>> http://www.somesite.ru/index.html
>> -----------
>>
>> I look in database, in tables urlwordsXX - there data is in 1251 charset, column charset is equal to "1251" ... the only difference from sites where everything is ok, column lang = 'en'
>> Where can be the problem? And how can I solve it?
>>
>> Here are my conf parameters related to charsets:
>>
>> aspseek.conf
>>
>> Include ucharset.conf
>> CharSet windows-1251
>> # I add this, so that servers thought that it is IE,
>> # and return pages in 1251 codepage
>> HTTPHeader User-Agent: Mozilla/4.0 (compatible; MSIE 5.0; WinNT)
>>
>> searchd.conf
>>
>> Include ucharset.conf
>> LocalCharset windows-1251
>>
>> ucharset.conf
>>
>> # it is the only string, I unrem
>> CharsetTableU1 windows-1251 ru tables/windows-1251.txt
>>
>> s.htm
>>
>> CharsetTable koi8-r ru charsets/koi8r
>> CharsetTable cp1251 ru charsets/cp1251
>> LocalCharset cp1251








Hosted Email Solutions

Invaluement Anti-Spam DNSBLs



Powered By FreeBSD   Powered By FreeBSD