Re: [aseek-users] Indexing subset of website

From: Kir Kolyshkin (no email)
Date: Wed Aug 07 2002 - 05:00:15 EDT


Kir Kolyshkin wrote:
>
> Radosław Maciaszek wrote:
> >
> > Hi!
> >
> > I indexing very big site. I want it all but without one directory.
> >
> > For example this site: http://podatki.interia.pl
> > but i donn't want index http://podatki.interia.pl/urzedy/
> >
> > I know i can delete these results http://podatki.interia.pl/urzedy/ from
> > database
> > after indexing whole site but after period time when indexer index again
> > this site i will have to
> > do this again and again. Can i do this somehow in aspseek.conf?
>
> Basically you have two ways
>
> 1). It it is your server, add
>
> User-Agent: aspseek
> Disallow: /urzedy/
>
> Or you can use '*' instead of 'aspseek'
>
> 2). Add disallow rule into aspseek.conf
>
> Disallow http://podatki.interia.pl/urzedy/

More correct variant is:

Disallow http://podatki\.interia\.pl/urzedy/

because this is a regular expression, and . means "any char",
so if you want '.' exactly, you should write it as '\.'

In other words, first expression also disables
http://podatki-interia.pl/urzedy/, while second does not.

-- ICQ UIN 7551596 Phone +7 903 6722750 --
   Guinness a Day Keeps a Doctor Away (people's wisdom)








Hosted Email Solutions

Invaluement Anti-Spam DNSBLs



Powered By FreeBSD   Powered By FreeBSD