> Main > Products and Services > Purchase > Company
English / Russian

Documentation

CNSearch 1.5.1

© "CN-Software" Ltd. 2002-2008
support@cn-software.com
www.cn-software.com
Previous | Table of Contents  | Next

Parameters Index

To optimize indexing, you can use specific parameters:

URL <url>

URL url

Address starting with 'http://...' in HTTP-indexing mode, or path to the site copy on a local disk in the local indexing mode.

For example:

For HTTP:

URL http://www.novgorod.ru/frisbee/

For a local disk (Windows):

URL c:/pub/home/frisbee/

For a local disk (Unix):

URL /pub/home/frisbee/

Extensions <ext>

Extensions ext1,ext2,ext3

The parameter defines a list of files extensions to be indexed; it can be used in local disk mode only, and is ignored in HTTP indexing mode. Extensions are separated by "," (comma).

For example:

Extensions htm,html,shtml,shtm

Type <typ>

Type typ

The parameter sets type of the search index:

Default value - normal

For example:

Type Strict

Path <path>

Path path

The parameter defines a path to the directory containing index and log-files.

For example:

Path c:\www\site.com

or

Path /home/www/site.com

CharSet <cset>

CharSet cset

The parameter defines the method of character coding identification. There the following methods:

For example:

CharSet ByHTTPHeader

MaxFiles <num>

MaxFiles num

The parameter sets maximum number of files to be indexed (10000 by default). Be careful: many web-servers contain a huge number of looped links.

For example:

MaxFiles 50

MinWords <num>

MinWords num

The parameter defines minimal number of words within the indexed document. Documents with a smaller number of words will not be added to the search index. This parameter allows improving quality of the search results by means of filtering out small and insignificant documents. Default value is 1.

For example:

MinWords 30

Statistic <stat>

Statistic stat

The parameter defines the method of saving reports which are generated at the end of the indexing process and are saved to stats.log. Available options:

For example:

Statistic Append

Exclude <excl>

Exclude excl1,excl2,excl3

The parameter defines a list of words to be excluded from the indexing. Addresses containing at least one of the excluded words are not included into the indexing queue as well. Words are separated by "," (comma).

For example:

Exclude editpost.php?,reply.php?,admin/

ExcludeVar <var>

ExcludeVar var1,var2,var3

The parameter defines a list of variables to be excluded from the site URL's. The variables are separated by "," (comma).

For example:

ExcludeVar PHPSESSID,order

AddOption <opt>

AddOption opt

The parameter sets indexing method and can be used in HTTP indexing mode only. The following values are available:

For example:

AddOption SubPages

StopWordsFile <file>

StopWordsFile file

The parameter defines the name of the file containing stop-words (see Stop-words).

StopWordsFile stop.txt

Language <lng>

The parameter defines the language. Provided that this parameter is specified, the field 'Accept-Language' will be included into the HTTP header. This variable can affect the document contents on some sites.

For example:

Language ru

AFrom <path>

AFrom path

The parameter defines a substring which will be replaced in URL by the string specified in the parameter ATo.

For example:

AFrom  /home/dir/mysite/

ATo    http://search.codenet.ru/

ATo <url>

ATo url

The parameter defines a substring which will replace AFrom in the URL; it is used together with the AFrom.

For example:

AFrom http://127.0.0.1/

ATo   http://www.codenet.ru/

or

AFrom c:/documents/www/www.codenet.ru/

ATo   http://www.codenet.ru/

StartWord <word>

StartWord word

The parameter defines a word to start the indexing from. Page description will be composed of words following the starting one. Hence, it is possible to exclude menus and the like from the description.

For example:

StartWord about

Sleep <seconds>

Sleep seconds

The parameter defines the timeout between the site pages indexing (sec).

Example:

Sleep 5

ShowURL <yesno>

ShowURL yesno

Displays the pages addresses during indexing. Default value is "yes".

Example:

ShowURL no

ShowEmail <yesno>

ShowEmail yesno

Displays the found e-mail addresses (mailto) during indexing. Default value is "no".

Example:

ShowEmail no

ShowFTP <yesno>

ShowFTP yesno

Displays the found FTP-addresses during indexing. Default value is "no".

Example:

ShowFTP no

Compress <yesno>

Compress yesno

Requests the response compression from the server (in case the server supports this feature). Default value is "yes". Incorrect pages compression can lead to indexing failure.

Example:

Compress no

MetaDescription <yesno>

MetaDescription yesno

The parameter defines page description method. Description can be displayed in the search results with the help of the special symbol %E. Available values are "Yes" or "No". Default is 'No'. If 'Yes' is used, the system attempts to get description from '<META name="description...' tag. If tag cannot be found or the value is 'No', description is composed of the first words of the document.

For example:

MetaDescription Yes

MetaRobots <yesno>

MetaRobots yesno

If the parameter has value "No", the tag 'META name="robots"...' is ignored, otherwise the tag is analyzed for presence of NOINDEX, NOFOLLOW, NONE. More details can be found in the section The use of "Robots" META-tags. Default value is "Yes"

For example:

MetaRobots No

UseRobotsTxt <yesno>

UseRobotsTxt <yesno>

If the parameter is set to "Yes", indexing algorithm is taken from the file 'robots.txt', stored in the web-server root directory. Default value is "No". More information on working with 'robots.txt' is available in the section Search robots. Robot's name is "CNSearch".

For example:

UseRobotsTxt yes

ConnectCount <num>

ConnectCount <num>

The parameter sets quantity of the remote file requests; default value is 5.

For example:

ConnectCount 10


Previous | Table of Contents  | Next