exclude_patterns

Background

This option is a comma-separated list of substrings that the crawler uses to decide whether to process a web page. If a page’s URL contains any of the substrings, the crawler will not process that page.

This option allows the search administrator to exclude an individual web page or an entire site.
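
The substring check described above can be sketched in a few lines of Python. This is only an illustration of the documented behaviour, not the crawler's actual code: the function name is_excluded, the example.com URLs, and the sample value are hypothetical, and case-sensitive matching without whitespace trimming is an assumption.

```python
def is_excluded(url: str, exclude_patterns: str) -> bool:
    """Return True if the URL contains any of the comma-separated substrings."""
    # Split the option value on commas and drop empty entries.
    patterns = [p for p in exclude_patterns.split(",") if p]
    # Assumption: plain, case-sensitive substring test against the full URL.
    return any(p in url for p in patterns)


# Hypothetical value: one individual page and one whole site.
value = "example.com/about/old-page.html,intranet.example.com"

print(is_excluded("https://example.com/about/old-page.html", value))  # True
print(is_excluded("https://intranet.example.com/any/page", value))    # True
print(is_excluded("https://example.com/about/team.html", value))      # False
```

A full page URL as the substring excludes just that page, while a host name as the substring excludes every URL on that site.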

See: include and exclude patterns for a description of how include and exclude patterns work, and for advanced techniques such as using a regular expression as an exclude pattern.

Setting the key

Set this configuration key in the search package or data source configuration.

Use the configuration key editor to add or edit the exclude_patterns key, and set the value. This can be set to any valid List<String> value.

Default value

exclude_patterns=/cgi-bin,/vti,/_vti,calendar,SQ_DESIGN_NAME=print,SQ_ACTION=logout,SQ_PAINT_LAYOUT_NAME=,%3E%3C/script%3E,google-analytics.com

Examples

Exclude the standard paths, plus a local sales folder (any URL containing /sales).

exclude_patterns=/cgi-bin,/vti,/_vti,calendar,/sales
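
To see what the example value would do in practice, the sketch below applies it to a handful of URLs. The example.com URLs and the helper matched_pattern are hypothetical, and the plain case-sensitive substring test is an assumption based on the description in the Background section, not a copy of the crawler's logic.

```python
# The value from the example above, split on commas.
patterns = "/cgi-bin,/vti,/_vti,calendar,/sales".split(",")

# Hypothetical URLs on an example.com site.
urls = [
    "https://example.com/sales/q3-report.html",
    "https://example.com/sales-team/contacts.html",
    "https://example.com/products/index.html",
    "https://example.com/events/calendar?month=5",
]


def matched_pattern(url):
    """Return the first pattern found in the URL, or None if none match."""
    return next((p for p in patterns if p in url), None)


results = {url: matched_pattern(url) for url in urls}
for url, pattern in results.items():
    print(url, "->", f"excluded by {pattern!r}" if pattern else "processed")
```

Because the test is a substring match, /sales also catches /sales-team. If only the folder itself should be excluded, a more specific substring such as /sales/ is likely a safer choice.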