crawler.max_files_stored

Background

This parameter is used to specify an optional value for the maximum number of files the webcrawler should download during the crawl. If this limit is specified and reached then the crawler will halt cleanly at that point, even if there are more URLs in the frontier.

Alternatively, if this value has been configured for a site or domain using the site profiles mechanism then the crawler will not download any more content from the max_timeout_retries relevant site or domain.

Setting the key

Set this configuration key in the search package or data source configuration.

Use the configuration key editor to add or edit the crawler.max_files_stored key, and set the value. This can be set to any valid Integer value.

Default value

By default, the value for this is empty i.e. no limit specified.

crawler.max_files_stored=

Examples

Limit crawl to 50,000 files:

crawler.max_files_stored=50000

Help Center

Menu

crawler.max_files_stored

Background

Setting the key

Default value

Examples

See also