filter.noindex.[keyName]

Defines rules for hiding content from the Funnelback indexer when using the inject no-index filter.

Key: filter.noindex.[keyName]
Type: String
Can be set in: collection.cfg

Description

Defines the no index rules that will be run by the inject no-index filter (InjectNoIndexFilterProvider).

When a matching rule is encountered the no-index filter will wrap the selected element in Funnelback no-index tags. This will result in the selected content being ignored by the index.

The format allows for setting multiple rules for different URLs in data source configuration:

filter.noindex.N = URL-PATTERN COMMA-SEPARATED-LIST-OF-SELECTORS

where:

  • N: index number

  • URL-PATTERN: URL pattern (standard Java-type regular expressions)

  • COMMA-SEPARATED-LIST-OF-SELECTORS: jsoup CSS-like selector; Supports multiple selectors per URL pattern: URL-PATTERN SELECTOR1,SELECTOR2

Default Value

None

Examples

Ignore all content within header and footer html tags for all documents, and ignore content from divs with class=navigation and elements with an id=footer for urls on the example.com domain.

filter.noindex.1=.* header,footer
filter.noindex.2=example\.com div.navigation,#footer