Undesirable text filter

Introduction

The undesirable text filter analyzes HTML documents for the occurrence of words from various word lists.
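
The matching idea described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the filter's actual implementation: the names TextExtractor and find_undesirable_text are hypothetical, the word lists are passed in as a dictionary rather than read from configuration, and matching is assumed to be case-insensitive on whole words.

```python
import re
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Hypothetical helper: collects text, skipping <script> and <style> contents."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.chunks.append(data)


def find_undesirable_text(html, word_lists):
    """Return {list_name: {word: count}} for each listed word found in the text."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    text = " ".join(parser.chunks).lower()

    report = {}
    for list_name, words in word_lists.items():
        for word in words:
            # Assumption: whole-word, case-insensitive matching.
            pattern = r"\b" + re.escape(word.lower()) + r"\b"
            count = len(re.findall(pattern, text))
            if count:
                report.setdefault(list_name, {})[word] = count
    return report
```

Calling find_undesirable_text with a page and a dictionary such as {"weasel-words": ["very", "many"]} returns the per-list counts of the words that occur, which is the kind of data a report of undesirable text would need.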

By default, the filter's output is used to produce the content auditor's undesirable text report.

Enabling

The undesirable text filter is enabled by default. It is active when UndesirableText appears in the list of filters set in filter.jsoup.classes:

filter.jsoup.classes=<jsoup_filters>,UndesirableText,<jsoup_filters>

Configuration

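The filter is configured with settings of the following form:

filter.jsoup.undesirable_text.[key_name]=[word]

Word lists are read from the sources named by undesirable_text-source settings. For example, the following entries define sources with the key names default-misspellings and weasel-words:

filter.jsoup.undesirable_text-source.default-misspellings=$SEARCH_HOME/conf/common-misspellings.txt.default

filter.jsoup.undesirable_text-source.weasel-words=undesirable-text.weasel-words.cfg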

See also