filter.text-cleanup.ranges-to-replace
Specify Unicode blocks for replacement during filtering (to avoid 'corrupt' character display).
Key: filter.text-cleanup.ranges-to-replace
Type: List<String>
Can be set in: collection.cfg
Table of Contents
Description
This parameter lists the Unicode blocks of characters which should be removed by the TextCleanupFilterProvider
when it is used in the filter.classes
config setting.
The list is expected to be comma separated, and contain names as listed in the defined unicode block
names. Block names
are case-insensitive (so PLAYING_CARDS
and playing_cards
are both permitted and are equivalent).
Examples
If both private use and playing card characters occur in documents but should not be shown in search results or cached copies:
filter.text-cleanup.ranges-to-replace=private_use_area,playing_cards