filter.classes
Specifies which java classes should be used for filtering documents.
Key: filter.classes
Type: List<String>
Can be set in: collection.cfg
Table of Contents
Description
The filtering framework in Funnelback optionally allows for the specification of separate Java classes to provide filtering of content. Funnelback provides several filters for filtering common file formats. If specialised filtering is required however, the filters must be specified here.
Any filters specified here must implement the com.funnelback.filter.api.filters.Filter
.
See: document filtering
Default Value
filter.classes=TikaFilterProvider,ExternalFilterProvider:JSoupProcessingFilterProvider:DocumentFixerFilterProvider
Examples
filter.classes=com.company.CustomFilterProvider
filter.classes=com.company.CustomFilterProvider2,com.company.CustomFilterProvider1
filter.classes=com.company.CustomChain1:com.company.CustomChain2:com.company.CustomChain3
filter.classes=TikaFilterProvider,ExternalFilterProvider:com.company.CustomChain2:com.company.CustomChain3
Disable the document title fixer filter:
filter.classes=TikaFilterProvider,ExternalFilterProvider