Built-in filters - Process using an external filter (ExternalFilterProvider)

include::partial$banners/not-available-in-sxc.adoc[]

This filter converts specific binary file formats to text using an external program or command as defined in textify.cfg.

Enabling and disabling

External filtering is enabled by default. To enable it on a custom filter chain, add ExternalFilterProvider to the filter chain.

To disable external filtering, remove ExternalFilterProvider from the filter chain.
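
The following minimal Python sketch illustrates adding ExternalFilterProvider to a filter chain and removing it again. It assumes the chain is written as a single string in which `:` separates steps and `,` separates alternative filters within a step; that syntax, the helper names, and the choice to add the filter to the first step are assumptions made for illustration, not a documented Funnelback API.

```python
FILTER = "ExternalFilterProvider"


def parse_chain(chain):
    """Split a chain string into steps, each a list of filter names.

    Assumed syntax: ':' separates steps, ',' separates alternatives.
    """
    steps = []
    for step in chain.split(":"):
        names = [name.strip() for name in step.split(",") if name.strip()]
        if names:
            steps.append(names)
    return steps


def format_chain(steps):
    """Join steps back into a chain string, dropping empty steps."""
    return ":".join(",".join(step) for step in steps if step)


def enable_external(chain):
    """Return the chain with ExternalFilterProvider present exactly once."""
    steps = parse_chain(chain)
    if any(FILTER in step for step in steps):
        return format_chain(steps)
    if steps:
        # Assumption: the filter belongs in the first (conversion) step.
        steps[0].append(FILTER)
    else:
        steps = [[FILTER]]
    return format_chain(steps)


def disable_external(chain):
    """Return the chain with every ExternalFilterProvider entry removed."""
    steps = [[n for n in step if n != FILTER] for step in parse_chain(chain)]
    return format_chain(steps)
```

For example, `enable_external("TikaFilterProvider:JSoupProcessingFilterProvider")` adds the filter to the first step, and passing the result to `disable_external` restores the original string.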

Configuring external filter programs

textify.cfg is used to define external programs that run for specified file extensions.

Configure Funnelback to index additional supported file types

Depending on the type of collection being indexed, additional options may need to be set so that the desired file types are downloaded and indexed.

[cols="1,2,3",options="header"]
|===
| Collection type | Collection configuration option | Description

| web
| crawler.reject_files
| Ensure the file extension is not listed here

| web
| crawler.accept_files
| If used ensure the file extension is listed here

| web
| crawler.non_html
| Ensure the file extension is listed here

| filecopy
| filecopy.filetypes
| Ensure the file extension is listed here

| trimpush
| trim.extracted_file_types
| Ensure the file extension is listed here
|===
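
The checks listed above can be sketched as a small Python helper that reports which options need attention for a given file extension. This is an illustration only: it assumes each option holds a comma-separated list of extensions and that the collection configuration has already been loaded into a dictionary. The `check_extension` function and the `RULES` table are hypothetical names, not part of Funnelback.

```python
# (collection type, configuration option, requirement), taken from the
# options described above.
RULES = [
    ("web", "crawler.reject_files", "absent"),
    ("web", "crawler.accept_files", "present_if_set"),
    ("web", "crawler.non_html", "present"),
    ("filecopy", "filecopy.filetypes", "present"),
    ("trimpush", "trim.extracted_file_types", "present"),
]


def _extensions(value):
    """Parse an assumed comma-separated extension list into a set."""
    return {
        item.strip().lstrip(".").lower()
        for item in value.split(",")
        if item.strip()
    }


def check_extension(collection_type, config, extension):
    """Return a list of problems for `extension`; empty means none found."""
    ext = extension.lstrip(".").lower()
    problems = []
    for ctype, option, requirement in RULES:
        if ctype != collection_type:
            continue
        raw = config.get(option, "")
        listed = ext in _extensions(raw)
        if requirement == "absent" and listed:
            problems.append(f"{option}: remove '{ext}'")
        elif requirement == "present" and not listed:
            problems.append(f"{option}: add '{ext}'")
        elif requirement == "present_if_set" and raw.strip() and not listed:
            problems.append(f"{option}: add '{ext}'")
    return problems
```

Calling `check_extension("web", {"crawler.reject_files": "exe,zip", "crawler.non_html": "pdf,doc"}, "zip")` reports that `zip` should be removed from crawler.reject_files and added to crawler.non_html, while the same call for `pdf` returns an empty list.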

See also: