crawler.inline_filtering_enabled
Background
This parameter controls whether content is filtered.
Filtering is done inline during the gathering phase. An example of a standard filtering operation would be the extraction of text from binary document formats (e.g. PDF files, MS Office formats etc.).
If enabled then the extractor will use the Tika filtering program as its default for filtering Office and PDF files.
Setting the key
Set this configuration key in the search package or data source configuration.
Use the configuration key editor to add or edit the crawler.inline_filtering_enabled
key, and set the value. This can be set to any valid Boolean
value.