filter.tika.types

Background

This option specifies which filetypes the TikaFilterProvider should filter. This filter has support for a wide range of file types and would normally try to filter as many types of input as it can. To allow other filters to work on specific file types we use this "white list" to control what Tika does/doesn’t filter.

Setting the key

Set this configuration key in the search package or data source configuration.

Use the configuration key editor to add or edit the filter.tika.types key, and set the value. This can be set to any valid List<String> value.

Default value

filter.tika.types=doc,dot,ppt,xls,rtf,docx,docm,pptx,pptm,xlsx,xlsm,pdf,png,gif,jpg,jpeg,tif,tiff,epub,vsd,msg,odt,odp,ods,odg,dwg,key,pages,numbers

Examples

Remove TIFF files from the list to filter those files using custom filter added to filter.classes setting:

filter.tika.types=doc,dot,ppt,xls,rtf,docx,pptx,xlsx,pdf,png,gif,jpg,jpeg,epub,vsd,msg,odt,odp,ods,odg