filter.tika.types
Specifies which file types to filter using the TikaFilterProvider.
Key: filter.tika.types
Type: List<String>
Can be set in: collection.cfg
Description
This option specifies which file types the TikaFilterProvider should filter. Tika supports a wide range of file types and would normally try to filter as many types of input as it can. To allow other filters to handle specific file types, this "white list" controls which types Tika does and does not filter.
Default Value
filter.tika.types=doc,dot,ppt,xls,rtf,docx,docm,pptx,pptm,xlsx,xlsm,pdf,png,gif,jpg,jpeg,tif,tiff,epub,vsd,msg,odt,odp,ods,odg,dwg,key,pages,numbers
Examples
Remove the TIFF types (tif, tiff) from the list so that those files can instead be filtered by a custom filter added to filter.classes:
filter.tika.types=doc,dot,ppt,xls,rtf,docx,docm,pptx,pptm,xlsx,xlsm,pdf,png,gif,jpg,jpeg,epub,vsd,msg,odt,odp,ods,odg,dwg,key,pages,numbers
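Editing a long comma-separated list by hand makes it easy to drop a type by accident. As a minimal sketch, assuming the default value listed above, the following Python helper builds the setting line by removing chosen extensions from the default list. The function name tika_types_without and the DEFAULT_TYPES constant are hypothetical and are not part of the product; the helper only builds a string and does not read or write collection.cfg.

```python
# Hypothetical helper: derive a filter.tika.types value from the documented default.
# DEFAULT_TYPES is copied from the "Default Value" section of this page.
DEFAULT_TYPES = (
    "doc,dot,ppt,xls,rtf,docx,docm,pptx,pptm,xlsx,xlsm,pdf,png,gif,jpg,jpeg,"
    "tif,tiff,epub,vsd,msg,odt,odp,ods,odg,dwg,key,pages,numbers"
)


def tika_types_without(excluded, default=DEFAULT_TYPES):
    """Return a collection.cfg line with the excluded extensions removed."""
    # Normalise the extensions to drop so matching ignores case and stray spaces.
    drop = {ext.strip().lower() for ext in excluded}
    # Keep the remaining types in their original order.
    kept = [t for t in default.split(",") if t.strip().lower() not in drop]
    return "filter.tika.types=" + ",".join(kept)


if __name__ == "__main__":
    # Leave TIFF files to a custom filter instead of Tika.
    print(tika_types_without(["tif", "tiff"]))
```

The printed line can then be pasted into collection.cfg. Only the named extensions are removed, so every other type in the default list is still filtered by Tika.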