filter.ignore.mimeTypes

Background

This parameter allows you to specify an optional comma separated list of MIME types that the filter should ignore.

Setting the key

Set this configuration key in the search package or data source configuration.

Use the configuration key editor to add or edit the filter.ignore.mimeTypes key, and set the value. This can be set to any valid List<String> value.

Default value

filter.ignore.mimeTypes=

Examples

If some .mov video files are being served using the MIME type application/octet-stream then if we want to store them as is (without filtering):

filter.ignore.mimeTypes=application/octet-stream

You may also need to add the relevant suffix (in this case ".mov"), to the crawler.non_html parameter, and remove it from crawler.reject_files. You may also need to consider what type of crawler.classes.URLStore to use e.g. MirrorStore will store the content as separate files.