Restricts the file extensions the web crawler should crawl.
Can be set in: collection.cfg
Table of Contents
This is a comma-separated list of file extensions that will be downloaded by the crawler. It is normally left empty, so that the crawler will accept all valid content regardless of the suffix.
In this example a specific list of filetypes (based on suffix) is listed - only files of these types will be downloaded.