crawler.accept_files
Restricts the file extensions the web crawler should crawl.
Key: crawler.accept_files
Type: List<String>
Can be set in: collection.cfg
Table of Contents
Description
This is a comma-separated list of file extensions that will be downloaded by the crawler. It is normally left empty, so that the crawler will accept all valid content regardless of the suffix.