Crawling and indexing WebDAV sites

Funnelback includes basic support for crawling WebDAV sites with the web crawler.

Crawling of WebDAV sites does not support document-level security.

Details

WebDAV is delivered over HTTP, so a WebDAV site can be crawled using the web crawler.
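Because WebDAV runs over HTTP, it can be useful to confirm that a server actually advertises WebDAV before setting up the crawl. The following is a minimal sketch in Python, not part of Funnelback: it sends an HTTP OPTIONS request and reads the DAV response header defined by the WebDAV specification (RFC 4918). The function names and the host webdav.example.com are illustrative assumptions, and a server that requires authentication may answer with 401 before returning the header.

```python
import http.client
from urllib.parse import urlsplit


def parse_dav_header(value):
    """Split a DAV response header such as '1, 2' into a list of compliance classes."""
    if not value:
        return []
    return [part.strip() for part in value.split(",") if part.strip()]


def webdav_classes(url, timeout=10.0):
    """Send an OPTIONS request to url and return the DAV classes the server advertises.

    An empty list means no DAV header was returned.
    """
    parts = urlsplit(url)
    if parts.scheme == "https":
        conn = http.client.HTTPSConnection(parts.hostname, parts.port, timeout=timeout)
    else:
        conn = http.client.HTTPConnection(parts.hostname, parts.port, timeout=timeout)
    try:
        conn.request("OPTIONS", parts.path or "/")
        response = conn.getresponse()
        response.read()  # drain the body so the connection closes cleanly
        return parse_dav_header(response.getheader("DAV"))
    finally:
        conn.close()


# Example call (placeholder host, requires network access):
# print(webdav_classes("https://webdav.example.com/"))
```

A non-empty result suggests the URL is served by a WebDAV-capable server. The crawl itself is still configured as a normal web data source, as described in the steps that follow.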

Create a web data source for the WebDAV site index
- Configure the start URL to be the WebDAV site’s root page.
- Configure include/exclude patterns as for a standard web data source.
Configure authentication
- Configure authentication in the same manner as for an authenticated web crawl.
Configure filetypes and download sizes
- The basic filetypes supported by a web data source will be gathered. Additional filetypes can be added; see "Configure Funnelback to index additional file types".
- Set the maximum download and parse sizes using the crawler.max_download_size and crawler.max_parse_size settings.
Crawl the site.
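
The size settings named in the steps above can be written as key=value lines. The sketch below is a small Python helper that renders them. The helper name and the example values are hypothetical, and the code assumes that both settings take whole numbers of megabytes, which should be checked against the settings documentation for your Funnelback version.

```python
def render_size_settings(max_download_mb, max_parse_mb):
    """Render the two crawler size settings as key=value lines.

    Assumption: both values are whole numbers of megabytes.
    """
    for name, value in (
        ("max_download_mb", max_download_mb),
        ("max_parse_mb", max_parse_mb),
    ):
        if not isinstance(value, int) or value <= 0:
            raise ValueError(f"{name} must be a positive integer, got {value!r}")
    return (
        f"crawler.max_download_size={max_download_mb}\n"
        f"crawler.max_parse_size={max_parse_mb}\n"
    )


# Example with illustrative values only:
# print(render_size_settings(50, 20), end="")
```

WebDAV repositories often hold large office documents and PDFs, so it can be worth reviewing both limits before the first crawl rather than relying on the defaults.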