Crawling and indexing WebDAV sites
Funnelback includes basic support for the crawling of WebDAV sites using the web crawler.
Crawling of WebDAV sites does not support document level security. |
Details
WebDAV is delivered via HTTP and can be accessed using the web crawler.
-
Create a web data source for the WebDAV site index
-
Configure the start URL to be the WebDAV site’s root page
-
Configure include/exclude patterns as for a standard web data source.
-
-
Configure authentication
-
Configure authentication in the same manner as for an authenticated web crawl.
-
-
Configure filetypes and download sizes
-
The basic filetypes supported by web data source will be gathered. Additional filetypes can be added. See: Configure Funnelback to index additional file types
-
Set download and parser sizes using the
crawler.max_download_size
andcrawler.max_parse_size
settings.
-
-
Crawl the site.