Crawling password protected websites
Some websites are protected by an authentication scheme which requires a username/password combination to access the site. In order for Funnelback to successfully crawl password protected sites, it must be given a valid user name and password to use.
The authentication schemes that Funnelback currently supports are:
- 
HTTP Basic Authentication 
- 
Windows Integrated Authentication (NTLM) 
- 
Web form based authentication such as SAML. 
Giving Funnelback a username and password
Funnelback supports multiple HTTP Basic username/password pairs per web data source. If you have a single account to configure you can set the values using parameters in a data source configuration. To allow Funnelback access to the protected website:
For basic HTTP authentication:
- 
Set the http_userparameter to a valid HTTP Basic username.
- 
Set the http_passwdparameter to the HTTP Basic username’s password.
For NTLM/Windows Integrated authentication:
- 
Set the crawler.ntlm.domainparameter to a valid NTLM domain.
- 
Set the crawler.ntlm.usernameparameter to a valid username in the NTLM domain.
- 
Set the crawler.ntlm.passwordparameter to the NTLM username’s password.
For FTP sites:
- 
Set the ftp_userparameter to a valid FTP username.
- 
Set the ftp_passwdparameter to the FTP Basic username’s password.
| ftp will need to be added to the crawler.protocols in order to crawl an FTP site. | 
Specifying multiple HTTP Basic usernames and passwords
If you need to specify multiple HTTP Basic accounts for different web servers you can configure this using site profiles.