crawler.extract_links_from_javascript

Background

This parameter controls whether the webcrawler should extract and follow links found inside Javascript.

By default, extracting links found in Javascript code is turned off.

This setting causes the crawler to parse any Javascript text embedded in a web page for text that resembles a URL. The crawler does not execute any Javascript, and enabling this does not enable the webcrawler to crawl Javascript generated websites.

Setting the key

Set this configuration key in the search package or data source configuration.

Use the configuration key editor to add or edit the crawler.extract_links_from_javascript key, and set the value. This can be set to any valid Boolean value.

Default value

crawler.extract_links_from_javascript=false

Examples

Turn on Javascript link extraction:

crawler.extract_links_from_javascript=true

See also