Gather and index

39 articles.

  1. WebDAV sites

    This article discusses how to crawl and index a WebDAV site using the web crawler.

    • Data sources
  2. FTP sites

    This article discusses how to use a web collection to index an FTP site.

    • Data sources
  3. Installing database drivers

    This article explains how to install JDBC database drivers for use with Funnelback database collections.

    • Data sources
    • Database
  4. Configure filecopy collection log level

    This article provides the steps required to set the log level for a filecopy collection.

    • Data sources
    • System administration
    • Filecopy
    • Logging
  5. Workflow commands on Windows

    This article describes how to chain workflow commands when using Funnelback on Windows.

    • Filtering and workflow
    • Workflow
  6. Remove stop words from a string in Groovy

    This article shows how stop words can be removed from a string in Groovy.

    • Filtering and workflow
    • Search frontends
    • Jsoup filters
    • Hook scripts
  7. Indexing and searching for hashtags and usertags

    This article shows how to configure Funnelback so that users can search for hashtags and usertags.

    • Filtering and workflow
    • Search frontends
    • Jsoup filters
    • Modern UI
    • Hook scripts
  8. Splitting XML files

    This article discusses two different techniques for splitting XML files into separate items within the search index.

    • Data sources
    • XML
    • Groovy filters
    • Jsoup filters
  9. Meta collections - where to make configuration changes

    This article provides guidelines to help you decide whether a configuration change relating to a meta collection should be made to the meta collection itself or to an underlying component collection.

    • Data sources
    • Meta collections
  10. Ignore canonical links in web pages

    This article outlines the steps required to ignore canonical links that are embedded in web pages.

    • Data sources
    • Indexing and storage
    • Web