Metamap Cfg

Introduction

Name

metamap.cfg

Location

~/conf/collection/

Description

Defines mappings of HTML <meta> or HTML tags to Funnelback metadata classes.

Syntax

The metamap.cfg consists of a mapping, one per line

class,content,tag name

where:

class

is the Funnelback metadata class.

content

uses one of the Content Flags (below)

tag name

is the name or property value of the <meta> tag, or an HTML tag (in angle brackets). Lines beginning with a hash (#) are treated as comments and ignored.

Content flags

0

do not index this metadata as part of the document's content. Only searchable through explicit use of the metadata field operator see Query Language Help

1

index this metadata as part of the document's content. A general search will also match occurrences of the keyword within this metadata field.

2

sets the field as Geospatial Search data. This feature is available from Funnelback version 7. Note: all collections that are part of a meta collection must share common type 2 metadata fields. For example if x is defined as type 2 metadata in one collection all other collections that are part of the meta collection must also have the x field defined as type 2.

3

sets the field as a Numerical Metadata field. Note: all collections that are part of a meta collection must share common type 3 metadata fields. For example if N is defined as type 3 metadata in one collection all other collections that are part of the meta collection must also have the N field defined as type 3.

4

sets the field as a lock string for early binding document level security.

Example

If a web page contained the following <meta> tags:

<meta name="Title" content="My dummy website">
<meta name="Description" content="An example for metamap.cfg">
<meta name="Keywords" content="shakespeare casear romeo juliet">
<meta name="Date" content="25/07/2006">
<meta property="og:image" content="http://example.com/example.gif">

The following metamap.cfg would map each of those meta tags to classes

t,1,title
c,1,description
s,1,keywords
d,0,date
I,0,og:image

If a web page contained the following HTML in the body:

<h2>This is some heading text</h2>

The following metamap.cfg configuration could be used to put "This is some heading text" into meta class '2'.

2,1,<h2>

Spelling suggestions

If you wish to include the content of fields within spelling suggestions then they will need to be added to the spelling.suggestion_sources configuration. Note, however, that the default spelling.suggestion_sources setting will include all 'type 1' metadata.

See also

top