compared with
Version 4 by Reneta Popova
on Sep 30, 2014 15:03.

h3. Purpose of the Linked Data Gazetteer Processing Resource

A gazetteer component recognizes pieces of text as objects that are meaningful to the user. The Linked Data Gazetteer processes text and creates Lookup annotations over words or groups of words (so-called tokens), and assigns features to these annotations. It uses a pre-filled cache structured as {{(label \-> (instance identifier, instance type))}}. The contents of this cache are defined by the TRIE Cache for Linked Data Gazetteer Language Resource, which is attached to the Gazetteer Processing Resource as a runtime parameter. Extra annotation features can be added to the Lookups generated by the gazetteer; the Metadata Language Resource is responsible for these. An instance of the Metadata Language Resource can be attached to the Linked Data Gazetteer PR as an optional runtime parameter. Its cache structure is, in general, {{(identifier(instance identifier, instance type) \-> list (feature name, feature value)|(feature name, feature value))}}.
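
The two cache structures described above can be pictured with a small toy model. This is only a sketch under stated assumptions, not the plugin's implementation: it uses plain Java maps and a naive scan over token spans in place of a real TRIE, it assumes Java 16 or later (for records), and the names {{GazetteerSketch}}, {{Instance}}, {{Lookup}}, {{annotate}}, the feature names {{inst}} and {{class}}, and the sample data are all hypothetical.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class GazetteerSketch {

    /** Cache value: (instance identifier, instance type). */
    record Instance(String id, String type) {}

    /** Toy Lookup annotation covering tokens [start, end) plus its features. */
    record Lookup(int start, int end, Map<String, String> features) {}

    /**
     * Naive matcher: tries every contiguous token span as a label.
     * A real TRIE cache would avoid rebuilding and re-hashing each span.
     * The metadata map is optional and may be null.
     */
    static List<Lookup> annotate(List<String> tokens,
                                 Map<String, Instance> cache,
                                 Map<Instance, Map<String, String>> metadata) {
        List<Lookup> lookups = new ArrayList<>();
        for (int start = 0; start < tokens.size(); start++) {
            StringBuilder label = new StringBuilder();
            for (int end = start; end < tokens.size(); end++) {
                if (end > start) {
                    label.append(' ');
                }
                label.append(tokens.get(end));
                Instance hit = cache.get(label.toString());
                if (hit == null) {
                    continue;
                }
                // Feature names "inst" and "class" are illustrative assumptions.
                Map<String, String> features = new HashMap<>();
                features.put("inst", hit.id());
                features.put("class", hit.type());
                // Extra features come from the optional metadata cache.
                if (metadata != null) {
                    features.putAll(metadata.getOrDefault(hit, Map.of()));
                }
                lookups.add(new Lookup(start, end + 1, features));
            }
        }
        return lookups;
    }

    public static void main(String[] args) {
        // Made-up sample data, for illustration only.
        Instance newYork = new Instance("http://example.org/New_York", "Location");
        Map<String, Instance> cache = Map.of("New York", newYork);
        Map<Instance, Map<String, String>> metadata =
                Map.of(newYork, Map.of("country", "USA"));
        List<String> tokens = List.of("She", "moved", "to", "New", "York");
        for (Lookup lookup : annotate(tokens, cache, metadata)) {
            System.out.println(lookup);
        }
    }
}
```

Keying the metadata map by the full (instance identifier, instance type) pair mirrors the structure given above; passing {{null}} for the metadata map models running the gazetteer without a Metadata Language Resource.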

h3. Resource workflow
2) Runtime parameters
a) cacheLR - this is the TRIE Cache Language Resource that will be used for matching
b) [SAS:optional] inputAsName - the name of the annotation set that contains the Token annotations; the default is <null>, i.e. the default annotation set
c) [SAS:optional] metadataLR - the Metadata Language Resource bound to the corresponding TRIE Cache Language Resource

h3. Step-by-step guide for creating and adding a Linked Data Gazetteer Processing Resource into a pipeline
1) open a GATE Developer instance
2) [SAS:optional] load the GATE application where the Gazetteer PR is to be added
3) load the gazetteer CREOLE plugin:
a) File \-> Manage CREOLE plugins
6) set parameter values; refer to the TRIE Cache Parameters section for more information
7) click 'OK'; the TRIE Cache Language Resource should begin preparing its cache by evaluating the queries or deserializing existing data
8) [SAS:optional] right-click 'Language Resources \-> Metadata LR' to create a Metadata Language Resource
9) [SAS:optional] set parameter values; refer to the Metadata Language Resource Parameters section for more information
10) [SAS:optional] click 'OK'; the Metadata Language Resource should begin preparing its cache by evaluating the queries or deserializing existing data
11) right-click 'Processing Resources \-> Linked Data Gazetteer' to create a Linked Data Gazetteer Processing Resource
12) open the pipeline where the Linked Data Gazetteer should be added and add the newly created instance at the desired position within the pipeline