Skip to end of metadata
Go to start of metadata
You are viewing an old version of this page. View the current version. Compare with Current  |   View Page History

General prerequisites

  • Java 8
  • Credentials to our Nexus publishing repos

Standard (and easy) setup

  1. Get a Semantic Pipeline and unzip it to a directory. For the purpose of this guide, we'll assume you have your pipeline contents unpacked in
  2. Install a web application container. If you already have one - great, if not - we use Apache Tomcat 7
  3. You will need to set a few JVM params, in Tomcat this is done from */apache-tomcat/bin/setenv.sh. See the worker configuration page.
  4. Download extractor-web.war
  5. You can now start your webapp container
  6. Deploy the war you just downloaded. In Tomcat you simply need to move it to its /webapps sub-directory and it will get picked up.
  7. Now go to http://localhost:8080/extractor-web/apidocs for live documentation.
    Due to Swagger limitations, the most important endpoint, namely /extract, cannot have live documentation. This is why it's explained here.

High-availability setup

The high-availability setup architecture includes several components, communicating through RESTful calls. Each component has its own role in the environment. Here's a list with brief explanation of each module:

  • GraphDB with EUF plugin – the GraphDB module maintains a semantic database, containing RDF data used within the system. Its EUF plugin (EUF stands for Entity Update Feed) is responsible for providing the outer world with notifications about every entity (concept) within the database that has been modified in any way (added, removed, edited)
  • Concept Extraction API Coordinator – the Coordinator module accepts annotation requests and dispatches them towards a group of Concept Extraction Workers (see below). The Coordinator communicates with the semantic database in order to track for changes leading to updates in every Worker's Dynamic Gazetteer.
  • Concept Extraction API Worker – a Worker module evaluates annotation requests. It maintains a pool of GATE pipeline instances, used for text analysis and concept extraction.

Installing GraphDB and the EUF plugin

Information about installing and using the GraphDB semantic database can be found on the official GraphDB documentation page.

In order to install the Entity Update Feed plugin, check the CES Components page.

OPTIONAL: insert a single random statement having rdfs:label as predicate in order to activate the EUF plugin

Labels:
None
Enter labels to add to this page:
Please wait 
Looking for a label? Just start typing.