Every Turtle/RDF file must be validated before committing to SVN.
Otherwise the automatic repository deployment (refresh) scripts fail and you waste your colleagues' time.
- Download ARQ (ARQ-2.8.8 is current as of 2011-04-21)
- Unzip it to a path that includes no spaces (eg on Windows: c:\prog\ARQ-2.8.8)
- On Linux it's easier:
  - add ARQ-2.8.8/bin to your path
- On Windows you need to jump through more hoops:
  - add c:\prog\ARQ-2.8.8\bat to your path
  - write a batch file riot.bat in the same dir:
- Just get apache-jena-2.11.1, it includes the required files: shell (Linux) and batch (Windows)
Call it like this
- Unfortunately it reports only the first error. rdfparse (another Jena tool) also reports only the first error
- If there are no errors, you'll see no output
- TODO: integrate this as an SVN commit hook, or an Emacs vc-before-checkin-hook
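As a starting point for the TODO above, here is a minimal Python sketch of the checking step only. It is not a complete SVN hook (a real pre-commit hook would first have to extract the changed files from the transaction). It assumes that `riot` is on the path and accepts the `--validate` option, and that a clean run prints nothing, as noted above. The names `validate_files` and `main` are made up for this illustration, and on Windows the command may need to be `riot.bat`.

```python
import subprocess
import sys

# Assumption: "riot" is on the PATH and "--validate" only parses and checks the file.
DEFAULT_VALIDATOR = ["riot", "--validate"]


def validate_files(paths, validator=DEFAULT_VALIDATOR):
    """Run the validator on each file; return {path: error output} for the failures."""
    failures = {}
    for path in paths:
        result = subprocess.run(
            list(validator) + [str(path)],
            capture_output=True,
            text=True,
        )
        output = (result.stdout + result.stderr).strip()
        # A clean run is expected to be silent, so a non-zero exit code
        # or any output at all is treated as a failure.
        if result.returncode != 0 or output:
            failures[str(path)] = output
    return failures


def main(argv):
    """Validate the Turtle/RDF files named in argv; return a process exit code."""
    files = [name for name in argv if name.endswith((".ttl", ".rdf", ".nt"))]
    failures = validate_files(files)
    for path, message in failures.items():
        print(f"{path}: {message}", file=sys.stderr)
    return 1 if failures else 0
```

A wrapper script would call `sys.exit(main(sys.argv[1:]))`, so that a non-zero exit code blocks the commit. Treating any output as a failure is a deliberately strict choice: warnings would also block the commit.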
Here's how to integrate RIOT into the Emacs 'compile' command:
- Get and install n3-mode.el (it's rather primitive but still useful for Turtle editing), then
- Get and install smart-compile.el, then
- Add regexps to recognize RIOT's error messages:
- When editing a TTL file, invoke compilation with "C-c c". It jumps automatically to the first error, eg:
Use this for a one-off validation job.
TODO: we can easily automate calling this with wget
ICS-FORTH Validating RDF Parser: a tool for analyzing, validating and processing RDF schemas and resource descriptions. SVG visualization.
- Tried to install it
- converted susana.ttl to susana.rdf
- runVRP.bat: fixed VRP_HOME, removed JAVA_HOME, fixed command line:
- ran it, trying various options
- specified susana-browseSchema.svg as one of the outputs.
Had to add namespaces:
and it shows a page with some control buttons, but no content
- it was never able to produce schemaVisualization.svg
- turns out it's buggy:
It's part of RDFSuite that includes
- Validating RDF Parser (VRP): The First RDF Parser supporting semantic validation of both resource descriptions and schemas
- RDF Schema Specific DataBase (RSSDB): The First RDF Store using schema knowledge to automatically generate an Object-Relational (SQL3) representation of RDF metadata and load resource descriptions.
- RDF Query Language (RQL): The First Declarative Language for uniformly querying RDF schemas and resource descriptions.
- RDF Update Language (RUL): The First Declarative Language for uniformly updating resource descriptions
But it dates from 2002-2003 and, given the above experience, I won't try it.
Jena Eyeball http://jena.sourceforge.net/Eyeball/
- RDF's open world assumption means "anything goes" so eg a misspelt prop name is not considered a mistake.
- Eyeball tries to overcome this. It includes a lot of configurable checks over RDF data
- Windows download: http://sourceforge.net/projects/jena/files/Eyeball/Eyeball%202.3/eyeball-2.3.zip/download
for a first attempt and pieces of advice. Once we apply it successfully, we should move the info here.
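To illustrate the kind of check meant here (this is not Eyeball's API or implementation, only a plain-Python sketch of the idea), the following compares the predicates used in some data against a declared vocabulary. The function name `unknown_predicates` and the example.org data are hypothetical.

```python
def unknown_predicates(triples, known_properties):
    """Return the predicates used in the data but missing from the vocabulary, sorted."""
    used = {predicate for _subject, predicate, _obj in triples}
    return sorted(used - set(known_properties))


# Hypothetical data: the second triple misspells "label" as "lable".
RDFS = "http://www.w3.org/2000/01/rdf-schema#"
triples = [
    ("http://example.org/a", RDFS + "label", "A"),
    ("http://example.org/b", RDFS + "lable", "B"),
]
known = {RDFS + "label", RDFS + "comment"}

print(unknown_predicates(triples, known))
# prints ['http://www.w3.org/2000/01/rdf-schema#lable']
```

A plain RDF parser accepts both triples as valid. Only a closed-world check against the vocabulary flags the misspelt one.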
Sebastian Hellmann email@example.com on OA mlist:
I convert/validate with
- Rapper: http://librdf.org/raptor/rapper.html
- rdf.sh: https://github.com/seebi/rdf.sh (Note the RDF diff function)
- Jena CLI: http://jena.sourceforge.net/tools.html
- Pellet: http://clarkparsia.com/pellet
- Pellint: http://weblog.clarkparsia.com/2008/07/02/pellint-an-ontology-repair-tool/
Comparison of several tools for converting RDF to Turtle.
The tools are ordered below by preference. Or you can compare the results yourself:
Can concatenate several files, URLs or - (stdin).
Based on Jena RIOT.
Provides the best results.
Also based on Jena RIOT.
Requires you to always specify the URL scheme, even for a local file.
Almost as good as rdfcat, just adds @base <file:2354.rdf> on top, which may not be desired
Does not group all statements per Subject and spreads prefixes throughout the file, so the Turtle is hard to understand
"any23" stands for "anything to triples". It converts RDFa, microformats and RDF formats to Turtle and other RDF formats
- make a batch file like this:
- invoke like this:
- Output is based on Sesame RIO, so it gives the same result as rdf2rdf
- use their web service via the wget program (curl works similarly):
- NOTE: the file must be valid, otherwise the site crashes
The Apache Any23 project management committee are pleased to announce the release of Any23 2.0, which marks a major milestone for the project.
Anything To Triples (any23) is a library, a web service and a command line tool that extracts structured data in RDF format from a variety of Web documents.
Release notes, downloads. Maven artifacts (Maven Central), DOAP machine-readable description. Please report any issues to our community mailing lists.
By the main developer of Sesame (rdf4j)
Note: this tool is not evaluated against the other conversion tools.
Binary: TODO. (Mitac: I couldn't build it due to a missing dep for com.github.jsonld-java)
- url: http://librdf.org/raptor/
- download: http://download.librdf.org/source/raptor2-2.0.15.tar.gz
- based on librdf, written in C, faster than the Java-based converters
- raptor1 has some bugs, notably running out of memory (OOM) on large files. On the other hand, it can be installed via the Linux package manager (apt-get, etc.). BTW, the tool is invoked as 'rapper', not 'raptor'.
- raptor2 claims to have the above problems fixed, but Mitac has not tried it yet.
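For scripted conversions, a minimal Python sketch of calling rapper could look like the following. It assumes `rapper` is on the path and relies on its `-i` (input format) and `-o` (output format) options with the format names `rdfxml` and `turtle`. The function names and the file name `input.rdf` are hypothetical.

```python
import subprocess


def rapper_command(source, input_format="rdfxml", output_format="turtle"):
    """Build the rapper argument list for converting one file."""
    return ["rapper", "-i", input_format, "-o", output_format, source]


def convert(source, target, **formats):
    """Run rapper and write its standard output to the target file."""
    with open(target, "w", encoding="utf-8") as out:
        # check=True raises CalledProcessError if rapper exits with an error.
        subprocess.run(rapper_command(source, **formats), stdout=out, check=True)


# Show the command line that convert("input.rdf", ...) would run.
print(" ".join(rapper_command("input.rdf")))
# prints: rapper -i rdfxml -o turtle input.rdf
```

rapper writes the converted data to standard output, so `convert` redirects that stream into the target file.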