View Source


h1. Bibliography
- Aligning Large SKOS-Like Vocabularies- Two Case Studies.pdf
- Alignment API 4.0 (SWJ 2010).pdf
- Approach for Named Entity Recognition in Poorly Structured Data (ESWC 2012).pdf
- Bipartite Graph Co-Clustering Approach to Ontology Mapping (ISWC 2003).pdf
- Comparing human and automatic thesaurus mapping approaches in the agricultural domain.pdf
- Comparison of Schema Matching Evaluations.pdf
- Concept chaining utilizing meronyms in text characterization (ECDL 2012).pdf
- Evaluating the success of vocabulary reconciliation for CH Collections (freeyourmetadata 2013).pdf
- Exploring Entity Recognition and Disambiguation for CH Collections (freeyourmetadata 2014) .pdf
- Formal concept analysis as Mathematical Theory of Concepts and Concept Hierarchies (cropped).pdf
- Hierarchical structuring of CH objects within large aggregations (TPDL 2013).pdf
- LinksB2N- Automatic Data Integration For The Semantic Web (ODBASE 2009).pdf
- LinksB2N- Automatic Data Integration For The Semantic Web (OTM 2009).pdf
- Lost In Translation- Empirical Analysis Of Mapping Compositions For Large Ontologies (OM2010).pdf
- Machine Learning based Vocabulary Management Tool Assessment for LOD (IJCA 2013).pdf
- Matching multi-lingual subject vocabularies (TELplus ECDL 2009).pdf
- Matching unstructured vocabularies using a background ontology (EKAW 2006).pdf
- Method for Estimating the Precision of Placename Matching (NKOS 2004)-presentation.ppt
- Method for Estimating the Precision of Placename Matching (TKDE 2007).pdf
- Proposal on Extended Interthesaurus Links Semantics (1998).pdf
- Semantic Problems of Thesaurus Mapping (JODI 2001).pdf
- TEL Multilingial Subject Access Initiative (MACS, EuroVoc Conference 2010).ppt
- Thesaurus Alignment for Linked Data Publishing (AGROVOC- DC 2011).pdf
- Topic models for taxonomies (JCDL 2012).pdf
- Vocabulary Mapping for Terminology Services (2004).pdf

h2. Amalgame
VUA's alignment server, implemented in SWI Prolog, using ClioPatria
- Amalgame- Interactive Vocabulary Alignment (TPDL 2011)-paper.pdf
- Amalgame- Interactive Vocabulary Alignment (TPDL 2011)-poster.pdf
- Amalgame- Interactive Vocabulary Alignment (TPDL 2011)-teaser.pdf
- Amalgame- Vocabulary alignment as an interactive and replicable workflow (PrestoPrime D4.2.1).pdf

h2. Concepts in Context (2010) workshop
- Conceptual Foundations for Semantic Mapping and Semantic Search.ppt
- Conference on Interoperability and Semantics in Knowledge Organization (2010).pdf
- FRSAD- Challenges of Modelling the Aboutness.ppt
- In Pursuit of Cross-Vocabulary Interoperability- Can We Standardize Mapping Types.ppt
- Insights and Outlooks- A Retrospective View on the CrissCross Project.ppt
- Interoperability and Semantics in RDF Representations of FRBR, FRAD and FRSAD.ppt
- Intersystem Relations- Characteristics and Functionalities.ppt
- Moving between Vocabularies- MACS 2010.ppt
- Retrieval Concepts and Mapping Strategies- Potential of CrissCross for Improving Access to DDC (2009).pdf
- Searching in a Multi-Thesauri-Scenario.pdf
- Semantic interoperability and retrieval paradigms (2010).pps
- Semantic Web View on Concepts and their Alignments- From Specific Library Cases to a Wider Linked Data Perspective.ppt

h2. CultureGraph-Metafacture
- CultureGraph- creating connections (2013).pdf
- CultureGraph- notes.txt
- culturegraph source.url

h2. Dagobert Soergel
- Art and Architecture Thesaurus (AAT)- A critical appraisal (1995)-long.pdf
- Art and Architecture Thesaurus (AAT)- A critical appraisal (1995)-short.pdf
- A universal source thesaurus as a classification generator (1972).pdf
- Concept analysis in thesaurus construction- example of prevention (2001).pdf
- Conceptual Foundations for Semantic Mapping and Semantic Search (Dagobert Soergel, 2010).ppt
- Design of an integrated information structure interface (2000).pdf
- Digital Libraries and Knowledge Organization (2008).pdf
- Digital Library Content Model (JCDL 2007).ppt
- Large multilingual vocabularies- structure and software requirements (1998).pdf
- Many uses of classification- Enriched thesauri as knowledge sources (2001).pdf
- Representing knowledge about databases in an intelligent guide to databases.pdf
- Semantic mapping through a concept hub.ppt
- Structure of taxonomies- Facets and hierarchy (2002).ppt
- TermMaster- Software for Creating and Maintaining Thesauri and Ontologies (2001).pdf
- Theoretical Problems of Thesaurus Building.pdf

h2. Evaluation
- Let's agree to disagree- on the evaluation of vocabulary alignment (KCAP 2011).pdf
- OAEI 2007 - Library matching - results.pdf
- OAEI 2007 food - Thesaurus Mapping Task.pdf
- OAEI 2007 food and environment-results.pdf
- OAEI 2007- Library matching - slides.pps
- OAEI food task- an analysis of a thesaurus alignment task (2010).pdf

h2. Multilingualism and Terminology (Linked Heritage Seminar 20130418)
- Agenda (Linked Heritage Seminar 20130418).pdf
- Controlled vocabulary for the enrichment of international metadata enabling Art nouveau in Europeana (Linked Heritage Seminar 20130418).pdf
- Multilighualism in European projects (Linked Heritage Seminar 20130418).pdf
- Multilingual terminology mapping in Europeana (Linked Heritage Seminar 20130418).pdf
- Multilingual thesaurus on music instruments (MIMO) (Linked Heritage Seminar 20130418).pdf
- Multilingual thesaurus on Photography (Europeana Photography) (Linked Heritage Seminar 20130418).pdf
- Recommendations and Guidelines for terminology (ATHENA and Linked Heritage) (Linked Heritage Seminar 20130418).pdf
- SemanticPédia- Cultural and semantic issues around the web of data (Linked Heritage Seminar 20130418).pdf
- SIERA- a multilingual knowledge sharing project (Linked Heritage Seminar 20130418).pdf
- Terminology Management Platform (TMP) (Linked Heritage) (Linked Heritage Seminar 20130418).pdf
- The publication and exploitation of knowledge structures with OpenSKOS (Linked Heritage Seminar 20130418).pdf
- Using VIAF in creating links to import information (Linked Heritage Seminar 20130418).pdf

h2. S-Match
- Computing Minimal Mappings (TR 2008).pdf
- Encoding Classifications As Lightweight Ontologies (TR 2006).pdf
- From Web Directories To Ontologies- NLP Challenges (TR 2007).pdf
- S-Match- an algorithm and an implementation of semantic matching (ESWC 2004).pdf
- S-Match- an open source framework for matching lightweight ontologies (SWJ 2010).pdf
- Semantic Matching (TR 2003).pdf

h2. SILK
- Discovering and Maintaining Links on the Web of Data (Silk ISWC2009 - Paper).pdf
- Discovering and Maintaining Links on the Web of Data (Silk ISWC2009 - Talk).pdf
- Efficient Multidimensional Blocking for Link Discovery without losing Recall (WebDB 2011).pdf
- LOD2- creating knowledge out of linked data (SILK) 201202.pdf
- section4slides.pdf
- Silk - A Link Discovery Framework for the Web of Data (LDOW 2009).pdf
- silk-book.htm
- silk-book.pdf
- Silk - Generating RDF Links while publishing or consuming Linked Data (ISWC 2010 poster).pdf
- SilkExperimentsLastFriday.ppt
- Silk Framework Intro.ppt
- Silk Server - Adding missing Links while consuming Linked Data (COLD 2010).pdf

- Vocabulary Matching for Book Indexing Suggestion in Linked Libraries (ISWC 2009)-paper.pdf
- Vocabulary Matching for Book Indexing Suggestion in Linked Libraries (ISWC 2009)-slides.pdf
- Vocabulary Matching for Book Indexing Suggestion in Linked Libraries (ISWC 2009)-slides.ppt
- Web-based Repository Service for Vocabularies and Alignments in Cultural Heritage (STITCH, ESWC10)-paper.pdf
- Web-based Repository Service for Vocabularies and Alignments in Cultural Heritage (STITCH, ESWC10)-slides.ppt

h2. string-matching
- Learning Conformation Rules for Linked Data Integration (CaRLA, OM 2012).pdf

h1. Specific Papers

h2. [Comparison of schema matching evaluations|]
Hong-Hai Do, Sergey Melnik, Erhard Rahm, International Workshop Web and Databases,
Lecture Notes in Computer Science, vol. 2593, Springer, Berlin, 2003

h2. [Thesaurus Alignment for Linked Data Publishing|]
Ahsan Morshed, Caterina Caracciolo, Gudrun Johannsen, Johannes Keizer (DC 2011)

[Abstract|]): As part of the publication of the AGROVOC thesaurus as Linked Data (LD), AGROVOC is now mapped with six well-known thesauri in the agricultural domain, i.e., EUROVOC, NALT, GEMET, STW, LCSH, RAMEAU. To find matching candidates, known matching algorithms discussed in the literature and available from public API were used. Results were evaluated by a domain expert, and almost total precision obtained. The candidate matches that were confirmed have already been added to the LD version of AGROVOC. Moreover, the owners of two of the thesauri mapped with AGROVOC have included in their data the mapping we identified. From this work, we conclude that we achieved our goal to enhance the Linked Data version of AGROVOC with reliable links to other thesauri, following a procedure that is fully replicable

- This paper is the basis of Dominic's spec
- Lists matching systems, esp ones evaluated by [OEAI|]. But this is Ontology Alignment Evaluation Initiative, does it include studies of instance matching?
-- COMA++, RiMOM, FALCON-AO, S-Match
-- S-Match uses WordNet for synonyms
- Describes AGROVOC, EUROVOC, NALT, GEMET, STW, LCSH, RAMEAU with 1 para each
- Considered all possible pairs for matching. This is computationally intensive, more advanced approaches use blocking/windowing (creating subsets/clusters)
- Validation was done manually by a multilingual expert, took 40p/d. He looked at all 30k matches exported to excel, and verified in each thesaurus, also looking at hierarchical context
- See [AGROVOC to NALT Analysis] for a detailed analysis

h2. [Semantic Problems of Thesaurus Mapping|]
Martin Doerr, Journal of Digital Information, 2001. [PDF|^Semantic Problems of Thesaurus Mapping (JODI 2001).pdf])
([Abstract|]: With networked information access to heterogeneous data sources, the problem of terminology provision and interoperability of controlled vocabulary schemes such as thesauri becomes increasingly urgent. Solutions are needed to improve the performance of full-text retrieval systems and to guide the design of controlled terminology schemes for use in structured data, including metadata. Thesauri are created in different languages, with different scope and points of view and at different levels of abstraction and detail, to accommodate access to a specific group of collections. In any wider search accessing distributed collections, the user would like to start with familiar terminology and let the system find out the correspondences to other terminologies in order to retrieve equivalent results from all addressed collections.
This paper investigates possible semantic differences that may hinder the unambiguous mapping and transition from one thesaurus to another. It focusses on the differences of meaning of terms and their relations as intended by their creators for indexing and querying a specific collection, in contrast to methods investigating the statistical relevance of terms for objects in a collection. It develops a notion of optimal mapping, paying particular attention to the intellectual quality of mappings between terms from different vocabularies and to problems of polysemy. Proposals are made to limit the vagueness introduced by the transition from one vocabulary to another. The paper shows ways in which thesaurus creators can improve their methodology to meet the challenges of networked access of distributed collections created under varying conditions. For system implementers, the discussion will lead to a better understanding of the complexity of the problem.

h2. [Effective Terminology Support for Distributed Digital Collections|]
Martin Doerr, DELOS Workshop, 1998

h2. [Method for Estimating the Precision of Placename Matching|]
Martin Doerr, Manos Papagelis, IEEE Trans. Knowl. Data Eng. 19(8): 1089-1101 (2007)
- [alt pdf|]
- [NKOS 2004 presentation|], [direct download|]

Estimates the precision of a place gazetteer based on probabilities of the same name to identify 1,2,3,etc places (i.e. place polysemy).

h2. [Matching unstructured vocabularies using a background ontology|]
Aleksovski, Z., Klein, M., ten Kate, W., van Harmelen, F. (EKAW 2006)

Abstract. Existing ontology matching algorithms use a combination of lexical and structural correspondance between source and target ontologies. We present a realistic case-study where both types of overlap are low: matching two unstructured lists of vocabulary used to describe patients at Intensive Care Units in two different hospitals. We show that indeed existing matchers fail on our data. We then discuss the use of background knowledge in ontology matching problems. In particular, we discuss the case where the source and the target ontology are of poor semantics, such as flat lists, and where the background knowledge is of rich semantics, providing extensive descriptions of the properties of the concepts involved. We evaluate our results against a Gold Standard set of matches that we obtained from human experts.
- uses hierarchical info for the matching