This presentation covers the Library Track of the OAEI (Ontology Alignment Evaluation Initiative), discussing lessons learned, evaluation results, and future work. It examines whether ontology matching tools can align thesauri and evaluates their performance. It also highlights the importance of manual evaluation and the difficulty of distinguishing concepts with similar labels. Lessons learned include problems with the SKOS to OWL transformation and the finding that matching systems can still discover correct correspondences between thesauri. Future work involves updating the reference alignment, SKOS import for matching systems, and potentially using instance data to match thesauri. Contact dominique.ritze@bib.uni-mannheim.de for more information.
First Insights into the Library Track of the OAEI Dominique Ritze Mannheim University Library
Motivation • Publication x is indexed with the subject "Ontology Mapping" • Search subject (thesaurus 2): ontology alignment gives 0 results • Thesaurus 1 and Thesaurus 2 are mapped: Ontology Mapping = Ontology Alignment • Search subject (thesaurus 1): ontology alignment then finds Publication x
Overview • Ontology Matching • OAEI • Thesaurus vs. Ontology • OAEI Library Track 2012 • Lessons learned and Future Work
Ontology Matching • Diagram: two example ontologies with classes and properties such as Person, People, Author, CommitteeMember, PCMember, Reviewer, Review, Document, Doc, Paper, writes, reviews • Resulting correspondences: < Author, Author, =, 0.97 > < Paper, Paper, =, 0.94 > < reviews, reviews, =, 0.91 > < writes, writes, =, 0.7 > < Person, People, =, 0.8 > < Document, Doc, =, 0.7 > < Reviewer, Review, =, 0.6 > …
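The correspondences on this slide follow the pattern < entity 1, entity 2, relation, confidence >. A minimal Python sketch of that structure, using the values shown on the slide; the `Correspondence` class, the `filter_by_confidence` helper, and the 0.8 threshold are illustrative assumptions and not part of any OAEI tool:

```python
from typing import List, NamedTuple


class Correspondence(NamedTuple):
    """One alignment cell: <entity1, entity2, relation, confidence>."""
    entity1: str
    entity2: str
    relation: str
    confidence: float


# Values copied from the slide.
alignment = [
    Correspondence("Author", "Author", "=", 0.97),
    Correspondence("Paper", "Paper", "=", 0.94),
    Correspondence("reviews", "reviews", "=", 0.91),
    Correspondence("writes", "writes", "=", 0.7),
    Correspondence("Person", "People", "=", 0.8),
    Correspondence("Document", "Doc", "=", 0.7),
    Correspondence("Reviewer", "Review", "=", 0.6),
]


def filter_by_confidence(cells: List[Correspondence],
                         threshold: float) -> List[Correspondence]:
    """Keep only correspondences whose confidence reaches the threshold."""
    return [c for c in cells if c.confidence >= threshold]


# Hypothetical threshold; a real system would tune this value.
confident = filter_by_confidence(alignment, 0.8)
for c in confident:
    print(f"<{c.entity1}, {c.entity2}, {c.relation}, {c.confidence}>")
```

Raising the threshold drops low-confidence cells such as < Reviewer, Review, =, 0.6 >, which typically trades recall for precision.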
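Ontology Matching Evaluation • Diagram: a tool matches the ontologies O1 and O2 and produces an alignment A • A test compares A with the reference alignment R using a measure m and yields the result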
Ontology Alignment Evaluation Initiative (OAEI) • Annual campaign, started in 2005 • Different tracks/datasets • Benchmark, Anatomy, Conference, Multifarm, Large BioMed, Library, Instance Matching • 21 submitted systems (2012) • Goal: improving the performance of ontology matching systems • Through comparison of algorithms • New challenges for the systems
Thesaurus = Ontology? • Diagram: example thesaurus concepts Germany, Commodities, Tropical Fruit, Ananas • Metal Product -> Metal
OAEI Library Track Are current state-of-the-art ontology matching tools able to match thesauri? Dominique Ritze, Kai Eckert, Benjamin Zapilko, Joachim Neubert
Data Set • Thesaurus for Economics (STW) • 6,000 concepts with 19,000 additional keywords (EN, DE) • Thesaurus for the Social Sciences (TheSoz) • 8,000 concepts with 4,000 additional keywords (EN, DE, FR) • Reference alignment manually created in 2006 • Both actively used in libraries for keyword indexing
Execution • 7GB Debian machine • Timeframe 1 week • 13 of the 21 submitted systems were able to generate an alignment • No system had a heap space problem • Evaluation: Precision, Recall, F-Measure, Runtime
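Precision, recall, and F-measure compare the alignment a system produces with the reference alignment. The sketch below shows the standard definitions on toy data; the `evaluate` function and the concept identifiers are hypothetical and are not taken from STW, TheSoz, or the track's evaluation code:

```python
from typing import Iterable, Tuple

Pair = Tuple[str, str]


def evaluate(found: Iterable[Pair],
             reference: Iterable[Pair]) -> Tuple[float, float, float]:
    """Return (precision, recall, F-measure) of found against reference."""
    found_set, reference_set = set(found), set(reference)
    correct = found_set & reference_set

    # Precision: share of found correspondences that are in the reference.
    precision = len(correct) / len(found_set) if found_set else 0.0
    # Recall: share of reference correspondences that were found.
    recall = len(correct) / len(reference_set) if reference_set else 0.0
    # F-measure: harmonic mean of precision and recall.
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_measure


# Hypothetical concept identifiers, for illustration only.
found = [("a1", "b1"), ("a2", "b2"), ("a3", "b9")]
reference = [("a1", "b1"), ("a2", "b2"), ("a4", "b4"), ("a5", "b5")]

precision, recall, f_measure = evaluate(found, reference)
print(f"P={precision:.2f} R={recall:.2f} F={f_measure:.2f}")
```

Note that a correct correspondence missing from the reference alignment counts as an error under these definitions, so scores depend on how complete the reference is.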
Results • How to evaluate the results? • Is an F-Measure of 0.67 good?
Manual Evaluation • Between 38 and 269 new correct correspondences found per matcher • Up to half of the correspondences correct • Many new correspondences are quite simple • Some more “complex” and interesting ones • Automated production = CAM • Several incorrect ones if the labels are quite similar • Difficult to distinguish the names of countries, their inhabitants and the languages
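To illustrate why similar labels are hard to tell apart, the sketch below scores a few labels with a plain string similarity from Python's standard library. The labels are hypothetical examples of a country, its inhabitants, and its language; they are not taken from the thesauri, and the participating systems may compute similarity quite differently:

```python
from difflib import SequenceMatcher


def label_similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity in the range [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


# Hypothetical labels: country, inhabitants, language.
labels = ["Germany", "Germans", "German"]

# Score every unordered pair of distinct labels.
pairs = [
    (a, b, label_similarity(a, b))
    for i, a in enumerate(labels)
    for b in labels[i + 1:]
]

for a, b, score in pairs:
    print(f"{a} / {b}: {score:.2f}")
```

All three pairs score high even though the concepts differ, so a matcher that relies mainly on label similarity would likely propose incorrect correspondences here.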
Lessons Learned • Transformation SKOS to OWL causes some problems, especially regarding the labels • Ontology matching systems are nevertheless able to match the thesauri and even discover unknown correct correspondences • Interest of the community in this topic
Future Work • Update the reference alignment and adapt the results accordingly • SKOS import for matching systems • Use instance data to match thesauri? • Other thesauri?
Thank you for your attention! dominique.ritze@bib.uni-mannheim.de