280 likes | 407 Vues
The MACS Project, initiated in 1997, focuses on enhancing multilingual search capabilities across various domains including culture and academia. This presentation discusses the history of the project, the development of the Link Management Interface (LMI), and the integration of authority data such as RAMEAU and LCSH. Key milestones include establishing links across languages and disciplines, improving retrieval results without using language as a pivot, and promoting Linked Data for better interoperability. The project aims to facilitate access to multilingual resources for users and institutions.
E N D
Translingual Retrieval Movingbetweenvocabularies MACS 2010 Jahns / Karg, Deutsche Nationalbibliothek Concepts in Context - Cologne Conference on Interoperability and Semantics in Knowledge Organization 2010-07-19
MACS Project history 1997 CENL-MACS project started 1997-1999 feasibility study sports & theatre 2000-2001 prototype of a web application 2002-2004 LMI (link management interface) was established 2005 RAMEAU-LCSH-Links were loaded by BnF 2006 CrissCross started
MACS Project progress • 2007 SNB startedediting links on a regularbasis • 2008 SNB and DNB preparedworkdivision • 2008 update and installationof an update routine • forthethreeindexinglanguages • 2008-2009 recruitingfreelancers, training • translatorsworkphase • 2009 TEL prototype of a multilingual search • interface 58.907 EN-FR-GER links
70,000 Tuple RAMEAU - LCSH from BnF 50,000 SWD conceptstoconnectwith SWD-topicalheadingsof ALL disciplines 12 workpackagesforneighbouringdisciplines MACS Project scope
MACS Working area – LMI https://macs.hoppie.nl/lmi/ • Searching and listingsubjectheadings
Update mechanism and identifier Working area – LMI
Searching links Working area – LMI
Editing links / Creatingnew links Working area – LMI
Filtering DDC-domains Working area – LMI
Methodology • Nolanguageisusedas a pivot • Headingsare not translated • Mapping isdone on thebasisofconcepts • Linking ischecked upon retrievalresults • Set ofpracticalrules and guidelines (BS 8723)
Language differences • LCSH, RAMEAU pre-coordinatedstrings / plural • SWD more flexible; post-coordination / singular • LCSH, RAMEAU subdivisionpractice 160,000 280,000 Frauenarbeit Women -- employment Mörder AND Intensivtäter Serial murderers 96,000 Femmes -- travail Tueurs en série
Equivalence 1 :1, 1 : n (including AND) Null: nomapping Types of links Theologie Theologie Theology Athlétisme AND Entraîneurs Leichtathletiktrainer Track and field coaches Traducteurs Übersetzer Translators Traducteurs Dolmetscher Translators 0 Erdalkalichloride Alkaline earth chlorides
Equivalence • exact • inexact = overlapping • partial equivalent Ètudiants japonais Japanischer Student japanese students Impérialisme Neokolonialismus Imperialism Cas (linguistique) Dativ 0
Mapping problems Cider • Different coverage • Different semantics, semanticrelations • Language skills Apfelwein Civilisation, Ancient Antike? Altertum! Cidre Civilsation antique
Access tomappings • Mapped terms in LMI -> XML-Download • Linked Open Data • Multilingual equivalent relations in authority records
Linked Data:Publishing MACS in the web • Linked Data: Datasets on the web linkedtogethersemantically, forming a publiclyaccessiblenetwork of structuredinformation
Why Linked Data? • Hasbecomethe de-facto standardfor: • publishing • exchangingdata on the Internet • Reachesmorecollaborationpartners and customers • Research institutions • Cultural heritageorganizations • Internet companies • online communities • Library datacanplay a majorrole in thesemantic web • providesreliability and stability • canbecome a „backboneoftrust“
MACS as Linked Data • Linked Data publishedbytheGerman National Library (asof May 30th, 2010) • 37.547 links from SWD to LCSH • 28.249 links from SWD to RAMEAU • RAMEAU and LCSH also publish MACS relationsaspartoftheirLinked Data • Ourdatahasbeenusedto update the RAMEAU SWD links • LCSH SWD links will follow
Applicationsof MACS Data • Improved Multilingual Search Functionalities • Europeana • Deutsche Digitale Bibliothek • Website of the German National Library
Cataloguers benefit Automatic Indexing: 5050 610 5100 !040537242! |s|Schwangerschaft ; 5101 !040153320! |s|Ernährung
Users benefit • Search with the native language • Enlargement of recall • Additional recall in the own catalogue • Enlargement of the query to foreign catalogues and portals • Precision of recall
Users benefit searchterm Ärztlicher Behandlungsfehler Dokuments: in DNB SNL LoC BL BnF and so on....
Multilingual meta-searching • One-stop-searchingforheterogeneouslyindexeddocuments • Authority data (mapping) displayedtoenhancesearching and understanding • Gettingresultsfrom different libraries • Europe comestogether: TEL, Europeana
Library users of multicultural societies • Searchingforsubjects in thepreferredlanguageoftheuser • Crossinglanguagebarriers in academic and publiclibraries • 12 % foreignstudents in Germany
Thank you! Merci Danke h.karg@d-nb.de y.jahns@d-nb.de