
D-square

D-square: Digital Databases and Tools for Dutch Regional Dictionaries. Folkert de Vriend, Methods XII, Moncton, Canada, 2005.


Presentation Transcript


  1. D-square: Digital Databases and Tools for Dutch Regional Dictionaries • Folkert de Vriend • Methods XII, Moncton, Canada, 2005

  2. Topics • Introduction dictionaries • Overview of project phases • Conversion • End user access

  3. Macrostructure WBD & WLD • Agricultural terminology • Other technical or craft terminologies • Common vocabulary

  4. Microstructure WBD & WLD Linguistic information in entries: • Phonetic (vol. I&II) or lexical (vol. III) variant • Heteronym (functions as headword) • Lexical meaning (word, or description of a concept) Non-linguistic information in entries: • Place name

  5. Example of WBD entry, volume 3

  6. Overview phases D-square • Conversion to a new format • End user access to data • Enrichment of data • Data management

  7. Phase 1: Conversion to a new format

  8. [Diagram: flow of WBD/WLD data from source material to end products. Labels recoverable from the slide, in their original order:] • Raw data (FileM Pro) • Edited data (XML) • Raw data: questionnaires Nijmegen and Leuven; questionnaires (chiefly) Meertens • (parts of) Vol. I+II: MS-Word • Vol. III: FileM Pro • Enriched data (XML) • (parts of) Vol. I+II: MacWrite • Vol. III: MS-Word • Vol. III: filing cards • Online DB WBD (Polderland) • Edited data: Vol. I+II, Vol. III • Website WBD/WLD with tools for searching and cartography • Specialized print editions (dialect atlas or local dictionary) • SGV on CD (Polderland)

  9. Reasoning behind new encoding • XML, not relational database • Tailored to WBD and WLD • Flexible enough to be used for other dialect dictionaries • Standards: ISO TC 37/SC 4 DCR and LMF. (DCR = flat concept registries)

  10. The maybug again…

  11. Data for “maybug” from database

  12. Example XML-encoding
      <LEXICON>
        <ENTRY>
          <CONCEPT ontol_ID="492">
            <WORD lang="Dutch">Meikever</WORD>
            <ILLUSTRATION uri="meikever.jpg"/>
            <DESCRIPTION></DESCRIPTION>
          </CONCEPT>
          <VARIANTS>
            <VARIANT type="heteronym">Bakkertje
              <VARIANT type="lexical">Bakkerke
                <VARIANT type="raw" import="diplomatic" source1="N83">bakkərkə
                  <LOCATION>K178</LOCATION>
                </VARIANT>
              </VARIANT>
            </VARIANT>
          </VARIANTS>
        </ENTRY>
        …
      </LEXICON>
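The nested VARIANT layers on this slide (heteronym, then lexical variant, then raw attestation with its location) can be read back out with a short script. What follows is only a minimal sketch in Python using the standard-library `xml.etree.ElementTree`; it is not part of the D-square tooling. The function names `flatten_variants` and `read_entries` are hypothetical, and the sample is the slide's entry normalized to well-formed XML (straight quotes, every VARIANT closed).

```python
import xml.etree.ElementTree as ET

# Sample entry taken from the slide. Assumption: straight quotes and a
# closing tag for every VARIANT, so that the text parses as XML.
SAMPLE = """
<LEXICON>
  <ENTRY>
    <CONCEPT ontol_ID="492">
      <WORD lang="Dutch">Meikever</WORD>
      <ILLUSTRATION uri="meikever.jpg"/>
      <DESCRIPTION></DESCRIPTION>
    </CONCEPT>
    <VARIANTS>
      <VARIANT type="heteronym">Bakkertje
        <VARIANT type="lexical">Bakkerke
          <VARIANT type="raw" import="diplomatic" source1="N83">bakkərkə
            <LOCATION>K178</LOCATION>
          </VARIANT>
        </VARIANT>
      </VARIANT>
    </VARIANTS>
  </ENTRY>
</LEXICON>
"""


def flatten_variants(node, path=()):
    """Yield (path, locations) for each VARIANT that has LOCATION children.

    `path` is a tuple of (type, form) pairs, outermost VARIANT first.
    """
    for variant in node.findall("VARIANT"):
        # The form is the text that precedes the first child element.
        form = (variant.text or "").strip()
        here = path + ((variant.get("type"), form),)
        locations = [loc.text for loc in variant.findall("LOCATION")]
        if locations:
            yield here, locations
        # Recurse into nested VARIANT elements (hypothetical helper logic).
        yield from flatten_variants(variant, here)


def read_entries(xml_text):
    """Return one row per located variant: (concept id, word, path, locations)."""
    root = ET.fromstring(xml_text.strip())
    rows = []
    for entry in root.findall("ENTRY"):
        concept = entry.find("CONCEPT")
        word = concept.find("WORD").text
        for path, locations in flatten_variants(entry.find("VARIANTS")):
            rows.append((concept.get("ontol_ID"), word, path, locations))
    return rows


rows = read_entries(SAMPLE)
for row in rows:
    print(row)
```

Each row keeps the full chain from heteronym down to the raw form, so the same attestation can be looked up by any of the three layers.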

  13. Example from WALD

  14. XML-encoding for WALD
      <LEXICON>
        <ENTRY>
          <CONCEPT ontol_ID="492">
            <WORD lang="Achterhoeks">Hals</WORD>
            <DESCRIPTION></DESCRIPTION>
          </CONCEPT>
          <VARIANTS>
            <VARIANT type="heteronym">Hals
              <LOCATION>Gor</LOCATION>
              <LOCATION>Harf</LOCATION>
              <LOCATION>Alf</LOCATION>
              <LOCATION>Eef</LOCATION>
              <LOCATION>...</LOCATION>
            </VARIANT>
            <VARIANT type="heteronym">Kael
              <LOCATION>Gor</LOCATION>
              <LOCATION>Eef</LOCATION>
              <LOCATION>Wich</LOCATION>
              <LOCATION>Gen</LOCATION>
              <LOCATION>...</LOCATION>
            </VARIANT>
          </VARIANTS>
        </ENTRY>
      </LEXICON>
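Because every heteronym in this encoding carries its own LOCATION list, the data can be inverted into a location-to-heteronyms index, which is roughly what a mapping tool would need as input. The sketch below is an illustration under assumptions, not the project's actual cartography code: `heteronyms_by_location` is a hypothetical name, the sample is the slide's WALD entry with straight quotes, and the slide's "..." placeholders are skipped rather than filled in.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# WALD entry from the slide, with straight quotes so that it parses.
# The "..." LOCATION values are the slide's own elisions and are kept as-is.
SAMPLE = """
<LEXICON>
  <ENTRY>
    <CONCEPT ontol_ID="492">
      <WORD lang="Achterhoeks">Hals</WORD>
      <DESCRIPTION></DESCRIPTION>
    </CONCEPT>
    <VARIANTS>
      <VARIANT type="heteronym">Hals
        <LOCATION>Gor</LOCATION>
        <LOCATION>Harf</LOCATION>
        <LOCATION>Alf</LOCATION>
        <LOCATION>Eef</LOCATION>
        <LOCATION>...</LOCATION>
      </VARIANT>
      <VARIANT type="heteronym">Kael
        <LOCATION>Gor</LOCATION>
        <LOCATION>Eef</LOCATION>
        <LOCATION>Wich</LOCATION>
        <LOCATION>Gen</LOCATION>
        <LOCATION>...</LOCATION>
      </VARIANT>
    </VARIANTS>
  </ENTRY>
</LEXICON>
"""


def heteronyms_by_location(xml_text):
    """Map each location code to the heteronyms attested there."""
    root = ET.fromstring(xml_text.strip())
    index = defaultdict(list)
    for variant in root.iter("VARIANT"):
        if variant.get("type") != "heteronym":
            continue
        form = (variant.text or "").strip()
        for loc in variant.findall("LOCATION"):
            code = (loc.text or "").strip()
            # Skip empty codes and the "..." elision placeholder.
            if code and code != "...":
                index[code].append(form)
    return dict(index)


index = heteronyms_by_location(SAMPLE)
for code, forms in index.items():
    print(code, forms)
```

Locations that list more than one heteronym (here Gor and Eef) are the ones where a map would have to show overlapping symbols.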

  15. Example from WZD

  16. XML-encoding for WZD
      <LEXICON>
        <ENTRY>
          <CONCEPT ontol_ID="292">
            <WORD lang="Dutch">de haard</WORD>
            <DESCRIPTION></DESCRIPTION>
          </CONCEPT>
          <VARIANTS>
            <VARIANT type="raw">'aerd (d'n)
              <LOCATION>Z.eil</LOCATION>
              <LOCATION>Z.V.W.</LOCATION>
              <LOCATION>L.v.Ax</LOCATION>
              <LOCATION>Hlt</LOCATION>
              <LOCATION>...</LOCATION>
            </VARIANT>
            <VARIANT type="raw">êêrd
              <LOCATION>Z.V.W.</LOCATION>
            </VARIANT>
          </VARIANTS>
        </ENTRY>
      </LEXICON>

  17. Phase 2: end user access

  18. The trouble with linking • At the conceptual level

  19. Existing ontology can be used

  20. The trouble with linking • At the conceptual level • At the level of the variants

  21. The maybug again (WVD)…

  22. …and again (WBD)

  23. Information about D-square www.ru.nl/dialect/d2

  24. Questions?
