1 / 29

Victor BARANOV Linguistics Department Izhevsk State Technical University

Information-Analytical System “Manuscript”: technologies and tools of creation of electronic collections of ancient and medieval documents. Victor BARANOV Linguistics Department Izhevsk State Technical University Laboratory of Computer-Aided Philological Research Udmurtia State University.

aden
Télécharger la présentation

Victor BARANOV Linguistics Department Izhevsk State Technical University

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Information-Analytical System “Manuscript”: technologies and tools of creation of electronic collections of ancient and medieval documents • Victor BARANOV • Linguistics Department • Izhevsk State Technical University • Laboratory of Computer-Aided Philological Research • Udmurtia State University

  2. Title page of the portalof IAS “Manuscript” Digital Historical Corpora

  3. Model of hierarchies and subnets of manuscript and text units Digital Historical Corpora

  4. Net of linguistic relationships Text <…> се быша дроузи мои .<…> Εnd of the “single" relationship Relationship Predicate part се быша дроузи мои . Εnd of the “multiple" relationship Средство связи Mean of relationship Syntactic group се быша дроузи мои Word-form се быша дроузи мои Word-combination Дроузи мои быша дроузи Co-ordination се быша дроузи Dependence с е б ы ш а д р оу з и м о и . Digital Historical Corpora

  5. Model of the Manuscript system Digital Historical Corpora

  6. Editor OldEd: main panels Digital Historical Corpora

  7. Editor OldEd: Text input and editing Digital Historical Corpora

  8. Editor OldEd: Fragmentation of the manuscript texts into units and relationships with the dictionary units Dictionary of fragments Properties of fragments Fragments Digital Historical Corpora

  9. Editor OldEd: Visualization of unit relationships Symbol Geometric hierarchy: Line Page Linguistic hierarchy: word-form normalize forms Dictionary: Lemma Properties and values of the Lemma Dictionary: word-forms of texts Digital Historical Corpora

  10. Editor OldEd: Page layout Digital Historical Corpora

  11. Result of creation of the layout on the site Marginalia Marginalia Marginalia Digital Historical Corpora

  12. Automated lemmatization and establishing relationships between words and lemmas Digital Historical Corpora

  13. Electronic edition: search page Collections & Manuscripts Search criteria Search result Digital Historical Corpora

  14. Search result: word index and concordance Digital Historical Corpora

  15. Module of retrievals: selection of the text Digital Historical Corpora

  16. Module of retrievals: selection of the unit Digital Historical Corpora

  17. Module of retrievals: setting the unit properties and values Digital Historical Corpora

  18. Module of retrievals: saving the query Digital Historical Corpora

  19. Module of retrievals: specifying the compositionof the query result Digital Historical Corpora

  20. Comparative index of the wordforms Digital Historical Corpora

  21. Comparative index of the fragments Digital Historical Corpora

  22. Grammar dictionaries Grammar dictionary of the modern Russian language Grammar dictionary of the Old Russian language Grammar dictionary of the Old Slavonic language Grammar dictionarypseudo-elements Text N Text 6 Text 5 Text 4 Text 3 Text 2 Text 1 Digital Historical Corpora

  23. Grammar dictionaries: retrieval form Digital Historical Corpora

  24. Grammar dictionaries: bringing the Old Russian word-forms to the lemma Digital Historical Corpora

  25. Grammar dictionaries: оbtaining paradigm of lemma Digital Historical Corpora

  26. Electronic editions Digital Historical Corpora

  27. Electronic edition:reverse index of word-forms and context Digital Historical Corpora

  28. Acknowledgment The work on the creation of IRS Manuscript is being carried out with the support from the Russian Foundation of Basic Research (Grant # 05-07-90217в). Τhe work on the creation of the automated morphologic analyzer with the support of the Russian Foundation for the Humanities (Grant # 05-04-12408в). Digital Historical Corpora

  29. Contacts Laboratory of Computer-Aided Philological Research Udmurtia State University Linguistics Department Izhevsk State Technical University Izhevsk, Russia Victor Baranov - baranov@udm.ru http://manuscripts.ru/index_en.html Digital Historical Corpora

More Related