1 / 12

SemanticMining: Major Events July - December 2005

SemanticMining: Major Events July - December 2005. Workshop on Foundations of Clinical Terminologies and Classifications. Location: Timi ş oara, Romania, April 8, 2006 Collocation: EFMI Special Topic Conference Endorsement: EFMI, GMDS, SemanticMining Local Organizer: George Mihalas

maegan
Télécharger la présentation

SemanticMining: Major Events July - December 2005

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. SemanticMining:Major Events July - December 2005

  2. Workshop on Foundations of Clinical Terminologies and Classifications • Location: Timişoara, Romania, April 8, 2006 • Collocation: EFMI Special Topic Conference • Endorsement: EFMI, GMDS, SemanticMining • Local Organizer: George Mihalas • Scientific Chairs: Stefan Schulz, Jeremy Rogers. SPC to be defined • Invited Speakers: Olivier Bodenreider, N.N. • Submission Deadline: Jan 15 • Bursaries by Semantic Mining

  3. WP 20: Research Activity Multilingual Medical Dictionary

  4. WP20 Partners contributing to activities in 2005 Jul-Dec • UKLFR Freiburg University Hospital Medical Informatics • JENA University of Jena Computational Linguistics • LIU IMT Linköping Univ. Medical Informatics • LIU IDA Linköping Univ. Computer Science • UGOT Göteborg University • SU Sahlgrenska Hospital Göteborg • DIM Geneva Univ. Hospital Med. Informatics • OU Open University • INSERM Paris, Med. Informatics

  5. Revised Subtasks of WP20 • Development and Implementation of Lexical Acquisition Methodologies • Population of Medical Subword Lexicon • Specification and Acquisition of Multilingual Lexical Resources • Specification and Acquisition of Multilingual Corpora

  6. WP 20 achievements • Milestone 4 reached: interchange format for a multi-lingual medical dictionary, multi-lingual medical dictionary in at least five languages with more than 20,000 entries • WP20 workshop in September (Paris) • Specification of Common Link Format • Ongoing enhancement of the MorphoSaurus Lexicon • Enhancement of the MorphoEditWeb lexicon editor • Use of standardized IR performance measurements in order to assess the progress in the lexicon • Research on subword cognate aquisition. • Ongoing feasibility studies and prototypical implementations in collaboration with German partners (industry, government) • Implementation of corpus exchange guideline

  7. WP 20 achievements (continued) • Use of NLG techniques for the automated generation of Procedure description out of a formalized procedure ontology • Automatic annotation of medical corpus with morphosyntactic and semantic information. • Restructuring of LEXIN lexicon (20,000 entries) into a medically-oriented resource • Compound decomposition, named entity recognition, and acronym decomposition for Swedish • Adaptation of ITools for French terminology extraction • Continuation of preparation of parallel French-English medical corpus • Semi-automatic enrichment of French subword lexicon by exploiting existing subword lexicons in other languages

  8. WP 20 Cross-WP activities • WP 21 and WP 26. Use of Morphosaurus indexing for content retrieval in EHR archetype descriptions. • WP21: Joint proposal on biomedical processes within biomedical ontologies • WP22 translation of SNOMED CT into Swedish • WP22 use of SNOMED Microglossary for a bilingual English-Russian lexicon • WP24 Elaboration of a white paper on token and sentence segmentation • WP23 (planned): Acquisition of Multilingual Terms from Lab Medicine

  9. WP20 Strategic Planning • Using the common interchange and link formats, multilingual linking/merging of lexicons plus semantic classes from subword lexicons • Corpora pool as new deliverable for 2006 • Extension of the parallel corpora alignment for new language pairs • Ongoing population and cleansing of subword lexicons, extension of the corpus-based validation of subword lexicons to new language pairs • Cross-validation of lexicons (e.g. removing duplicates) • IR studies with new language pairs • Experimental addition of Italian and Russian to the subword lexicon using automated • Workshop at LREC (accepted) • Tutorial at MIE

  10. WP 20 Mobility • Rahil Qamar from UOM to UKLFR • Vincent Claveau from INSERM to UKLFR • Mikael Nyström from LIU to UKLFR • Louise Déléger from INSERM to LIU • Caspar Hasenclever from UKLFR to JENA • Michael Poprat from JENA to UKLFR • Joachim Wermter from JENA to UKLFR • Harald Kirsch from EBI to UKLFR

  11. WP 20 Joint Publications • R.H. Baud, M. Nyström, L. Borin, R. Evans, S. Schulz, P. Zweigenbaum. Interchanging Lexical Information for a Multilingual Dictionary. AMIA 2005 annual symposium. Accepted for publication. • K. Markó, S. Schulz, U. Hahn: Automatic Lexicon Acquisition for a Medical Cross-Language Information Retrieval System. Proceedings of the XIX International Congress of the European Federation for Medical Informatics (MIE '05), Geneva, Switzerland. 2005: 829-834. • K. Markó, S. Schulz, O. Medelyan, U. Hahn: Bootstrapping Dictionaries for Cross-Language Information Retrieval. Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '05 ), Salvador, Brazil. 2005: 528-535. • K. Markó, S. Schulz, U. Hahn: MorphoSaurus - Design and Evaluation of an Interlingua-based, Cross-language Document Retrieval Engine for the Medical Domain. Methods of Information in Medicine. 4/2005(44): 537-545 • K. Markó, S. Schulz, U. Hahn: Unsupervised Multilingual Word Sense Disambiguation via an Interlingua. Proceedings of the 20th National Conference on Artificial Intelligence (AAAI '05), Pittsburgh, Pennsylvania. 2005: 1075-1080 • M. Poprat, U. Hahn. Enough is Enough – Estimating Upper Bounds of the Size of Training Corpora for Unsupervised PP Attachment Disambiguation. Proceedings of Fifth International Conference on Recent Advances in Natural Language Processing (RANLP-2005) • K. Markó, S. Schulz and U. Hahn. Multilingual Lexical Acquisition by Bootstrapping Cognate Seed Lexicons. Proceedings of Fifth International Conference on Recent Advances in Natural Language Processing (RANLP-2005) • U. Hahn, P. Daumke, S. Schulz, K. Markó: : Cross-Language Mining for Acronyms and their Completions from the Web. Proceedings of the 8th International Conference on Discovery Science (DS '05), Singapore. 2005. • S. Schulz, K. Markó, R. L. de Andrade, E. Pacheco, P. Nohama, U. Hahn, M. Romacker: The Morphosaurus Medical Subword Lexicon. Lexicographic and Semantic Aspects. Proceedings of the 3th Workshop em Tecnologia da Informação e da Linguagem Humana (TIL '05), São Leopoldo, Brasil. 2005 • Mikael Nyström, Magnus Merkel, Lars Ahrenberg, Michael Petterstedt, Håkan Petersson &  Hans Åhlfeldt. Generering av ett medicinskt engelskt-svenskt lexikon med hjälp av interaktiv ordlänkning. Accepted for Svenska Läkaresällskapets riksstämma (Annual Meeting of Swedish Society of Medicine). • K. Marko, P. Daumke, S. Schulz, U. Hahn. Automatische Generierung einer sprachübergreifenden Akronymdatenbank. 50. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie (gmds), Freiburg 11. - 15. September 2005 (Annual Meeting of the German Society of Medical Informatics, Biometry and Epidemiology) • M. Poprat, K. Markó, U. Hahn. Automatische Klassifikation medizinischer Dokumente nach Sprache und Zielgruppe für Text-Retrieval-Systeme. 50. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie (gmds), Freiburg 11. - 15. September 2005 (Annual Meeting of the German Society of Medical Informatics, Biometry and Epidemiology) • Website: http://www.morphosaurus.net, updated in August 2005

More Related