Enriching a multilingual ontology from texts for breast cancer

Michel SIMONET, Christoffel DHAEN, Frank BALDESARE, Delphine BERNHARD, Hans COOLS, Gayo DIALLO, Mariana CASELLA dos Santos, Patrick PALMER, Sylvain FERRIOL. Séminaire Web Sémantique Médical, Rouen, 9 March 2004.

Presentation Transcript


  1. Enriching a multilingual ontology from texts for breast cancer. Michel SIMONET, Christoffel DHAEN, Frank BALDESARE, Delphine BERNHARD, Hans COOLS, Gayo DIALLO, Mariana CASELLA dos Santos, Patrick PALMER, Sylvain FERRIOL. Séminaire Web Sémantique Médical, Rouen, 9 March 2004. Affiliations: TIMC-IMAG (Grenoble, France); Language & Computing (St.-Denijs-Westrem, Belgium); Guy’s and St Thomas’ Hosp. (London, UK). INFACE project, 5th Framework Programme (5ème PCRD).

  2. Context and Objectives
     • European project INFACE (5th Framework Programme, 5ème PCRD)
     • Objective: support information retrieval for physicians working in the field of breast cancer
     • Search guided by an ontology
     • Partners: English, French, Belgian, Greek and Cypriot
     WSM 2004

  3. Existing resources
     • LinkBase: ontology of 2.1 M concepts (Language & Computing)
     • Subset of 3,560 concepts (breast cancer)
       - 8,600 English terms
       - 4,500 French terms

  4. Enrichment
     • TIA methodology (French and English): Corpus → Terms → Concepts, plus translation of the terms into Greek
     • French and English corpora
       - scientific articles
       - medical reports (radiology, anatomical pathology, …)
       - clinical protocols
       - medical terminologies
       - web pages
     • Terminology extraction + concordancer
     • Terms absent from LinkBase

  5. Extraction of relevant terms [diagram]. Labels: CORPUS; Annotated CORPUS; TeSSI Indexing; Concordancer; Terminology extraction; POTENTIAL TERMS (already present in LinkBase / absent from LinkBase); Validation by experts; List of terms relevant to breast cancer.

  6. Extraction of relevant terms [diagram]. CORPUS → list of terms relevant to breast cancer → comparison with LinKBase → list of relevant terms absent from LinKBase → evaluation and modeling by a LandC expert. New terms: + 1,800 French terms, + 700 English terms. New concepts: 1,038. 5,000 terms translated into Greek.
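
The comparison step on this slide can be pictured as a set difference between the terms extracted from the corpus and the terms already attached to concepts. The sketch below is only an illustration under stated assumptions: the function names `normalize` and `absent_terms` and the sample terms are hypothetical, and the slides do not say how terms were actually matched against LinKBase.

```python
import unicodedata


def normalize(term: str) -> str:
    """Lowercase, strip accents and collapse whitespace so variants compare equal."""
    decomposed = unicodedata.normalize("NFKD", term)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return " ".join(no_accents.lower().split())


def absent_terms(candidates, known_terms):
    """Return candidate terms whose normalized form is missing from the known vocabulary.

    Order of first appearance is kept and duplicates are reported only once.
    """
    known = {normalize(t) for t in known_terms}
    seen = set()
    missing = []
    for term in candidates:
        key = normalize(term)
        if key not in known and key not in seen:
            seen.add(key)
            missing.append(term)
    return missing


# Made-up sample data, not taken from the project.
known_sample = ["carcinome in situ", "Chirurgie du sein"]
corpus_sample = ["Carcinome in situ", "chirurgie du sein",
                 "curage axillaire", "Curage  axillaire"]
print(absent_terms(corpus_sample, known_sample))  # ['curage axillaire']
```

Normalizing before comparing avoids reporting a term as new only because of case, accents or spacing; the terms that remain still need expert review, as the slide indicates.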

  7. Concordancer
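
A concordancer shows each occurrence of a term together with its surrounding context, which helps an expert judge how the term is used. The following is a minimal keyword-in-context sketch, not the tool shown in the talk; the function name `concordance`, the `width` parameter and the sample sentence are assumptions made for illustration.

```python
import re


def concordance(text, term, width=30):
    """Return one line per occurrence of `term`, with up to `width`
    characters of context on each side (case-insensitive match)."""
    lines = []
    for match in re.finditer(re.escape(term), text, flags=re.IGNORECASE):
        left = text[max(0, match.start() - width):match.start()]
        right = text[match.end():match.end() + width]
        # Right-align the left context so the matched terms line up in a column.
        lines.append(f"{left:>{width}} [{match.group(0)}] {right:<{width}}")
    return lines


# Made-up sample text, not taken from the project corpus.
sample = ("Biopsy confirmed a carcinoma in situ of the left breast. "
          "No invasive carcinoma was seen.")
for line in concordance(sample, "carcinoma", width=20):
    print(line)
```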

  8. LinKBase Structure Overview. Mariana Casella dos Santos, Ontology Manager. Email: mariana@landc.be. http://www.landcglobal.com

  9. LinKBase Structure [diagram]. Labels: META-ENTITY FIELD (storage for external vocabularies); DOMAIN-ENTITY FIELD (Realist Ontology); Elements of BFO Upper Level Ontology; Medical Domain Concepts; Linguistic Elements; Link; external vocabularies MEDCIN, CPT-4, CCC, ICD-9-CM, SNOMED.

  10. LinKBase Domain Structure [diagram]. Labels: DOMAIN-ENTITY, with IS-A links to SUBSTANCE, PROCESS and PROPERTY; DOING; HAPPENING; BREAST; BREAST SURGERY (IS-A, Has-Actee); BREAST SWELLING (IS-A, Has-Systemic-Medium); Participation Link; Inherence Link.
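
One way to picture this structure is as a set of typed links between concepts. The sketch below encodes one plausible reading of the flattened diagram; the exact attachment of each link (for example BREAST under SUBSTANCE, or DOING and HAPPENING under PROCESS) is an assumption, and the `Ontology` class is hypothetical rather than LinKBase's actual data model.

```python
from collections import defaultdict


class Ontology:
    """Tiny typed-link store: concept -> list of (link_type, target)."""

    def __init__(self):
        self.links = defaultdict(list)

    def add(self, source, link_type, target):
        self.links[source].append((link_type, target))

    def ancestors(self, concept):
        """Concepts reachable through IS-A links, nearest first (breadth-first)."""
        result = []
        frontier = [concept]
        while frontier:
            current = frontier.pop(0)
            for link_type, target in self.links.get(current, []):
                if link_type == "IS-A" and target not in result:
                    result.append(target)
                    frontier.append(target)
        return result


onto = Ontology()
# Assumed reading of the diagram, see the note above.
for child in ("SUBSTANCE", "PROCESS", "PROPERTY"):
    onto.add(child, "IS-A", "DOMAIN-ENTITY")
onto.add("DOING", "IS-A", "PROCESS")
onto.add("HAPPENING", "IS-A", "PROCESS")
onto.add("BREAST", "IS-A", "SUBSTANCE")
onto.add("BREAST SURGERY", "IS-A", "DOING")
onto.add("BREAST SURGERY", "Has-Actee", "BREAST")
onto.add("BREAST SWELLING", "IS-A", "HAPPENING")
onto.add("BREAST SWELLING", "Has-Systemic-Medium", "BREAST")

print(onto.ancestors("BREAST SURGERY"))  # ['DOING', 'PROCESS', 'DOMAIN-ENTITY']
```

Keeping IS-A apart from the other link types lets the hierarchy be traversed on its own, while links such as Has-Actee relate a process to the substance it involves.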

  11. INTEGRATION PROCESS

  12. Find related ones for modeling. Create non-existing concepts.

  13. Term entered for the pre-existing concept CARCINOMA IN SITU

  14. Extracting Relevant Terms [diagram]. Analysis by LandC Ontologist: selection of MEDCIN Breast Pathology related top-level concepts (MEDCIN : 39997 : BREAST DISORDERS; MEDCIN : 41587 : BREAST SURGERY; MEDCIN : 1884 : BREAST SYMPTOMS; ...). Advanced Query Beans on MEDCIN CORE: all MEDCIN Breast Pathology related concepts. Adv. Query Beans: all MEDCIN Breast Pathology related concepts not yet modeled. Modeling by LandC Ontologist (MEDCIN CONCEPT, ENGLISH TERM, MODELED DOMAIN CONCEPT). French translation by French-speaking LandC Ontologist (MODELED DOMAIN CONCEPT, MEDCIN CONCEPT, ENGLISH TERM, FRENCH TERM).

  15. Conclusion and Perspectives
     • Terminology extraction - -
     • Repeated-segments method
     • Multilingual
     • Concordancer + +
     • Substantial expert work
     • Non-exhaustive result
     • Ontology for patients
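
The repeated-segments method mentioned above proposes as candidate terms the word sequences that recur in the corpus. The sketch below is a simplified illustration of that idea, assuming plain n-gram counting within sentences; the function name, its parameters and the sample sentences are hypothetical, and the filtering used in the project is not described in the slides.

```python
import re
from collections import Counter


def repeated_segments(sentences, min_len=2, max_len=4, min_count=2):
    """Count word sequences of min_len..max_len words that never cross a
    sentence boundary, and keep those seen at least min_count times."""
    counts = Counter()
    for sentence in sentences:
        tokens = re.findall(r"\w+", sentence.lower())
        for n in range(min_len, max_len + 1):
            for i in range(len(tokens) - n + 1):
                counts[tuple(tokens[i:i + n])] += 1
    return {" ".join(seg): c for seg, c in counts.items() if c >= min_count}


# Made-up sample sentences, not taken from the project corpus.
sample_sentences = [
    "Invasive breast cancer was found.",
    "Screening for breast cancer is recommended.",
]
print(repeated_segments(sample_sentences))  # {'breast cancer': 2}
```

Raw frequency tends to return many sequences that are not terms, which fits the slide's remark that substantial expert work remains and that the result is not exhaustive.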
