html5-img
1 / 26

Annual Review

Annual Review. NoE No. 507505 Semantic Interoperability and Data Mining in Biomedicine [SemanticMining]. Outline. Workshop on Natural Language Processing (D13.1) Multi-lingual medical dictionary (D20.1) Information Retrieval and Data Mining (D24.1).

pippa
Télécharger la présentation

Annual Review

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Annual Review NoE No. 507505 Semantic Interoperability and Data Mining in Biomedicine [SemanticMining] SemanticMining No.507505

  2. Outline • Workshop on Natural Language Processing (D13.1) • Multi-lingual medical dictionary(D20.1) • Information Retrieval and Data Mining (D24.1) SemanticMining No.507505

  3. WP13: Workshop on Natural Language Processing • Goals: 1. expand visibility of the semanticmining workshop; 2. establish forum for outside/inside network cooperation; 3. federate the NLP community in the biomedical domain; 4. organize a shared task to stimulate research in the domain, following well established challenges such as the TREC Genomics (http://trec.nist.gov/) or BioCreative(http://www.pdg.cnb.uam.es/BioLINK/BioCreative.eval.html). SemanticMining No.507505

  4. Workshop • Audience • Satellite of COLING: computer scientists, linguists, logicians… • Natural Language Processing/Information Retrieval • Medical informatics and Bioinformatics • 60 registered participants • Distribution • Table • Paper selection • 7 regular papers out of 30 submissions • 5 posters • Dissemination • Workshop printed proceedings • Website • Special issue under preparation (IJMI - Elsevier) SemanticMining No.507505

  5. Shared Task I • Background • Information access tools is increasing to support literature survey, • Online ‘portals’ where scientists can navigate • Genetics and disease databases • Ambiguous nomenclature: Gene/RNA/proteins • Scale up methods for processing full text articles etc. • Task • Annotate Gene and Protein Names (GPNs) i.e. find beginning and end of GPNs SemanticMining No.507505

  6. Shared task II • MEDLINE Corpus Trained on 2000 abstracts / Tested on 200 • Evaluation IOB recall and precision-like metrics • Participation • 12 participant team SemanticMining No.507505

  7. Evaluation • Criterion Q3: Valorisation and Dissemination Satisfying but internal impact could be improved SemanticMining No.507505

  8. Natural Language Processing Workshop 2005 SemanticMining No.507505

  9. SMBM 2005 • Symposium on Semantic Mining in Biomedicine: EBI, Hinxton, UK, 10-13 April, 2005 • 28 submissions • 12 accepted papers • 4 invited speakers • 4 Tutorials • Up to now about 60 registrations  http://www.ebi.ac.uk/Information/events/SMBM/ SemanticMining No.507505

  10. WP20: Multilingual Lexicon Three lines of work: • MorphoSaurus subword lexicon: Links minimal, semantically atomic lexical units in 6 languages (approx. 80,000 entries, 27,000 equivalence classes). Purpose: Cross-language text retrieval, semantic interface between medical dictionaries • Semi automated lexical acquisition: generating Spanish subwords out of Portuguese subwords, and Swedish out of German and English ones. • Common Lexicon Interchange Format Based on the (EU-funded) MULTEXT morpho-syntactic description. Facilitates the re-use of lexical resources SemanticMining No.507505

  11. Evaluation • Q2. Sharing of resources and use of research software tools Satisfying • Q6. Short and medium-term visits To be improved • Q7. Co-authoring of research papers, PhD… To be improved SemanticMining No.507505

  12. Multilingual Lexicon 2005 SemanticMining No.507505

  13. MorphoEdit lexicon editor MorphoSaurus segmenter & indexer Exchange of French and Swedish lexemes Catholic University of Paraná, Brazil Lexeme acquisition EHR indexing and retrieval 2005 : Multilingual Lexicon Dissemination and Standards activities Sharing Tools and Resources • Standardization of Lexicon Interchange Format • Negotiations on Semantic Medical document indexing with private and public partners in Germany • 1 IST call 4 proposal International Cooperation Fund Raising SemanticMining No.507505

  14. WP24: Information Retrieval and Data Mining • Semantic Interoperability • Normalized vocabulary (Gene Ontology, MeSH…) • Online integration tool: http://www.ebi.ac.uk/Rebholz-srv/whatizit/form.jsp • Information Retrieval and Extraction • Gene and Proteins, Drugs… • Protein Functions: apoptosis-induction… • Cellular Components: membrane, mitochondria.. • Biological Processes: digestion, reproduction… • Knowledge coupling • Uni-Prot (EU), MGI, LocusLink (US)  via Sequence Retrieval System  Need new Tools for Images and Full-text articles ! SemanticMining No.507505

  15. Entity Types SemanticMining No.507505

  16. Whatizit ! SemanticMining No.507505

  17. Biomedical Text (MEDLINE Abstract) Alterations in protein folding and the regulation of conformational states have become increasingly important to the functionality of key molecules in signaling, cell growth, and cell death. Molecular chaperones, because of their properties in protein quality control, afford conformational flexibility to proteins and serve to integrate stress-signaling events that influence aging and a range of diseases including cancer, cystic fibrosis, amyloidoses, and neurodegenerative diseases. We describe here characteristics of celastrol, a quinone methide triterpene and an active component from Chinese herbal medicine identified in a screen of bioactive small molecules that activates the human heat shock response. From a structure/function examination, the celastrol structure is remarkably specific and activates heat shock transcription factor 1 (HSF1) with kinetics similar to those of heat stress, as determined by the induction of HSF1 DNA binding, hyperphosphorylation of HSF1, and expression of chaperone genes. Celastrol can activate heat shock gene transcription synergistically with other stresses and exhibits cytoprotection against subsequent exposures to other forms of lethal cell stress. These results suggest that celastrols exhibit promise as a new class of pharmacologically active regulators of the heat shock response. SemanticMining No.507505

  18. Ontology-driven Knowledge Coupling (GO) Alterations in protein folding and the regulation of conformational states have become increasingly important to the functionality of key molecules in signaling, cell growth, and cell death . Molecular chaperones, because of their properties in protein quality control, afford conformational flexibility to proteins and serve to integrate stress-signaling events that influence aging and a range of diseases including cancer, cystic fibrosis, amyloidoses, and neurodegenerative diseases . We describe here characteristics of celastrol, a quinone methide triterpene and an active component from Chinese herbal medicine identified in a screen of bioactive small molecules that activates the human heat shock response . From a structure/function examination, the celastrol structure is remarkably specific and activates heat shock transcription factor 1 (HSF1) with kinetics similar to those of heat stress, as determined by the induction of HSF1 DNA binding, hyperphosphorylation of HSF1, and expression of chaperone genes . Celastrol can activate heat shock gene transcription synergistically with other stresses and exhibits cytoprotection against subsequent exposures to other forms of lethal cell stress . These results suggest that celastrols exhibit promise as a new class of pharmacologically active regulators of the heat shock response . SemanticMining No.507505

  19. Gene Ontology Browser SemanticMining No.507505

  20. Database-driven Knowledge Coupling (Swiss-Prot) Alterations in protein folding and the regulation of conformational states have become increasingly important to the functionality of key molecules in signaling, cell growth, and cell death . Molecular chaperones, because of their properties in protein quality control, afford conformational flexibility to proteins and serve to integrate stress-signaling events that influence aging and a range of diseases including cancer, cystic fibrosis, amyloidoses, and neurodegenerative diseases . We describe here characteristics of celastrol, a quinone methide triterpene and an active component from Chinese herbal medicine identified in a screen of bioactive small molecules that activates the human heat shock response . From a structure/function examination, the celastrol structure is remarkably specific and activates heat shock transcription factor 1 (HSF1) with kinetics similar to those of heat stress, as determined by the induction of HSF1 DNA binding, hyperphosphorylationof HSF1, and expression of chaperone genes . Celastrol can activate heat shock gene transcription synergistically with other stresses and exhibits cytoprotection against subsequent exposures to other forms of lethal cell stress . These results suggest that celastrols exhibit promise as a new class of pharmacologically active regulators of the heat shock response . SemanticMining No.507505

  21. Swiss-Prot Records SemanticMining No.507505

  22. Evaluation • Q2. Sharing of resources and use of research software tools Good • Q6. Short and medium-term visits To be improved • Q7. Co-authoring of research papers, PhD… To be improved SemanticMining No.507505

  23. Data Mining and Information Retrieval 2005 SemanticMining No.507505

  24. Whatizit! Images Full-text Citations Summer School 2005 Joint Publications PhD student exchange Oregon Health Science University (NSF-funded) Image + Text Retrieval ImageCLEF challenge E-Challenge conference SMBM workshop 3 days incl. tutorials 12 papers out of 28 Special Issue in Bioinformatics EAGL (Swiss-funded) Question-Answering 2 IST Call 4 proposals 2005 : Information Retrieval and Data Mining Dissemination and Standards activities Sharing Tools and Resources Fund Raising International Cooperation SemanticMining No.507505

  25. Distribution Asia: 8 Europe: 6 N.A: 2 [] SemanticMining No.507505

  26. B-X=Beginning of X, O=Non-Entity X, I-X=End of X TAR RNA X =RNA, DNA, proteins, cell-type independent O transactivation O by    O Tat    B-protein in    O cells    O derived    O from    O the    O CNS    B-cell_type a    O novel    O mechanism    O of    O HIV-1    B-DNA gene    I-DNA regulation    O [] SemanticMining No.507505

More Related