
Genopolis Microarray DB a Progress Report

Genopolis Microarray DB a Progress Report. Marco Brandizi <marco.brandizi@unimib.it>. Dec 12, 2005. Dottorato in Informatica XIX Ciclo. Outline. Introduction GCA Application Main features Demo Demo/Gene Browser Recent added features Access control Search & Save Ongoing and future



  1. Genopolis Microarray DB: a Progress Report. Marco Brandizi <marco.brandizi@unimib.it>. Dec 12, 2005. PhD Program in Computer Science (Dottorato in Informatica), XIX Cycle

  2. Outline
     • Introduction
     • GCA Application
        • Main features
        • Demo
        • Demo/Gene Browser
     • Recently added features
        • Access control
        • Search & Save
     • Ongoing and future
        • MAGE Export
        • Migration on cluster
        • Management of knowledge about Higher Level Analysis
        • Other possible developments

  3. Genes Machine gene DNA mRNA protein Cell/Life

  4. Microarray Data, conceptual model

  5. Microarray Data Management Issues
     • Experimental data vs. sequence data:
        • Context dependent (living system, experimental conditions)
        • Lack of a standard unit of measure
        • Several normalization methods
        • Multiple platforms and methods
     • No standard for data annotation
        • Vocabularies and terminology coherence
        • Details about: experiment, source, protocols, experimental conditions

  6. Microarray Data Management Issues / 2
     • Evidence about data quality
     • What to store?
        • Raw images
        • Computed values
        • Normalized values
     • How to find data
        • Systems aware of complex vocabularies (ontologies)
        • Data mining and experiment comparison tools
     • Data access control

  7. MIAME Experiment Modeling


  9. GCA Features
     • Curated experimental design representation
        • MIAME-compliant (although with a simplified model)
        • Use of controlled vocabularies
        • Experiment checking/publishing, with supervision
     • Targeted to the Affymetrix platform
        • Chip description is simple, imported from NetAffx
        • Single-channel technology
     • Access control
        • Users are organized into groups and access roles
        • Experiments belong to user groups

  10. GCA Features
     • Data retrieval and visualization
        • Gene browser, a graphical visualization interface based on the matrix model
        • Search & Save data
     • Current content:
        • A set of time courses of DCs stimulated with different stimuli
     • Implementation & deployment
        • LAMP application (Linux + Apache + MySQL + PHP)
        • Model-View-Controller as much as possible:
           • Business objects layer
           • Presentation widgets (DAO-lib)
           • Other application control layers
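The "matrix model" behind the gene browser is commonly understood as a genes-by-samples table of expression values. A minimal Python sketch of that idea for a time course follows; the probe set names, time points and values are invented for illustration and are not taken from GCA, whose actual implementation is in PHP.

```python
from typing import Dict, List

# Hypothetical time points (matrix columns); not taken from the GCA database.
TIME_POINTS: List[str] = ["0h", "2h", "4h", "8h"]

# Rows are probe sets; values follow the order of TIME_POINTS. All values are made up.
EXPRESSION: Dict[str, List[float]] = {
    "probe_A": [100.0, 250.0, 400.0, 150.0],
    "probe_B": [80.0, 75.0, 90.0, 85.0],
}


def profile(probe_id: str) -> Dict[str, float]:
    """Return one matrix row as a time point -> value mapping."""
    return dict(zip(TIME_POINTS, EXPRESSION[probe_id]))


def column(time_point: str) -> Dict[str, float]:
    """Return one matrix column as a probe set -> value mapping."""
    idx = TIME_POINTS.index(time_point)
    return {probe: values[idx] for probe, values in EXPRESSION.items()}


def peak_time(probe_id: str) -> str:
    """Time point at which the probe set reaches its highest value."""
    values = EXPRESSION[probe_id]
    return TIME_POINTS[values.index(max(values))]
```

A browser built on this layout reads a row to plot one gene over time and a column to compare all genes in one sample.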

  11. GCA Features
     • In short:
        • A gene expression database application, focused on Affymetrix technology, serving as a shared facility for a distributed community of users

  12. GCA Data Model

  13. GCA Data Model


  15. GCA Login

  16. GCA Editing

  17. GCA Experiment Checking

  18. GCA Import of chip annotations

  19. GCA CVs and protocols

  20. GCA CVs and protocols


  22. GCA Gene Browser

  23. GCA Gene Browser

  24. GCA Gene Browser

  25. GCA Gene Browser

  26. GCA Gene Browser


  28. GCA Access Management

  29. [Diagram, labels only: roles "All rights", "All but admin", "R, W, -publish", "Read only"; groups Besta, Bicocca, ADMIN; users Andrea, Brandizi, Granucci, Tiranti, Norman, Ottavio]
     Experiment 123, permissions by user:
     • Brandizi, Andrea: All
     • Granucci: Read
     • Norman: Read, Write
     • Tiranti: All (except admin)
     • Ottavio: None

  30. Access management
     • Based on a core library
        • Recent developments (security lib)
     • Code has been changed so that it uses the security lib
        • All code that interacts with the user has been wrapped with access management controls
     • Even malicious access attempts have been considered:
        • Manually typed URLs
        • Direct requests for an uploaded file (to be completed)
     • Does it work?
        • Yes, pretty sure
        • But more testing is needed
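To make the permission scheme concrete, here is a minimal Python sketch of a per-experiment access check. It is not the GCA security lib (which is PHP and not shown in the slides). The role names and the Experiment 123 assignments come from slide 29, while the mapping of each role to read/write/publish/admin bits is an assumption.

```python
from enum import Flag
from typing import Dict


class Perm(Flag):
    """Individual permissions; this four-way split is an assumption."""
    NONE = 0
    READ = 1
    WRITE = 2
    PUBLISH = 4
    ADMIN = 8


# Role names follow slide 29; the permission sets behind them are assumed.
ROLES: Dict[str, Perm] = {
    "All": Perm.READ | Perm.WRITE | Perm.PUBLISH | Perm.ADMIN,
    "All (except admin)": Perm.READ | Perm.WRITE | Perm.PUBLISH,
    "Read, Write": Perm.READ | Perm.WRITE,
    "Read": Perm.READ,
    "None": Perm.NONE,
}

# Effective roles on experiment 123, as listed on slide 29.
EXPERIMENT_123: Dict[str, str] = {
    "Brandizi": "All",
    "Andrea": "All",
    "Granucci": "Read",
    "Norman": "Read, Write",
    "Tiranti": "All (except admin)",
    "Ottavio": "None",
}


def can(user: str, needed: Perm, acl: Dict[str, str]) -> bool:
    """True if `user` holds every permission in `needed`.

    Users missing from the ACL are treated as having no permissions.
    """
    granted = ROLES[acl.get(user, "None")]
    return (granted & needed) == needed
```

Wrapping every user-facing entry point in a check like `can(user, Perm.READ, acl)` is one way to cover the manually typed URL case, since the check runs regardless of how the request was built.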


  32. Search and Save

  33. Search and Save

  34. Search and Save

  35. Search and Save

  36. Search and Save

  37. Search and Save

  38. Search and Save

  39. Search and Save


  41. MAGE Export
     • Will allow exporting a GCA experiment to MAGE/ArrayExpress
     • A collaboration with EBI
        • in the context of u-GENE
     • So far:
        • Schema of the GCA->MAGE mapping (in AE-compatible form)
        • Basic code fragments (business objects in Java)
     • Still to do
        • Full code
        • Mappings with the MGED Ontology
        • Tests with AE

  42. MAGE Export


  44. GCA on cluster architecture
     • Three machines, the minimum to have a cluster
        • Master (Xeon 3.2 GHz, 2 GB RAM)
        • + Master clone that ensures high availability
        • Computation node computers (P4 3 GHz, 512 MB)
        • 1 TB of SCSI disk, shared via NFS
     • Based on:
        • Debian (Linux)
        • Linux Virtual Server (load balancer)
        • Heartbeat (high availability)
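The two roles in this setup, load balancing and high availability, can be illustrated with a toy Python model. This is not LVS or Heartbeat configuration. Round-robin is one of the schedulers LVS offers, but the slides do not say which one this deployment uses, and the node names and `dead_time` value are hypothetical.

```python
from itertools import cycle
from typing import List


class RoundRobin:
    """Hand out computation nodes in turn, as a round-robin scheduler would."""

    def __init__(self, nodes: List[str]) -> None:
        if not nodes:
            raise ValueError("at least one node is required")
        self._nodes = cycle(nodes)

    def next_node(self) -> str:
        """Return the node that should serve the next request."""
        return next(self._nodes)


class StandbyMonitor:
    """Standby side of a heartbeat pair: take over when the master goes quiet."""

    def __init__(self, dead_time: float) -> None:
        self.dead_time = dead_time  # seconds of silence tolerated (hypothetical)
        self.last_beat = 0.0

    def heartbeat(self, now: float) -> None:
        """Record the time of the latest heartbeat received from the master."""
        self.last_beat = now

    def should_take_over(self, now: float) -> bool:
        """True once the master has been silent for longer than dead_time."""
        return now - self.last_beat > self.dead_time
```

In this picture the master runs the scheduler, while the master clone runs the monitor and becomes the new scheduler when `should_take_over` turns true.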

  45. GCA on cluster architecture

  46. GCA on cluster architecture
     • Code needs slight changes:
        • PHP side and sessions:
           • Objects saved in the session need to be reloaded properly
              • See: http://it2.php.net/manual/en/language.oop.magic-functions.php#14473
              • __wakeup() is already used
              • __sleep() with a proper return value is to be implemented
        • MySQL side:
           • The stable DB:
              • We need to specify the type of DB access: read-only mode vs. read/write mode
              • RO access uses the local copy of the DB
              • RW access uses the master copy
           • The temporary DB:
              • Only the master copy exists (port 3307, current deployment)
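The MySQL routing rule described above can be sketched as a small lookup. The sketch is in Python for brevity rather than the application's PHP. The host names are hypothetical, and using MySQL's default port 3306 for the stable DB is an assumption. Only port 3307 for the temporary DB comes from the slide.

```python
from dataclasses import dataclass
from enum import Enum


class Access(Enum):
    READ_ONLY = "ro"
    READ_WRITE = "rw"


@dataclass(frozen=True)
class Endpoint:
    host: str
    port: int


# Hypothetical hosts; 3306 (MySQL default) is assumed for the stable DB.
LOCAL_COPY = Endpoint("localhost", 3306)
MASTER_STABLE = Endpoint("master.example", 3306)
# Port 3307 is the one named on the slide for the temporary DB.
MASTER_TEMP = Endpoint("master.example", 3307)


def endpoint_for(database: str, access: Access) -> Endpoint:
    """Pick the server to connect to for a given database and access mode."""
    if database == "temporary":
        # Only the master copy of the temporary DB exists.
        return MASTER_TEMP
    if database == "stable":
        # Reads go to the node's local copy, writes to the master copy.
        return LOCAL_COPY if access is Access.READ_ONLY else MASTER_STABLE
    raise ValueError(f"unknown database: {database!r}")
```

Making the caller state the access mode up front is what lets read-only pages stay on the local replica while writes always reach the master.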
