ArrayExpress and MAGE Jamboree II
180 likes | 324 Vues
ArrayExpress and MAGE Jamboree II. Ugis Sarkans, EBI. Outline. what is ArrayExpress overall architecture status and future MAGE Jamboree II. ArrayExpress. EBI’s public gene expression data repository
ArrayExpress and MAGE Jamboree II
E N D
Presentation Transcript
ArrayExpress and MAGE Jamboree II Ugis Sarkans, EBI
Outline • what is ArrayExpress • overall architecture • status and future • MAGE Jamboree II
ArrayExpress • EBI’s public gene expression data repository • first version (object model) - 1999, in collaboration with German Cancer Research Centre (DKFZ) • second version (object model) - end of 2000, prototype development funded by Incyte
ArrayExpress (2) • implementation - first half of 2001 - Oracle schema, data loader (from MAML), prototype Web interface, a few datasets loaded • decision to use MAGE-OM as basis for further development • EU funding - 2002-2004, 8 new positions • www.ebi.ac.uk/arrayexpress
ArrayExpress - features • MIAME-compliant • able to import MAGE-ML formatted data • can deal with: • raw data • processed data • data transformations • independence of: • experimental platforms • image analysis methods • data normalization methods • object model-based query mechanism
ArrayExpress component architecture User Web server API application server (Java servlets) MIAMEexpress submission/ curation tool database ArrayExpress data warehouse (gene-centred queries) central database (experiment-centred queries) curation MAGE-ML image server
ArrayExpress architecture MAGE-OM ArrayExpress (Oracle) MAGE-ML (DTD) data loader Tomcat object/ relational mapping Castor MAGE-ML (doc) MAGE-ML (doc) MAGE-ML (doc) Java servlets Velocity template engine MIAMEexpress Web page template submission/ curation tool database Web page template Browser
ArrayExpress: current status • Object model (MAGE-OM) - stable • Database schema - generated (standard SQL, we run under Oracle) • Data loader from MAGE-ML - generated • Web interface - under development: • queries: • by experiment • by array • by sample • browsing
Near future developments • Dedicated hardware for ArrayExpress • Good quality data coming from collaborators: • annotation tools essential (MIAMEexpress) • Data uploading and Web interface made public • interface with analysis tools (Expression Profiler)
Future developments • Integration with other analysis tools • New visualization methods and tools • New analytical tools • Links with other databases • Data curation, liaison with data providers • development of standard ontologies • Data warehouse (gene-oriented queries)
MAGE Jamboree II • open-source implementation efforts: • MAGE Jamboree I, Toronto, September 13-19, sponsored by Iobion • Jamboree II at EBI, December 6-11 • objective: bring MAGE to life
Programming APIs • Mapping of MAGE-OM to language-specific OMs • API’s are automatically generated from the OM specifications • Get/set methods for associations • Get/set methods for attributes • XML <=> language-specific OM marshallers/unmarshallers - also automatically generated
Programming APIs (cont.) • Use standard modules/packages • Xerces, JDBC, etc. • Implementation in Java, C++, Perl • Building annotation tools/database access modules on top of these APIs
MAGEstk components MAGE-RS (database) MAGE-ML MAGE API (Perl, Java, C++) MAGE browsing/ annotation tools
Acknowledgements • EBI microarray team/database department: • Alvis Brazma (team leader) • Helen Parkinson (curation, MIAMExpress) • Mohammad Shojatalab (MIAMExpress) • Jaak Vilo (Expression Profiler) • Ahmet Oezcimen (Oracle DBA) • Susanna Sansone (curation, MIAMExpress) • MAGE developers: • MGED • Rosetta Biosoftware