1 / 13

MGI and Phenotyping Projects

MGI and Phenotyping Projects. Mouse Genome Informatics. Data Integration. Centers: mutagenesis, gene trap, etc. Primary literature. Data Loads: GenBank, SNPs, clone collections, SwissProt, RIKEN, etc. Electronic Submissions (individual labs). Processing, QC, and curation.

minty
Télécharger la présentation

MGI and Phenotyping Projects

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. MGI and Phenotyping Projects Mouse Genome Informatics

  2. Data Integration Centers: mutagenesis, gene trap, etc Primary literature Data Loads: GenBank, SNPs, clone collections, SwissProt, RIKEN, etc Electronic Submissions (individual labs) Processing, QC, and curation

  3. Lexicon data for Adfp gene trap knockout

  4. Phenotype Expression Mice ES cells

  5. One Goal for MGI in our next grant period is to be able to provide large scale raw phenotyping data to complement and support the analyzed and metadata we already provide from many data resources.

  6. Challenges and Needs for Baseline Strain Phenotypes

  7. Global challenges for baseline phenotype data • making raw/individual animal data available • how much integration of raw data is useful • defining assay SOPs • providing summaries and metadata • center data vs.investigator data • nomenclature

  8. What mouse is being analyzed? • Differentiating background • inbreds: standard, RI, consomic, congenic, etc. • F1 and F2 hybrids and other mixtures • Use of correct and explicit nomenclature • Strain name and substrains • C57BL/6 is not enough: • C57BL/6N, C57BL/6J, C57BL/6JCrl, etc. • Allelic composition • Acvrl1-/- is not enough • Avcrl1tm1Dyl, Avcrl1tm1Enl, Avcrl1tm1Spo • Use of MGI_ids in data and publications

  9. SOPs • Encouraging community use and contributions of SOPs with data • Defining minimal elements • Standard format • Beyond center data: general goal for all publication

  10. Data submission • The ideal • All who submit use same format, the correct nomenclature and include relevant MGI-Ids, GenBank-Ids, etc. • The reality • Even groups who agree on a standard format frequently take liberties • Community database providers need to continue to dedicate effort to disambiguating data, QC for content, and enforce nomenclature.

  11. The End http://www.informatics.jax.org

More Related