1 / 16

What is Annotation?

What is Annotation?. Microbial Genome Annotation Network Training Workshop (supported by NSF RCN-UBE grant). Some Basic Definitions. We all feel comfortable with genes and genomes, but what is annotation?

Télécharger la présentation

What is Annotation?

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. What is Annotation? Microbial Genome Annotation Network Training Workshop (supported by NSF RCN-UBE grant)

  2. Some Basic Definitions • We all feel comfortable with genes and genomes, but what is annotation? • Genome annotation is the process of interpreting sequence data using biological information • IMG-ACT focuses on two specific types of annotation

  3. Types of Annotation Gene annotation Gene product Protein structure Protein properties Functional role Phylogenetic origin

  4. Types of Annotation Pathway/Structure Pathway components present Alternate pathway steps Corrected gene annotation for steps Metabolic/physiological capabilities of the microbe

  5. Why Annotate with Students? • Most automated genome annotations - 35% are wrong • Automated annotations miss things…. (especially in GEBA microbes) • The logic of bioinformatic algorithms illustrates key principles of biology

  6. Why Annotate More Genomes? Most of what we know comes from a relatively small subset of life’s diversity. Does this subset adequately reflect genomic diversity? Bacteria Archaea Figure courtesy of Phil Hugenholtz

  7. GEBA Genomes Genomic Encyclopedia of Bacteria & Archaea (GEBA) is a massive JGI genome sequencing effort to fill in many of the missing or under-sampled branches of the Bacteria & Archaea trees. *T.P. Curtis, W.T. Sloan, and J.W. Scannell. 2002. Estimating prokaryotic diversity and its limits. Proc Natl Acad Sci USA 99: 10494-10499.

  8. GEBA Genomes First 56 GEBA genomes* filled in several missing or under-sampled branches of the Bacteria trees & showed that there is a lot of genomic diversity out there to be discovered. * D. Wu, P. Hugenholtz, K. Mavromatis, et al., 2009. A phylogeny-driven genomic encyclopedia of Bacteria and Archaea. Nature 462: 1056-1060.

  9. Genomes to Metagenomes Community Isolate sequencing Genomics Metagenomics

  10. Develop and strengthen basic scientific research skills such as: Reading and evaluating primary literature Developing hypotheses and interpreting data Drawing conclusions from a collection of evidence Working collaboratively Maintaining a laboratory notebook Working with real data Presenting results - orally and/or in writing Annotation & Course Goals

  11. Develop and strengthen genome annotation skills such as: Using computer programs to analyze sequence data Gathering and evaluating information from Web-based community-accessible sequence databases Constructing phylogenetic trees Evaluating automated gene calls Reconstructing a metabolic pathway Annotation & Course Goals

  12. Annotation & Course Goals • Produce quality annotations for incorporation into the Integrated Microbial Genomes Database • Build and strengthen a conceptual understanding of: • Evolutionary relationships among genomes • Gene-protein-pathway interactions • Genome organization • Metabolic diversity • Operon structure • Power and limitations of bioinformatics • Protein structure and function • Transcriptional and translational signals

  13. Streamline annotation Emphasizes biological root of bioinformatics More easily compatible with education Emphasizes complementarity of tools Allows addition and removal of modules to match student level IMG-ACT Modular Annotation

  14. Basic Information Sequence-based Similarity Data Cellular Localization Alternative Open Reading Frame Structure-based Evidence Enzymatic Function Gene Duplication and Degradation Horizontal Gene Transfer RNA IMG-ACT Modules

  15. Module Concepts Basic Information

  16. For More Information IMG-ACT (JGI): Cheryl Kerfeld (ckerfeld@lbl.gov) Seth Axen (saxen@lbl.gov) www.jgi.doe.gov/education/annotation_tools.html Microbial Genome Annotation Network: (NSF RCN-UBE) Lori Scott, PI (LoriScott@augustana.edu) mgan.jgi-psf.org

More Related