1 / 24

Discovering virulence genes present in novel strains and metagenomes

Discovering virulence genes present in novel strains and metagenomes. Chris Stubben IC postdoc, B-7. Overview. Review current functional classification systems Discuss Virulence Factor Ontology Identify virulence genes in novel strains and metagenomes. Functional classification systems.

leane
Télécharger la présentation

Discovering virulence genes present in novel strains and metagenomes

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Discovering virulence genes present in novel strains and metagenomes Chris Stubben IC postdoc, B-7

  2. Overview • Review current functional classification systems • Discuss Virulence Factor Ontology • Identify virulence genes in novel strains and metagenomes

  3. Functional classification systems • EC numbers for enyzmes (1956) • Swiss-Prot keywords (1986) • E. coli gene functions, M. Riley (1993) • TIGR role categories (1995) • Gene Ontology (1998) function gene

  4. What functions are related to virulence? • Some systems have a few terms • Swiss-Prot keywords = virulence, toxin, antibiotic resistance • TIGR roles = pathogenesis, toxin production and resistance • Gene Ontology (GO) also has pathogenesis, resistance to antibiotics, plus many more GO terms related to the enzymatic activity of toxins

  5. Gene Ontology (GO) • 25,688 terms in three structured controlled vocabularies (ontologies) • 15098 biological processes • 2186 cellular components • 8404 molecular functions • Standard for eukaryotic gene annotation • Increasingly used for prokaryotes • TIGR (2002) • Plant pathogens by PAMGO at VBI (2005) • Human pathogens at 8 BRCs (2006)

  6. Bioinformatics Resource Centers (BRC) • NIAID funded, $100 million dollar effort to create eight bioinformatic centers for human pathogens • Goal is to provide easy access to genomic data from multiple strains like eukaryotic model organism databases BRCs = ?

  7. Example: Toxin annotation in GO Step 1, Assign GO terms, maybe • activation of Rho GTPase activity • N-terminal peptidyl-glutamine deamination • actin cytoskeleton reorganization • stress fiber formation

  8. Step 2, add references and evidence codes Virulence Protein Experimental Computational Sequence similarity Genomic context • Knockout mutants (IMP) • Overexpression phenotypes (IDA) • Genetic interactions (IGI) • Microarrays (IEP or RCA) • BLAST alignments (ISA) • Orthologous proteins (ISO) • Hidden markov models of protein families or domains (ISM) • Phlyogenetic profiles, conserved neighborhoods, gene fusion, shared regulatory sites, etc (IGC) Function

  9. Example: Toxin searches in GO • If a gene is annotated to ‘adenylate cyclase activity’, how do you know it’s a toxin? • It may also annotated to “cell killing” or related term, but is that enough? • However, an alternative is to define virulence factors and toxins (both outside the scope of GO) in a new ontology

  10. Why we need a Virulence Factor ontology • Lots of effort to characterize pathogenic processes and systems (eg, BRCs) • Many different definitions of pathogen, virulence and virulence factors • Not clear what terms in GO may be related to toxins and virulence (BRCs have already assigned 750,000 GO terms to 300,000 genes)

  11. Virulence Factor Ontology working group • Goal is to combine existing toxin and virulence terms from various groups into a single ontology • TVFac and antibiotic resistance (AR) terms at LANL • Gemina virulence factors and AR terms at U. of Maryland • PAMGO terms in GO • Participants • MITRE. Lynette Hirschmman, Marc Colosimo, and others • LANL. Chris Stubben, Murray Wolinsky and Jian Song • U of Maryland IGS. Lynn Schriml and Michelle Gwinn

  12. Virulence Factor Ontology (VFO) • Three new ontologies, one very simple that points to additional terms in GO or to new ontologies • Virulence factor (definition needed!) • toxin associated processes • antibiotic resistance • adhesion • entry into host • acquisition of nutrients from host • avoidance of host defenses • growth within host • modification of host morhphology • dissemination from host New New simplified GO trees (slims)

  13. Virulence genes in novel strains • Emerging, engineered and novel strains will most likely be sequenced quickly using next generation sequencing technologies, • and then compared to near neighbor strains using sequence similarity (BLAST) or models (HMMs like PFams, TIGRFams, FIGFams, EnteroFams, etc).

  14. Compare novel strains to what? • Very few manual annotations available for prokaryotes, especially in public databases like NCBI and UniProt Table 1. Percentage of genes in UniProt with functional assignments to Gene Ontology terms based on experimental evidence in the primary literature. “Curated information from the literature serves as the gold-standard data set for comparative analyses” -Nature Sep10, 2008 Use BRCs!

  15. BRC annotations • Genomes annotations should have references and evidence codes signifying whether annotations were produced experimentally or computationally 3.8% of Y.pestis CO92 with manual annotations

  16. Y. pestis CO92 annotations at ERIC Table 1 and 2. Sequence features and coding sequence annotations for Y. pestis CO92 at ERIC

  17. Yersinia antibiotic resistance genes Table 1 and 2. Antibiotic resistance genes found using Swiss-prot keyword search ‘antibiotic resistance’ in UniProt and using GO term search ‘response to antibiotic’ in ERIC. Only one gene in common!

  18. Vibrio toxins in GO, UniProt, and NMPDR

  19. Virulence genes in metagenomes • Recent comparison of virulence genes in chicken, cow, mouse and human gut metagenomes (metavirulomes) was based on SEED subsystem categories at NMPDR • Another alternative is to use GO term mappings to protein family and domain databases like PFam

  20. IMG/metagenomes from JGI • Select metagenomes and save

  21. Create abundance profiles • Compare using Pfam, COG, or TIGRfam abundance profiles

  22. Find virulence genes • Use GO term mappings to PFAM database to find virulence genes

  23. Need better mappings to virulence genes • Current GO term mappings miss most virulence-associated genes. Table 1 and 2. PFAMs and TIGRfams overrepresented in air compared to soil

More Related