1 / 26

PolyPhen and SIFT: Tools for predicting functional effects of SNPs

PolyPhen and SIFT: Tools for predicting functional effects of SNPs. Epi 244 Spring 2009 Sam S. Oh. Human genome variation. 3.2 billion base pairs (bp) 99.9% similarity across individuals 3.2 million bp dissimilar ~11 million SNPs Coding vs. non-coding (intron and intergenic regions)

Télécharger la présentation

PolyPhen and SIFT: Tools for predicting functional effects of SNPs

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. PolyPhen and SIFT: Tools for predicting functional effects of SNPs Epi 244 Spring 2009 Sam S. Oh

  2. Human genome variation • 3.2 billion base pairs (bp) • 99.9% similarity across individuals • 3.2 million bp dissimilar • ~11 million SNPs • Coding vs. non-coding (intron and intergenic regions) • Most are synonymous Frazer et al. Nat Rev Genet, 2009;10:241-251

  3. DNA → RNA → Protein

  4. Example: sickle-cell anemia • A to T SNP of beta-globin gene results in glutamate (hydrophilic) to valine (hydrophobic) substitution

  5. Example: MTHFR • Folate metabolism

  6. Finding MTHFR SNPs

  7. Highlight all refSNP numbers (use scroll bar) and copy

  8. Note Build number (currently Build 130) Highlight all refSNP numbers (use scroll bar) and copy

  9. SIFT • Sorting Intolerant From Tolerant • Predicts tolerability of AA substitution effects (i.e., non-synonymous SNPs) based on • Sequence homology • Physical properties of amino acids • Can be applied to naturally occurring nonsynonymous polymorphisms and laboratory-induced missense mutations

  10. Compare Build numbers Copy all SNP IDs and paste into SIFT. Choose “Submit Query”

  11. Getting more info for rs2274974 Enter “rs2274974”

  12. Position of SNP in mRNA, protein, contig Note AA1, AA2, and position Flanking sequence, IUPAC code, flanking seq Allele info Build number mRNA name Protein name Contig name Scroll down Select protein

  13. Copy FASTA-formatted protein sequence

  14. Paste FASTA-formatted protein sequence Enter AA substitution [Letter1-position-Letter2]

  15. Substitution occurs at AA 566 Scroll down

  16. Check tolerance of AA substitutions Scroll down

  17. “Substitution at pos 566 from G to E is predicted to AFFECT PROTEIN FUNCTION with a score of 0.01. Tolerance of specified substitution

  18. Polymorphism Phenotyping • Tool for prediction of possible impact of amino acid substitution (i.e., non-synonymous SNPs) on protein structure and function based on: • Amino acid sequence • What part of the protein did the SNP occur? (E.g., active site, binding site, transmembrane region) • Multiple alignments with homologous proteins and mammalian orthologues • How compatible is the substitution based on proteins of comparable sequence? • 3D structural properties with the substituted amino acid • What is the substitution’s effect on the protein’s physiochemistry? (E.g., hydrophobicity, electrostatic interactions, ligand binding)

  19. PolyPhen data flow

  20. Four potential predictions • Probably damaging • It is with high confidence supposed to affect protein function or structure • Possibly damaging • It is supposed to affect protein function or structure • Benign • Most likely lacking any phenotypic effect • Unknown • Lack of data do not allow PolyPhen to make a prediction

  21. Copy FASTA-formatted protein sequence Enter AA position, ancestral AA, and substituted AA

  22. In dbSNP Build 129, corresponds to protein NP_005948.3 Enter SNP rs#

  23. Query vs. SNP Collection

  24. References • NCBI dbSNP • http://www.ncbi.nlm.nih.gov/sites/entrez • SIFT • http://sift.jcvi.org/ • PolyPhen • http://genetics.bwh.harvard.edu/pph/index.html

More Related