1 / 31

Protein Threading

Protein Threading. Zhanggroup 2003 10 22. Overview. Background protein structure protein folding and designability Protein threading Current limitations to protein threading. Computational complexity of certain formulations of the protein threading problem

werner
Télécharger la présentation

Protein Threading

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Protein Threading Zhanggroup 2003 10 22

  2. Overview • Background protein structure protein folding and designability • Protein threading • Current limitations to protein threading

  3. Computational complexity of certain formulations of the protein threading problem • Performance of protein threading systems • References

  4. Protein Structure • Primary, secondary, tertiary structure

  5. Can only refer to the structure of a protein if a particular environment is assumed • solvent environment (aqueous trans-membrane ……) • temperature • pH etc • Different environments yield different structures or no stable structure at all

  6. Proteins molecules are not completely rigid structures • kinetic energy energetic collisions with solvent molecules • vibrations sidechain conformational changes • flexible sections of the peptide chain • The native tertiary structure of a protein is thus an average

  7. Protein Folding • Protein folding = searching for a conformation having minimum energy

  8. Factors in protein folding • hydrophobic effects • electrostatic charges in residues • hydrogen bonding • Chaperonins,ribosomes

  9. 3 stages of folding • denatured unfolded state • molten globule state • native compact state • most proteins will return to their native state after forced denaturation

  10. The Protein Folding Problem • Given a proteins amino acid sequence what is its tertiary structure • The protein folding problem is hard

  11. Direct approach :molecular dynamics simulation • Simulate on an atomic level the folding of a single protein molecule • protein = thousands of atoms • solvent environment = hundreds to thousands of molecules => thousands of atoms

  12. Sub-picosecond time scales • run the simulation for 1-5 seconds • We need another years of Moores law to make this computation feasible

  13. Designability • A protein with a stable native state can not have another low-energy state nearby in conformational space • A structure is highly designable if its minimum energy state has no low-energy neighbours

  14. Protein Threading • inverse protein folding problem: given • a tertiary structure, find an amino acid sequence that folds to that structure • Protein threading: given a library of possible protein folds and an amino acid sequence find the fold with the • best sequence -> structure alignment (threading)

  15. Evolution depends on designability to preserve function under mutation • Estimate only different protein structures exist in nature (Chothia,1992)

  16. four components • a library of protein folds (templates) • a scoring function to measure the fitness of a sequence -> structure alignment • a search technique for finding the best alignment between a fixed sequence and structure

  17. a means of choosing the best fold from among the best scoring alignments of a sequence to all possible folds

  18. Scoring Schemes for Sequence->Structure Alignments • The scoring scheme for a particular threading of a sequence onto a structure measures the degree to which

  19. environmental preferences are satisfied • Different amino acid types prefer different environments e.g. • structural preferences: • in helix • in sheet • not exposed to solvent • pairwise interactions with neighbouring amino acids

  20. Formal Statement of the ProteinThreading Problem • C is a protein core having m segments Ci representing a set of contiguous amino acids Let ci be the length of Ci • Sequence a = a1a2…an of amino acids

  21. Current limitations to protein threading • Statistical problems • Definition of neighbor and /or pairwise contact environments: • energetic neighbor ? contact neighbor

  22. Computational Complexity of Finding an Optimal Alignment • The complexity of the protein threading problem depends on whether: • Variable-length gaps are allowed in alignments • the scoring function for an alignment incorporates pairwise interactions between amino acids

  23. Property(I) makes the search space exponential in size to the length of the sequence • Property(Ii) forces a solution to take non-local effects into account

  24. Any protein threading scheme with both properties is NP-complete (3-SAT Lathrop 1994) (MAX-CUT Akutsu,Miyano 1999)

  25. Thus all protein threading approaches can be divided into four groups: 1 no variable length gaps allowed 2 no pairwise interactions considered in scoring function 3 no optimal solution guarantee 4 exponential runtime

  26. Performance of Protein Threading Systems CASP1(1994) CASP2(1996) CASP3(1998): Critical Assessment of Structure Prediction meetings protein threading methods have consistently been the winners success depends on structural similarity of target to known structures successful even when target sequence and library sequence have low homology

  27. Much room for improvement in all areas of protein threading e.g.: algorithms for searching the threading space reliable biologically accurate scoring functions

More Related