1 / 31

Evolution of Protein Structure Mario A. Fares Evolutionary Genetics and Bioinformatics Laboratory Department of Genetics

Evolution of Protein Structure Mario A. Fares Evolutionary Genetics and Bioinformatics Laboratory Department of Genetics, Smurfit Institute of Genetics, TCD Phone: 8913521 Email: faresm@tcd.ie http://bioinf.gen.tcd.ie/~faresm. Contents. 1. In vivo, in vitro, in silico Proteins

hiero
Télécharger la présentation

Evolution of Protein Structure Mario A. Fares Evolutionary Genetics and Bioinformatics Laboratory Department of Genetics

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Evolution of Protein Structure Mario A. Fares Evolutionary Genetics and Bioinformatics Laboratory Department of Genetics, Smurfit Institute of Genetics, TCD Phone: 8913521 Email: faresm@tcd.ie http://bioinf.gen.tcd.ie/~faresm

  2. Contents • 1. In vivo, in vitro, in silico • Proteins • Protein structure and conformation: Basic concepts • Proteins in diseases • 2. Patterns and forms in protein structure • Helices and sheets • The hierarchical nature of protein architecture • Structure based classification of proteins • protein folding: Intra-cellular pathogens and the survival of the flattest • Protein folding and disease: Amyloidoes, Parkinson, Huntington, Prion disease

  3. 3. Conformational changes in protein Structural changes arising from changes in state of ligation Hinge motions in proteins Mechanisms of conformation changes (Haemoglobin, Serpins, muscle contraction) Higher level structural changes (GroELS) 4: Protein Structure Prediction and Determination Methods of protein structure determination Critical assessment of structure prediction Homology modelling Threading Prediction of novel folds Protein design

  4. 5. Evolution of Protein Structure and Function • Protein structure classification • Structural relationships among homologous proteins • Changes in proteins during evolution uncovers functionally/structurally important amino acid sites • Domain swapping • Classification of protein folding patterns • How do proteins evolve new functions? • Classification of protein functions • 6. Molecular evolution • Evolution of Globins • Evolution of Serine proteinases • Evolution of visual pigments and related molecules

  5. a)7. Molecular Coevolution and mutation epistatic effects on protein structure Defining molecular coevolution Non-parametric methods to detect coevolution Parametric methods to detect coevolution Intra-molecular coevolution and prediction of amino acid sites three-dimensional proximity Inter-protein coevolution and the identification of protein-protein interfaces 8. Some examples of the immune system Antibody structure Protein of the Major histocompatibility Complex T-cell receptors Cancer and protein structures

  6. SCIENTIFIC RESEARCH & DISCOVERY Anatomy Migratory Sensors Organisms Physiology Ventricular Modeling Organs Cell Biology Electron Microscopy Cells Macromolecules Biopolymers X-ray Crystallography Proteomics Genomics Medicinal Chemistry Protein Docking Atoms & Molecules EXAMPLE UNITS REPRESENTATIVE DISCIPLINE REPRESENTATIVE TECHNOLOGY MRI Heart Translational Medicine Neuron Structure Sequence Protease Inhibitor

  7. SCIENTIFIC RESEARCH & DISCOVERY Anatomy Migratory Sensors Organisms Physiology Ventricular Modeling Organs Cell Biology Electron Microscopy Cells Macromolecules Biopolymers X-ray Crystallography Proteomics Genomics Medicinal Chemistry Protein Docking Atoms & Molecules EXAMPLE UNITS REPRESENTATIVE DISCIPLINE REPRESENTATIVE TECHNOLOGY MRI Heart Neuron We will focus here Structure Sequence Protease Inhibitor

  8. In vivo, in vitro, in silico Proteins Proteins roles Structural proteins Catalytic proteins Regulatory proteins Viral capside proteins Cytoskeleton Epidermal keratin Haemoglobin Myoglobin Ferritin Hormones Transcription factors

  9. Estimated Functional Roles (by % of Proteins) of the Proteome in a Complex Organism

  10. (a) myoglobin (b) hemoglobin (c) lysozyme (d) transfer RNA (e) antibodies (f) viruses (g) actin (h) the nucleosome (i) myosin (j) ribosome Courtesy of David Goodsell, TSRI

  11. Step 3. What Can Be Got from Structure When You Have it? From Structural Bioinformatics Ed Bourne and Weissig p394 Wiley 2002

  12. Step 4. Proteins Do Not Function in Isolation But are Part of Complex Interaction Networks http://www.genome.jp/kegg/

  13. Proteins have the ability to organise themselves in three dimensions • Proteins have the ability to evolve due to the inheritable property of protein structure variation • Proteins are the direct responsibles for the cell viability in normal physiological conditions as well as in stressful conditions • Projects in development: • Strutural Genomics • Proteome project

  14. Fascinating projects and questions under study: • Interpretation of the mechanisms of function of individual proteins • Approaches to the protein folding problem • Dependence on environmental parameters • Protein structure prediction • 3. Patterns of molecular evolution • Structural and functional selective constraints • relaxed amino acid sequence evolution • Structure evolution • 4. Prediction of the structure of closely related proteins • Homology methods • Functional convergence leads to structure convergence

  15. 5. Protein engineering Modifications to probe mechanisms of protein function Molecular manipulation to enhance thermostability Clinical aplications (therapeutic antibodies) 6. Drug design Peptide inhibitors of HSPs proteins inducers of the immune response

  16. GENOME SEQUENCES • The genomes of 100 prokaryotes, many viruses, organelles and plasmids, and over 12 eukaryotes, representing all major categories of living things Completed eukaryotic nuclear genomes Type of organism Species Genome size (106 base pairs) Primitive microsporidian E. cuniculi 2.5 Fungi S. cerevisiae 12.1 Sc. pombe 13.8 N. crassa 40 Nematode worm C. elegans 100 Insect: Fruit fly D. melanogaster 180 mosquito A. gambiae 278 Malarial parasite P. falciparum 22.8 Plants: Thale cress A. thaliana 116.8 rice O. sativa 400 Human H. sapiens 3400 Mouse M. musculus 3454 Rat R. norvegicus 2556 Chicken G. gallus 1200

  17. ENCODE (Encyclopedia of DNA Elements) • Determining the function of all significant regions of a selected region of the human genome • For a selected 1% of the genome, the corresponding regions in 30 vertebrates genomes will be sequenced • A variety of experimental and computational techniques will be applied, including comparative genomics • The results will feed up and engine new initiatives aimed at developing models and computational tools to deal with the data in the human genome

  18. Amino acid sequences determine protein structure: Protein structure and conformation • Proteins are polymers of amino acids containing a constant main chain (backbone) of repeating units, with a variable side chain attached to each Residue i -1 Residue i Residue i +1 Si-1 Si Si+1 Side chains variable …N-C-C- N-C-C- N-C-C-… Main chain constant O O O • The amino acid sequence of a protein, together with any post-translational modifications, specify the primary structure of the protein, the fixed chemical bonds. • Because the chain is flexible, the primary structure is compatible with a very large number of spatial conformations of the main chain and side chains

  19. Sequence --> Structure • Given the right conditions, the same protein sequence will always* fold up into the same structure, the native structure • The conformation of the native state is thus determined by the amino acid sequence • This routinely happens in the cell. However, for many proteins, it can also be made to happen in a solution that only somewhat resembles cellular conditions • For a protein to take up a unique conformation means that evolution has produced a set of interresidue interactions that stabilize the desired state, and that no alternative conformation has comparable stability How is that achieved?

  20. The origins of bioinformatics • Figuring out that “how” was one of the origins of bioinformatics • Structural biologists wanted larger and larger collections of structure so they could extract rules about sequence/structure relationships and apply them to predict structure • Extracting rules from protein sequence/structure data precedes the exponential growth of the PDB and GenBank by more than a decade

  21. What’s interesting about proteins • Sequence • “Fold” • 3D shape • Surface crevices, interior holes, channels • Surface properties • Conformational changes • Effects of variability/mutation

  22. What can we do with structures? • Establish relationships between sequence patterns and structural features -- how have proteins evolved? • Develop hypotheses about the function of a particular protein • Predict how a sequence will fold • Build a model by comparison with a known structure of similar sequence

  23. What can we do with structures? • Study the effects of mutation on structure and function • Predict the effects of a novel mutation on structure or function (protein engineering -- beginning) • Design and build whole new proteins with novel functionality (protein engineering -- advanced) • Design drugs to interact with particular protein active sites

  24. Protein structure basics • Building blocks of protein structure • Amino acids • Organic cofactors • Metal ions • Levels of structural description • Descriptors of protein geometry • Form and content of structure data • Visualization styles

  25. Protein Secondary Structure -helix -sheet • These secondary structures are highly present in proteins due to: • They keep the main strain in an unstrained conformation • Satisfy the hydrogen-bonding potential of the main-chain N-H and C=O groups These secondary structures link in a specific way in different combinations to perform the final protein structure -helices are formed from a single consecutive set of residues in the amino acid sequence The H-bond links the C=O group of residue i with the H-N group of residue i + 4

  26. There are alternatives to the helix configuration giving more constrained or less constrained structures: • 310 helices, in which hydrogen bonds form between residues i and i + 3 • -helices, in which hydrogen bonds form between residues i and i + 5 • This configurations are much rarer due to the constraints and effects they have on the protein stability. -sheets are formed by lateral interactions of several independent sets of residues. They can bring together sections of the chain widely separated in the amino acid sequence In this figures, all the strands are anti-parallel

  27. Protein in disease Protein folding Goal = lowest energy conformation

  28. Problems of protein folding Energy trap • Energy landscape is rough • Red = slow track • Peptide trapped in energy minimum (thermodynamically favorable) • Yellow = fast track • Cell crowded •  poor folding fidelity Local energy minimum = intermediate

  29. Protein misfolding diseases = amyloidoses

  30. Prions • = “Protein-based infectious particles” • Involved in multiple diseases • Protein undergoes conformational switch • PrPc = cellular isoform • Membrane-bound GPI anchored glycoprotein • PrPSc = “scrapie” isoform • Forms amyloid

  31. Cellular isoform “Scrapie” isoform

More Related