1 / 155

National Dong Hwa University, Taiwan

National Dong Hwa University, Taiwan. Graph-theoretic Approach for Protein Similarity Sheng-Lung Peng. Graph and Bioinformatics Algorithms Lab. Outline. Introduction Biological concept Protein structural alignment Methods Graph-theoretic approach Protein graph transformation Results

zaria
Télécharger la présentation

National Dong Hwa University, Taiwan

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. National Dong Hwa University, Taiwan Graph-theoretic Approach for Protein Similarity Sheng-Lung Peng Graph and Bioinformatics Algorithms Lab.

  2. Outline • Introduction • Biological concept • Protein structural alignment • Methods • Graph-theoretic approach • Protein graph transformation • Results • Conclusion

  3. Central Dogma

  4. Protein Structures

  5. Protein folding

  6. Why Protein Structures? • Proteins are fundamental components of all living cells, performing a variety of biological tasks. • Proteins reflect millions of years of evolution. • Each protein has a particular 3D structure that determines its function. • Protein structure is more conserved than protein sequence, and more closely related to function.

  7. Structure – Sequence Relationships • Two conserved sequences similar structures. • Two similar structures conserved sequences. ? There are cases of proteins with the same structure but no clear sequence similarity.

  8. Protein Database • PDB: Protein Data Bank • Holds 3D models of biological macromolecules (protein, RNA, DNA). • All data are available to the public.

  9. Protein Database • PDB: Protein Data Bank • Obtained by X-Ray crystallography (84%) or NMR spectroscopy (16%). • Submitted by biologists and biochemists from around the world.

  10. Protein Database • PDB - model • A model defines the 3D positions of atoms in one or more molecules. • There are models of proteins, protein complexes, proteins and DNA, protein segments, etc. • The models also include the positions of ligand molecules, solvent molecules, metal ions, etc.

  11. Protein Database http://www.pdb.org/pdb/home/home.do

  12. Protein Database Residue identity The PDB file – textformat Atom identity The coordinates for each residue in the structure chain Atom number Residue number X Z Y ATOM: Usually protein or DNA HETATM: Usually Ligand, ion, water

  13. Outline • Introduction • Biological concept • Protein structural alignment • Methods • Graph-theoretic approach • Protein graph transformation • Results • Conclusion

  14. Structural Alignment • Why structural alignment? • Structural similarity can point to remote evolutionary relationship. • Shared structural motifs among proteins suggest similar biological function. • Getting insight into sequence-structure mapping (e.g., which parts of the protein structure are conserved among related organisms).

  15. Structural Alignment • Intermolecular • Inter – methods which superpose protein structures and measure intermolecular distances. • Compare geometric “properties”, for example residue positions in 3D coordinate space.

  16. Structural Alignment • Intramolecular • Intra – methods which compare intramolecular distances or vectors. • Align protein structures on the basis of information about internal ‘relationships’ within each protein.

  17. Structural Alignment INTER INTRA

  18. Structural Alignment • Global versus local alignment Global alignment

  19. Structural Alignment • Global versus local alignment motif Local alignment

  20. Structural Alignment • Find an optimal alignment

  21. Structural Alignment What is the best transformation that superimposes the unicorn on the lion?

  22. Structural Alignment • Solution Regard the shapes as sets of points and try to “match” these sets using a transformation.

  23. Structural Alignment This is not a good result

  24. Structural Alignment Good result

  25. Structural Alignment • Kinds of transformations: • Translation • Rotation • Scaling • and more….

  26. Structural Alignment Translation Y X

  27. Structural Alignment Rotation Y X

  28. Structural Alignment Scale Y X

  29. x1, y1, z1 x2, y2, z2 x3, y3, z3 x1 + d, y1, z1 x2 + d, y2, z2 x3 + d, y3, z3 Structural Alignment Translation Scale Rotation

  30. Structural Alignment • A protein can be represented as a geometric object in the plane. • The object consists of points represented by coordinates (x, y, z). Lys Thr Met Gly Glu Ala

  31. Structural Alignment The problem: Given two proteins, find a transformation that produces a best superimposition of one protein onto the other.

  32. Structural Alignment • Correspondence is Unknown • Given two configurations of points in the three dimensional space. +

  33. Structural Alignment • Correspondence is Unknown • Find those rotations and translations of one of the point sets which produce “large” superimpositions of corresponding 3D points.

  34. Structural Alignment The best transformation

  35. Structural Alignment • Simple case • Two closely related proteins with the same number of amino acids.

  36. Structural Alignment

  37. Structural Alignment • Methods to superimpose structures • Translate both proteins to a common position in the coordinate frame of reference. • Rotate one protein relative to the other protein, around the three major axes. • Measure the distances between equivalent positions in the two proteins.

  38. Structural Alignment • Methods to superimpose structures • Steps 2 and 3 are repeated until there is a convergence on a minimum separation between the superposed structures.

  39. Structural Alignment • Scoring system to find an optimal alignment • Answer: Root Mean Square Deviation. • n = number of atoms. • di= distance between 2 corresponding atoms iin 2 structures.

  40. Structural Alignment 3 4 1 5 2 1 2 3 4 5

  41. Structural Alignment • Scoring system to find optimal alignment • Find a 3D transformation T* such that:

  42. Structural Alignment • Optimal Alignment • Find the highest number of atoms aligned with the lowest RMSD. • Find a balance between local regions with very good alignments and overall alignment.

  43. Structural Alignment • Unit of RMSD, e.g., Ångstroms • Identical structures → RMSD = 0. • Similar structures → RMSD is small (1 ~ 3 Å). • Distant structures → RMSD > 3 Å.

  44. Structural Alignment • Pitfalls of RMSD • All atoms are treated equally. • (e.g., residues on the surface have a higher degree of freedom than those in the core) • Best alignment does not always mean minimal RMSD. • Does not take into account the attributes of the amino acids.

  45. Outline • Introduction • Biological concept • Protein structural alignment • Methods • Graph-theoretic approach • Protein graph transformation • Results • Conclusion

  46. Graph-based approach (Example)

  47. Graph Matching Graph Theory Computer Science Computational Biology

  48. Notions of Graph Matching

  49. Notions of Graph Matching Isomorphism Identifying a bijection relation between two graphs

  50. Notions of Graph Matching Isomorphism Identifying a bijection relation between two graphs

More Related