1 / 50

Bayesian network models of Biological signaling pathways

Bayesian network models of Biological signaling pathways. karensachs@stanford.edu. PIP3. PIP2. PKC. PKA. Plc . p38. Raf. Jnk. Erk. Akt. From Phospho-molecular profiling to Signaling pathways. Cell1. Cell2. Cell3. Flow Measurments. Cell4. Cell600. Picture: John Albeck.

yule
Télécharger la présentation

Bayesian network models of Biological signaling pathways

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Bayesian network models of Biological signaling pathways karensachs@stanford.edu

  2. PIP3 PIP2 PKC PKA Plc p38 Raf Jnk Erk Akt From Phospho-molecular profiling to Signaling pathways Cell1 Cell2 Cell3 Flow Measurments Cell4 ... Cell600 Picture: John Albeck Signaling Pathways High throughput data

  3. Outline • What are signaling pathways? • What kind of data is available study them? • How do we use Bayesian networks to learn their structure? • Two extensions: • Markov neighborhood algorithm • Bayesian network based cyclic networks (BBCs)

  4. Outline • What are signaling pathways? • What kind of data is available study them? • How do we use Bayesian networks to learn their structure? • Two extensions: • Markov neighborhood algorithm • Bayesian network based cyclic networks (BBCs)

  5. Secrete cytokines Proliferation Cell death Inside each cell is a molecular network Cells respond to their environment

  6. Modification Translation Transcription Protein mRNA “Central Dogma” DNA Modified Protein Delivers instructions for specific gene Ribosome: Protein-production factory ‘Blueprint’- instructions for production of all proteins

  7. A B C RNA TF Signaling & Genetic pathways A B C Cell response DNA

  8. Outline • What are signaling pathways? • What kind of data is available study them? • How do we use Bayesian networks to learn their structure? • Two extensions: • Markov neighborhood algorithm • Bayesian network based cyclic networks (BBCs)

  9. Spectrum of Modeling Tools in Systems Biology

  10. Graph Node: Measured level/activity of protein Edge: Influence (dependency) between proteins Conditional probability distributions Each node has a conditional probability given its parents 0 1 2 Bayesian Networks Protein A Protein B Protein E P(B|A=‘On’) Protein C Protein D 10

  11. (analytical solution!) How do we use Bayesian Networks to infer pathways? The Technical Details  Score candidate models  Use a heuristic search to find high scoring models

  12. Protein data • Western blot

  13. Protein data • Protein arrays

  14. Protein data • Mass Spectrometry All of these lysate approaches give 1 measurement per protein for 10^3-10^7 cells

  15. Flow Cytometry: Single Cell Analysis Thousands of datapoints

  16. Stimulations and perturbations 1 8 3 5 2 6 7 9 4 VAV SLP-76 LFA-1 CD3 CD28 L A T Cytohesin RAS PI3K JAB-1 Zap70 10 Lck PKC PLCg Akt PIP3 Activators 1. a-CD3 2. a-CD28 3. ICAM-2 4. PMA 5. b2cAMP Inhibitors 6. G06976 7. AKT inh 8. Psitect 9. U0126 10. LY294002 PIP2 PKA Raf MAPKKK MAPKKK Mek1/2 MEK4/7 MEK3/6 Erk1/2 JNK p38

  17. Conditions (multi-well format) 12 Color Flow Cytometry perturbation a Mek1/2 PIP2 PIP3 PKC PKA Plc p38 Raf Jnk Erk Akt perturbation b perturbation n T-Lymphocyte Data • Datasets • of cells • condition ‘a’ • condition ‘b’ • condition…‘n’ • Primary human T-Cells • 9 conditions • (6 Specific interventions) • 9 phosphoproteins, 2 phospolipids • 600 cells per condition • 5400 data-points Omar Perez

  18. A B E C D Statistical Dependencies Phospho A Phospho B

  19. A B E C D Statistical Dependencies Phospho A Phospho B Edges can be directed (primarily) due to the use of interventions

  20. Conditions (multi well format) Multiparameter Flow Cytometry perturbation a Mek1/2 PIP2 PIP3 PKC PKA Plc p38 Raf Jnk Erk Akt perturbation b • Datasets • of cells • condition ‘a’ • condition ‘b’ • condition…‘n’ perturbation n Overview Influence diagram of measured variables Bayesian Network Analysis

  21. Inferred Network Phospho-Proteins Phospho-Lipids PKC Perturbed in data PKA Raf Plc Jnk P38 Mek PIP3 P44/42 Akt PIP2

  22. How well did we do? Phospho-Proteins Phospho-Lipids PKC Perturbed in data PKA Raf Plc Jnk P38 Mek PIP3 P44/42 Akt PIP2 Direct phosphorylation

  23. Features of Approach • Direct phosphorylation: Mek Erk • Difficult to detect using other forms of high-throughput data: • -Protein-protein interaction data • -Microarrays

  24. How well did we do? Phospho-Proteins Phospho-Lipids PKC Perturbed in data PKA Raf Plc Jnk P38 Mek PIP3 P44/42 Akt PIP2

  25. How well did we do? Phospho-Proteins Phospho-Lipids PKC Perturbed in data PKA Raf Plc Jnk P38 Mek PIP3 P44/42 Akt PIP2 Indirect Signaling

  26. PKC Jnk Mapkkk PKC Jnk Mek4/7 Raf Mek Erk Not measured Indirect signaling • Indirect signaling Indirect connections can be found even when the intermediate molecule(s) are not measured • Dismissing edges

  27. PKC Raf Mek Rafs497 PKC Rafs259 Mek Ras Indirect signaling - Complex example • Is this a mistake? • The real picture • Phoso-protein specific • More than one pathway of influence

  28. Expected Pathway How well did we do? Phospho-Proteins Phospho-Lipids PKC Perturbed in data PKA Raf Plc Jnk P38 Mek PIP3 P44/42 • 15/17 Classic Akt PIP2

  29. Expected Pathway Reported Reversed Missed Signaling pathway reconstruction Phospho-Proteins Phospho-Lipids PKC Perturbed in data PKA Raf Plc Jnk P38 Mek PIP3 Erk • 15/17 Classic • 17/17 Reported • 3 Missed Akt PIP2 [Sachs et al 2005]

  30. Caveats • Inhibitor specificity • Binding site similar across proteins • Reagent availability and specificity • Data quality • These are issues in many biological apps! I think I’ll bind here

  31. Outline • What are signaling pathways? • What kind of data is available study them? • How do we use Bayesian networks to learn their structure? • Two extensions: • Markov neighborhood algorithm • Bayesian network based cyclic networks (BBCs)

  32. Markov Neighborhood Algorithm

  33. Building larger networks 12 color capability  Model 50-100 variables 4 color capability  Model 12 variables PKC PKA Raf Plc Jnk P38 Mek PIP3 P44/42 Akt PIP2 ~80 proteins involved in MAPK signaling (11- at the cutting edge- is NOT enough!) 33

  34. Measured subsets = Incomplete dataset (Missing data) Insufficient information for standard approaches (will perform poorly) Use a set of biologically motivated assumptions to constrain search.. And to reduce the number of experiments ( ) 11 4 = 330 34

  35. Constraining the search Using ‘Markov neighborhoods’ (for each variable) • Plus potential perturbation parents Identify candidate parents 35

  36. Molecules 1, 3, 7, 9 Molecules 1, 2, 6, 11 Molecules 2, 4, 7, 10 Mek1/2 PIP2 PKA PKC PIP3 p38 Jnk Plc Erk Raf Akt Approach overview Bayesian Network Analysis (Constrained search)

  37. Neighborhood reduction D A 411 C E F B 4 color capability Conditional independencies in the substructure? ABC 37

  38. Accurate Reproduction of Model ~15 experiments, 4-colors Confidence value different from original model PKC PKA Raf Plc Jnk P38 Mek PIP3 Erk Akt PIP2

  39. Active learning approach Mek1/2 PIP2 PKA PKC PIP3 p38 Jnk Plc Erk Raf Akt 39

  40. Outline • What are signaling pathways? • What kind of data is available study them? • How do we use Bayesian networks to learn their structure? • Two extensions: • Markov neighborhood algorithm • Bayesian network based cyclic networks (BBCs)

  41. Learning cyclic structures with Bayesian networks • Biological networks contain many loops • Bayesian networks are constrained to be acyclic So…

  42. Overcoming acyclicity GRB2/SOS Ras Raf MEK Develop a new, Bayesian network derived algorithm that models cycles…  Erk Signaling pathways contain many cycles Bayesian networks are constrained to be acyclic How can we accurately model pathways with cycles?

  43. Bayesian Network Based Cyclic Networks (BBNs) • I. Break loops with molecule inhibitors • II. Use BN to learn the structure (now not cyclic!) • III. Close loops GRB2/SOS Ras Raf Mek inhibitor Solomon Itani MEK Erk

  44. Bayesian Network Based Cyclic Networks (BBNs) • I. Break loops with molecule inhibitors • Detect loops P(A)A* ~= P(A) • II. Use BN to learn the structure (now not cyclic!) • III. Close loops P(B|Pa(B)) A* ~= P(B|Pa(B)) AB GRB2/SOS Ras Raf MEK Erk

  45. Future work Larger network from overlapping sets (Markov neighborhood) Dynamic models over time Differences in signaling (sub-populations, treatment conditions, cell types, disease states)

  46. Acknowledgements Garry Nolan Dana Pe’er Doug Lauffenburger Omar Perez Dennis Mitchell Funding LLS post doctoral fellowship Shigeru Okumura Mesrob Ohannessian Solomon Itani 46

  47. Extra slides

  48. Mathematical Intuition B C A C is independent of A given B. B A C independent of A given B and D C D • No need to introduce time!!! • When loops are broken, the result is a BN!!!

  49. Prediction: ErkAkt Erk1/2 unperturbed • Erk  Akt not well established in literature Predictions: • Erk1/2 influences Akt • While correlated, Erk1/2 does not influence PKA PKC PKA Raf Mek Erk1/2 Akt

  50. control, stimulated Erk1 siRNA, stimulated P-Akt P-PKA Validation • SiRNA on Erk1/Erk2 • Select transfected cells • Measure Akt and PKA P=9.4e-5 P=0.28

More Related