1 / 32

break

break. Branch lengths. Branch lengths (3 characters):. A. A. C. C. C. A. A. C. C. A. 0. 0. 1. 2. 0. C. A. Sum of branch lengths = total number of changes. C. C. C. C. A. A. C. C. A. A. A. A. C. C. A. A. 0.5. 0.5. 0. 0.5. 0.5. Ex: Find branch lengths of:.

Mia_John
Télécharger la présentation

break

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. break

  2. Branch lengths

  3. Branch lengths (3 characters): A A C C C A A C C A 0 0 1 2 0 C A Sum of branch lengths = total number of changes.

  4. C C C C A A C C A A A A C C A A 0.5 0.5 0 0.5 0.5

  5. Ex: Find branch lengths of: s1 s2 s5 s4 s3

  6. break

  7. Problems with MP MP has many problems. We will go over a small sample of them.

  8. Problems with MP 1. The statistical justification of MP is unclear. Why should a tree with the least number of changes be the most likely one given the data?

  9. Problems with MP 2. Different transitions should have different probabilities (e.g., transitions versus transversions). In MP, this can be accounted for using cost matrices. However, there’s no objective way to assign costs.

  10. Problems with MP 3. Different characters may have different weights (e.g., having versus not having vertebrates should maybe weight more than having or not having nails). In MP, most of the time all characters are assigned equal weights. This can be accounted for in MP by assigning different weights to the different characters. However, there’s no objective way to assign weights to characters.

  11. Problems with MP 4. The chance for a change depends on evolutionary distances. It is more likely for an amino-acid replacement to occur between a cucumber and human, than between a chimp and a human. MP ignores evolutionary distances (branch lengths), i.e., each type of transition is assigned the same cost regardless of the branch in which it is inferred to have occurred.

  12. Problems with MP 5. The MP score does not change if we consider a rooted tree versus an unrooted one. However branch lenghs do change.

  13. 0 1 0 1 0 1 1 0 0 Gene number 2, Option number 1. s2 s1 s4 s3 s5

  14. 1 1 0 1 0 1 1 0 0 Gene number 2, Option number 2. s2 s1 s4 s3 s5

  15. 0 0 0 0 0 1 1 0 0 Gene number 2, Option number 3. s2 s1 s4 s3 s5 Number of changes for gene 2 (character 2) = 2

  16. 0 1 1 0 0 Gene number 2, Branch lengths 1/3 1/3 1/3 1/3 2/3 s2 s1 s4 s3 s5

  17. Gene number 2, The unrooted version 1 s3 s5 0 1 0 1 s4 s2 0 1 0 s1 1 s3 s5 0 0 0 1 s4 s2 0 0 0 s1

  18. Branch lengths are different if one uses a rooted or unrooted tree 0.5 1 s3 s5 0 0.5 1 s4 0.5 s2 0 0.5 0 s1

  19. Problems with MP 6. MP ignores the chance of multiple substitutions per position. If we see A in one sequence, and C in another, there’s a chance that in fact the evolution was A->G->C. Similarly, if we have A in two sequences, it may be that the evolution was A->-C>-A. MP ignores these possibilities, which is unrealistic, and as a result, MP also underestimates branch lengths.

  20. Introduction to problems with MP MP underestimates branch lengths A 0 A 0.05 A A 1.0 1.05 C C 0 0.05 A A MP branch lengths A more realistic solution

  21. break

  22. Variants of parsimony

  23. Variants of parsimony The simple parsimony which counts changes is the Wagner parsimony. If different changes have different costs, this is weighted parsimony.

  24. Camin-Sokal This method assumes that 0 is the ancestral state, and thus, we can only observe 0->1 changes, but never a reversal (1->0). Computation is easy. The father node of 0 is always 0. Total number of changes = number of 0->1 changes. Example: small deletions in DNA (0 = no deletion). We assume that a deletion cannot revert to the original sequence.

  25. Camin-Sokal This method is directional: the root position influences the score. This parsimony is rarely used today…

  26. Ordinal scale When 0 can change to 1 and not to 2, etc’... 0 1  2. Or when the states are in a linear continuum, and the distance between states 0.45 and 0.99 is abs(0.45-0.99). For example, this can be used to make phylogeny based on fingers’ length. Algorithm: very similar to Sankoff’s.

  27. Dollo Parsimony Dollo’s law states that a complex character, once attained, cannot be attained in that form again. In 0/1 terms, if 0 is the ancestral state and 1 the complex state, 1 can evolve from 0 only once, but 1 can revert to state 0. This, like the Camin-Sokal parsimony is a directional method: the position of the root is important. This method was used to infer phylogenies from restriction enzyme sites.

  28. break

  29. Some additional remarks regarding MP

  30. Monophyletic groups: Human Chicken Rat Gorilla Chimp When an unrooted tree is given, you cannot know which groups are monophyletic. You can only say which are not. For example, Chicken + Rat might be monophyletic if the root was between Chicken + Rat and the rest. In fact, the real root of the tree is between Chicken and the rest, hence Chicken and rat are not monophyletic. But, Human and Gorilla are not monophyletic no matter where is the root…

  31. HOMOPLASY We have 6 characters. In each species both 0 and 1 are present. The minimum number of changes is 6 (each character must change at least once). The reason we have more than 6 changes is that some characters had arisen more than once. This is called homoplasy.

More Related